ChapterPDF Available

The Modality Effect of Cognitive Load Theory

Authors:

Abstract and Figures

The modality effect, which has been investigated by cognitive load theory, predicts that learning from visualizations supplemented with written text should be less effective than learning from the same visualizations supplemented with comparable spoken text. An explanation of the effect assumes a degree of separation between the processing of visuospatial and auditory information. Due to this separability, learning only from visuospatial information (visualizations and visual text) is more likely to overload visuospatial processing, as compared to learning from visuospatial and auditory information (visualizations and auditory text), in which both the visuospatial and the auditory processors share the load of the learning material. The aims of this review chapter are to: (a) describe the modality effect, (b) provide supporting evidence using computer multimedia about STEM topics, and (c) describe studies indicating the separability of visuospatial and auditory processing. We finish by suggesting future directions for research on the modality effect.
Content may be subject to copyright.
The Modality Effect of Cognitive Load Theory
Juan C. Castro-Alonso
1(&)
and John Sweller
2
1
Center for Advanced Research in Education,
Universidad de Chile, Santiago, Chile
jccastro@ciae.uchile.cl
2
School of Education, University of New South Wales, Sydney, Australia
j.sweller@unsw.edu.au
Abstract. The modality effect, which has been investigated by cognitive load
theory, predicts that learning from visualizations supplemented with written text
should be less effective than learning from the same visualizations supplemented
with comparable spoken text. An explanation of the effect assumes a degree of
separation between the processing of visuospatial and auditory information. Due
to this separability, learning only from visuospatial information (visualizations
and visual text) is more likely to overload visuospatial processing, as compared
to learning from visuospatial and auditory information (visualizations and
auditory text), in which both the visuospatial and the auditory processors share
the load of the learning material. The aims of this review chapter are to:
(a) describe the modality effect, (b) provide supporting evidence using computer
multimedia about STEM topics, and (c) describe studies indicating the separa-
bility of visuospatial and auditory processing. We nish by suggesting future
directions for research on the modality effect.
Keywords: Modality effect or modality principle Cognitive load theory
STEM education Multicomponent working memory Multimedia learning
1 Introduction
For over 30 years, cognitive load theory [1] and the related cognitive theory of mul-
timedia learning [2] have published several educational principles based on randomized
experiments with control of variables. Under these strict methodological conditions,
many instructional principles or effects have been investigated. We focus on one of
these principles, the modality effect or modality principle [e.g., 3], which guides the
design of multimedia instructional resources that combine visualizations (e.g. anima-
tions, videos, photos, diagrams) and texts (e.g., on-screen, printed, narrated). The
modality effect occurs when multimedia that depict visualizations associated with
written text is less effective for learning than multimedia that depict the same visual-
izations associated with comparable narrated text [4].
Cognitive load theory explains the modality effect by building on Baddeley and
Hitchs multicomponent model of working memory [see 5]. In this model, working
memory includes two systems with limited capacity: (a) the phonological loop,
managing the processing of auditory information, and (b) the visuospatial sketch pad,
dealing with visual and spatial information. The multicomponent model indicates that
©Springer Nature Switzerland AG 2020
W. Karwowski et al. (Eds.): AHFE 2019, AISC 963, pp. 7584, 2020.
https://doi.org/10.1007/978-3-030-20135-7_7
jccastro@ciae.uchile.cl
auditory and visuospatial information to a substantial degree tend to be processed
separately in these limited systems [cf. 6]. Due to this separability, learning only from
visuospatial information (images supplemented with written texts) is more likely to
overload the limited visuospatial sketch pad, as compared to learning from visuospatial
and auditory information (images supplemented with spoken text), in which both the
visuospatial sketch pad and the phonological loop share the cognitive load of the
learning material.
Hence, the modality effect is produced when an overloaded system does not have
enough capacity to deal with learning, as compared to two less overloaded systems. For
example, Fig. 1shows two multimedia formats to teach the shape of bacteria. The
format on the left (Fig. 1a) supplements the images with spoken text, whereas the
design on the right (Fig. 1b) supplements the images with written text, so it is more
likely to overload the visuospatial processor. (Note that cognitive load theory usually
investigates more complex educational materials, likely to overload the visuospatial or
auditory systems [see 7], rather than the simple example we are providing).
From the diverse materials and topics where the modality effect has been investi-
gated, in the next section we provide examples of supporting results on computer
multimedia dealing with STEM (science, technology, engineering, and mathematics)
topics.
Fig. 1. Example of formats that ause both the visuospatial sketch pad and the phonological loop
(with the shapes spoken rather than written), and buse solely the visuospatial sketch pad (at the
risk of overloading it).
76 J. C. Castro-Alonso and J. Sweller
jccastro@ciae.uchile.cl
2 STEM Multimedia Evidence for the Modality Effect
In a meta-analysis of the modality effect [8], in which 43 effect sizes and more than
1,900 participants were investigated, there was an overall effect size of d= 0.72.
According to benchmarks for behavioral sciences [9], this size corresponds to a medium
to large effect. Notably, when the meta-analysis [8] compared different instructional
disciplines, a larger effect size of d= 1.20 was reported for the science domains.
Additional evidence for the modality effect using STEM multimedia has included a
variety of participants and learning content. For example, Experiment 1 reported in [10]
compared the instructional effectiveness of visual vs. auditory explanations added to
meteorology multimedia. In the study, where university students had to learn lightning
formation, it was observed that those given auditory texts outperformed those shown
the texts on-screen. Also, in [11], Experiment 1 assessed rst-year apprentices learning
to read a fusion diagram for soldering. Results showed that a group receiving the
animated visuals supplemented with auditory explanations showed higher test scores
and lower self-ratings of cognitive load than a group given animated visuals supple-
mented with written text explanations. In an experiment where university participants
studied lightning formation from different animations [12, Experiment 1], results
showed that animations with narrations obtained signicantly higher retention and
transfer scores than animations with on-screen texts.
Concerning biological sciences, in [13] the authors investigated university students
learning about sh movements. For both static and animated displays, it was observed
that narrated multimedia outperformed written multimedia in tests of retention and
transfer. Also, Experiment 2 in [14] reported university participants who studied
through computer static pictures and texts describing the structure and function of an
enzyme. Randomly, half of the sample studied from the pictures supplemented with
narrations and the other half received supplementary written texts. Results showed
higher recall scores for the group studying with images and narrations. A similar
direction of effects for the modality effect, although non-signicant, was observed for
comprehension and transfer tests. Lastly, [15] investigated the capacity of university
students to learn a health science rst-aid procedure using two animation formats. Half
of the participants were randomly assigned to animations and written texts (subtitles
below the depiction), and the other half watched the animations with narrated texts.
Results of the behavioral performance test showed that the narrated versions outper-
formed the written text formats.
Thus, there are diverse studies using STEM multimedia learning supporting the
modality effect. As described above, the effect assumes that working memory can be at
least partially separated into different processors. Evidence for this separability can help
predict the effectiveness of the modality effect under different learning conditions.
Next, we describe two areas that have supported the separation of processing between
visuospatial and auditory information.
The Modality Effect of Cognitive Load Theory 77
jccastro@ciae.uchile.cl
3 Working Memory Separability that Allows the Modality
Effect
As reviewed in [16], there are at least two research areas that show evidence for the
separability between the visuospatial and auditory subcomponents of working memory,
which will be termed here: (a) selective interference, and (b) modality organization.
Concerning selective interference, this research shows selective impairment in pro-
cessing visuospatial information when receiving additional visuospatial information
(but not when receiving extra auditory information), and selective impairment in
processing auditory information when receiving additional auditory information (but
not when receiving extra visuospatial information).
In a classic example of selective interference [17], male university students spoke
out letters while memorizing other letters. The information to be memorized was
presented either in an auditory or visual modality. It was observed that visually shown
letters were remembered for a longer period, because, in contrast to auditorily shown
letters, they were not interfered by the auditory speaking aloud process. In an example
from multimedia research [18], Experiment 1 tested female education undergraduates
learning about the cardiovascular system from a multimedia module. While learning
the contents, students were required to respond as rapidly as possible to a change in
color of an on-screen element. Showing selective interference, students were slower in
responding to the visual change when the multimedia included on-screen text, as
compared to narrated multimedia. In other words, students were more impaired in the
color visuospatial tasks by the visual text than by the auditory text of the multimedia.
Similar multimedia are shown in Fig. 2, which depicts information based on the design
in [18]. In these replicas, the on-screen text is expected to produce a slower students
response when the star at the top-right changes color from purple to green (Fig. 2a),
compared to the response when there is a narration (Fig. 2b).
Fig. 2. With on-screen text athe response to the star changing color should be slower than with
auditory text b.
78 J. C. Castro-Alonso and J. Sweller
jccastro@ciae.uchile.cl
As another example, in two dual-task experiments investigating visuospatial pro-
cessing [19], undergraduates were asked to generate and rotate mental images while
executing other simultaneous tasks. Results showed that the simultaneous task of
speaking a word (auditory task) did not interfere with the visuospatial tasks. In contrast,
the simultaneous task of localizing the source of a sound (spatial task) was deleterious
to processing the mental images.
The investigations by Robinson and colleagues are also part of the selective
interference research. The authors observed more interference between visuospatial and
verbal processing when the verbal information was presented in visuospatial cong-
urations [cf. 6]. For example, in [20] they reported four experiments with university
students learning zoology categories from different verbal displays, including less
visuospatial (written paragraphs) and more visuospatial congurations (graphic orga-
nizers and concept maps). To produce processing interference, the students were also
given verbal and visual working memory tasks. As predicted, test scores on zoology
information memorized from graphic organizers or concept maps (high in visuospatial
information) were lower when attempting a visual working memory task, as compared
to a verbal working memory task. In contrast, these visuospatial interferences were not
observed when the information was memorized from paragraphs, which relied less on
visuospatial organization than graphic organizers and concept maps. In a follow-up
with stricter controls [21], the ndings were replicated.
To exemplify these experiments, Fig. 3provides different visuospatial congura-
tions for texts about two penguin species. The facts comparing the species are given
either as paragraphs (Fig. 3a), as a graphic organizer (Fig. 3b), or as a concept map
(Fig. 3c). It can be predicted that paragraphs will be less affected by visuospatial
interference than the graphic organizer or the concept map. In short, these investiga-
tions of selective interference support the hypothesis that visuospatial information
(including texts presented in visuospatial congurations, such as concept maps), tends
to be processed separately from verbal information (including texts presented as written
paragraphs).
Fig. 3. Comparable texts congured as aparagraphs, bgraphic organizer, and cconcept map.
The Modality Effect of Cognitive Load Theory 79
jccastro@ciae.uchile.cl
In addition to selective interference investigations, another area that supports the
separability of processing visuospatial and auditory information is modality organi-
zation. This research shows that, when retrieving elements from memory, the modality
of the memorized items is pervasive and is even stronger than semantic or other types
of grouping categories. As a classic example, in [22] the author employed an original
strategy to present sets of four simultaneous words to university students. Two words
were shown visually, and the two remaining were presented auditorily (one word per
ear). Students had to report the four words from memory. The results showed that
participants reported the words in blocks of the same modality (i.e., auditory &
auditory, visual & visual), even though the four stimuli were simultaneous. In a follow-
up study, it was observed that this modality order was stronger than an associative or
semantic order between pairs of words of different modalities. For example, when a
pair such as girl (auditory) & boy (visual) was presented as stimuli, the association
girlboy was weaker than the modality of each word. The words were always reported
in modality-determined blocks, even if this order broke the associations between word
pairs. Figure 4shows examples of stimuli and responses when the modality of the
stimuli was congruent with the semantic association of the word pairs (Fig. 4a, girl and
boy in auditory modality) or when it was non-congruent (Fig. 4b, boy and day in visual
modality).
Another study related to modality organization [23], investigated the response times
of undergraduates receiving repeated and novel words, presented either visually or
auditorily. As expected, results showed a faster response time when repeated words
were presented (same word in studying and test times), compared to novel words.
Notably, the effect was larger when the same modality was used in studying and test
times, compared to cross modality results. These ndings indicate that processing the
Fig. 4. The modality organization shown in stimuli and response time, when the modality of the
stimuli is either congruent a, or non-congruent bwith the semantic association.
80 J. C. Castro-Alonso and J. Sweller
jccastro@ciae.uchile.cl
modality of a word occurs earlier than processing its meaning. Analogous ndings with
numerical stimuli were reported in [24], regarding an experiment with university stu-
dents. The students received a series of interspersed auditory and visual one-digit
numbers. It was observed that participants who reported the digits grouped according to
modality outperformed those grouping the digits according to their presentation order
and mixing the modalities. In other words, it was more efcient to memorize the two
modalities in parallel rather than intermixing them.
In conclusion, modality organization research supports that items presented visu-
ospatially or auditorily tend to be remembered attached to the modality in which they
were presented. Thus, both modality organization and selective interference investi-
gations support the suggestion that visuospatial processing in working memory is at
least to some extent independent from auditory processing. As described next, future
directions for research in these areas of investigation may inform cognitive load theory
and the modality effect.
4 Future Directions for Research
A possible direction for further study of the modality effect concerns different designs
of visualizations and texts. For example, under visualization conditions that demand
more working memory, such as transient animations [25], the modality effect should be
larger than under less demanding conditions. The design of the auditory text can also
be important, as voice narrations can be more effective than machine narrations [26].
In addition, selective interference and modality organization research may inform
more ne-grained analyses to investigate the modality effect. For example, selective
interference can include the time factor, as giving a space or lapse of time can be effective
to allow working memory to replenish resources and avoid interference [cf. 27].
Also, there are current investigations relating the modality effect to other effects of
cognitive load theory or the cognitive theory of multimedia learning [see 4]. For
example, links have been established with the redundancy effect [see 28], the expertise
reversal effect [see 29], the transient information effect [see 30], and the signaling
principle [see 31].
For example, if the narration merely reiterates information in the visualization, this
will likely produce a negative redundancy effect [28] instead of a positive modality
effect. Also, related to the redundancy [28] and expertise reversal effects [7,29], if an
on-screen text contains information already known by the students, then it is more
appropriate to discard it rather than to present it as narration. Similarly, boundary
conditions for the modality effect entail the integration of ndings from the modality
effect (supporting narrated texts) and the transient information effect (discouraging
transient narrated texts) [e.g., 32].
Regarding the signaling principle [31], sometimes a short on-screen text that signals
important visual information can be an effective learning asset. For example, in [15],
short labels placed at relevant areas of the screen had positive effects on learning.
Future research should investigate the most appropriate length, format, and placement
to produce these positive signaling effects of texts, while avoiding a cognitive overload
of the visuospatial processor.
The Modality Effect of Cognitive Load Theory 81
jccastro@ciae.uchile.cl
Lastly, there are known factors that inuence multimedia learning, which may also
affect the modality effect, such as gender [33] and visuospatial processing or spatial
ability [34].
5 Conclusion
The modality effect has been investigated by cognitive load theory and the cognitive
theory of multimedia learning. The effect has shown that learning from visualizations
supplemented with written text is less effective than learning from the same visual-
izations supplemented with comparable narrations. These ndings assume that there is
a degree of separation in the working memory processing of visuospatial and auditory
information, and that written text will tend to overload the visuospatial processor more
than auditory text, because written text will be processed simultaneously with the
visual learning elements. As auditory text removes the load from the visuospatial
processor and onto the auditory processor, it leaves more capacity for the visuospatial
processor to deal with the visual learning elements. The modality effect has been
supported by diverse educational multimedia depicting STEM concepts. Similarly, the
foundation of the effect, that is the separability between visuospatial and auditory
processing, has also been systematically investigated. We described two research areas
showing this separability, namely, selective interference and modality organization.
Future directions for research, including boundary conditions, will provide additional
support for the modality effect and its relationship with other ndings of cognitive load
theory and the derived cognitive theory of multimedia learning.
Acknowledgments. Support from PIACONICYT Basal Funds for Centers of Excellence
Project FB0003, and CONICYT Fondecyt 11180255, is gratefully acknowledged. We thank
Ignacio Jarabran for helping with the illustrations.
References
1. Sweller, J., Ayres, P., Kalyuga, S.: Cognitive Load Theory. Springer, New York (2011)
2. Mayer, R.E. (ed.): The Cambridge Handbook of Multimedia Learning, 2nd edn. Cambridge
University Press, New York (2014)
3. Mousavi, S.Y., Low, R., Sweller, J.: Reducing cognitive load by mixing auditory and visual
presentation modes. J. Educ. Psychol. 87(2), 319334 (1995)
4. Low, R., Sweller, J.: The modality principle in multimedia learning. In: Mayer, R.E. (ed.)
The Cambridge Handbook of Multimedia Learning, 2nd edn., pp. 227246. Cambridge
University Press, New York (2014)
5. Baddeley, A.: Working memory. Science 255(5044), 556559 (1992)
6. Clark, J., Paivio, A.: Dual coding theory and education. Educ. Psychol. Rev. 3(3), 149210
(1991)
7. Chen, O., Kalyuga, S., Sweller, J.: The expertise reversal effect is a variant of the more
general element interactivity effect. Educ. Psychol. Rev. 29(2), 393405 (2017)
8. Ginns, P.: Meta-analysis of the modality effect. Learn. Instr. 15(4), 313331 (2005)
9. Cohen, J.: Statistical Power Analysis for the Behavioral Sciences. Erlbaum, Hillsdale (1988)
82 J. C. Castro-Alonso and J. Sweller
jccastro@ciae.uchile.cl
10. Schmidt-Weigand, F., Kohnert, A., Glowalla, U.: A closer look at split visual attention in
system- and self-paced instruction in multimedia learning. Learn. Instr. 20(2), 100110
(2010)
11. Kalyuga, S., Chandler, P., Sweller, J.: Managing split-attention and redundancy in
multimedia instruction. Appl. Cogn. Psychol. 13(4), 351371 (1999)
12. Moreno, R., Mayer, R.E.: Cognitive principles of multimedia learning: the role of modality
and contiguity. J. Educ. Psychol. 91(2), 358368 (1999)
13. Kühl, T., Scheiter, K., Gerjets, P., Edelmann, J.: The inuence of text modality on learning
with static and dynamic visualizations. Comput. Hum. Behav. 27(1), 2935 (2011)
14. Seufert, T., Schütze, M., Brünken, R.: Memory characteristics and modality in multimedia
learning: an aptitude-treatment-interaction study. Learn. Instr. 19(1), 2842 (2009)
15. de Koning, B.B., van Hooijdonk, C.M.J., Lagerwerf, L.: Verbal redundancy in a procedural
animation: on-screen labels improve retention but not behavioral performance. Comput.
Educ. 107,4553 (2017)
16. Penney, C.G.: Modality effects and the structure of short-term verbal memory. Mem. Cogn.
17(4), 398422 (1989)
17. Kroll, N.E.A., Parks, T., Parkinson, S.R., Bieber, S.L., Johnson, A.L.: Short-term memory
while shadowing: recall of visually and of aurally presented letters. J. Exp. Psychol. 85(2),
220224 (1970)
18. Brünken, R., Steinbacher, S., Plass, J.L., Leutner, D.: Assessment of cognitive load in
multimedia learning using dual-task methodology. Exp. Psychol. 49(2), 109119 (2002)
19. Bruyer, R., Scailquin, J.-C.: The visuospatial sketchpad for mental images: testing the
multicomponent model of working memory. Acta Psychol. 98(1), 1736 (1998)
20. Robinson, D.H., Katayama, A.D., Fan, A.-C.: Evidence for conjoint retention of information
encoded from spatial adjunct displays. Contemp. Educ. Psychol. 21(3), 221239 (1996)
21. Robinson, D.H., Robinson, S.L., Katayama, A.D.: When words are represented in memory
like pictures: evidence for spatial encoding of study materials. Contemp. Educ. Psychol. 24
(1), 3854 (1999)
22. Murdock Jr., B.B.: Four-channel effects in short-term memory. Psychon. Sci. 24(4), 197
198 (1971)
23. Kirsner, K., Smith, M.C.: Modality effects in word identication. Mem. Cogn. 2(4), 637640
(1974)
24. Penney, C.G.: Order of report in bisensory verbal short-term memory. Can. J. Psychol. 34
(2), 190195 (1980)
25. Castro-Alonso, J.C., Ayres, P., Wong, M., Paas, F.: Learning symbols from permanent and
transient visual presentations: dont overplay the hand. Comput. Educ. 116,113 (2018)
26. Mayer, R.E., DaPra, C.S.: An embodiment effect in computer-based learning with animated
pedagogical agents. J. Exp. Psychol. Appl. 18(3), 239252 (2012)
27. Chen, O., Castro-Alonso, J.C., Paas, F., Sweller, J.: Extending cognitive load theory to
incorporate working memory resource depletion: evidence from the spacing effect. Educ.
Psychol. Rev. 30(2), 483501 (2018)
28. Kalyuga, S., Sweller, J.: The redundancy principle in multimedia learning. In: Mayer, R.E.
(ed.) The Cambridge Handbook of Multimedia Learning, 2nd edn., pp. 247262. Cambridge
University Press, New York (2014)
29. Kalyuga, S., Ayres, P., Chandler, P., Sweller, J.: The expertise reversal effect. Educ. Psychol.
38(1), 2331 (2003)
30. Ayres, P., Paas, F.: Making instructional animations more effective: a cognitive load
approach. Appl. Cogn. Psychol. 21(6), 695700 (2007)
The Modality Effect of Cognitive Load Theory 83
jccastro@ciae.uchile.cl
31. van Gog, T.: The signaling (or cueing) principle in multimedia learning. In: Mayer, R.E.
(ed.) The Cambridge Handbook of Multimedia Learning, 2nd edn., pp. 263278. Cambridge
University Press, New York (2014)
32. Leahy, W., Sweller, J.: Cognitive load theory, modality of presentation and the transient
information effect. Appl. Cogn. Psychol. 25(6), 943951 (2011)
33. Castro-Alonso, J.C., Wong, M., Adesope, O.O., Ayres, P., Paas, F.: Gender imbalance in
instructional dynamic versus static visualizations: a meta-analysis. Educ. Psychol. Rev.
(2019). Advance Online Publication
34. Castro-Alonso, J.C., Uttal, D.H.: Spatial ability for university biology education. In: Nazir,
S., Teperi, A.-M., Polak-Sopińska, A. (eds.) Advances in Human Factors in Training,
Education, and Learning Sciences: Proceedings of the AHFE 2018 International Conference
on Human Factors in Training, Education, and Learning Sciences, pp. 283291. Springer
(2019)
84 J. C. Castro-Alonso and J. Sweller
jccastro@ciae.uchile.cl
... In the humanities, where learning often involves abstract reasoning and interpretive analysis, segmenting content and reducing extraneous complexity has been shown to enhance comprehension and engagement (Sweller, 2020). For neurodiverse learners, CLT-aligned strategies-such as multimodal content presentation and reduction in redundant information-can facilitate access and reduce unnecessary processing demands (Castro-Alonso & Sweller, 2020;Le Cunff et al., 2024). ...
Article
Cognitive Load Theory (CLT) and the Cognitive Theory of Multimedia Learning (CTML) have long served as foundational frameworks in instructional design. However, their applicability to contemporary, technologically mediated learning environments remains under-theorized. This review critically examines CLT and CTML, focusing on their assumptions, empirical contributions, and current limitations in addressing the complexities of dynamic, AI-enhanced educational settings. The discussion is further enriched through engagement with complementary perspectives, including self-regulated learning, dual process theory, and connectivism. These frameworks illuminate conceptual convergences but also expose theoretical tensions, particularly regarding unresolved constructs such as germane cognitive load and the methodological challenges associated with real-time cognitive load measurement. In response to these gaps, this paper proposes the Cognitive Load-Aware Modulation (CLAM) strategy—a conceptual model designed to extend cognitive load principles in adaptive, ethically responsive learning environments. Synthesizing insights from cognitive psychology, educational technology, and affective computing, CLAM supports the design of personalized, data-driven instructional systems attuned to learners’ cognitive and emotional states. The model emerges not merely as a theoretical contribution, but as a future-oriented framework rooted in the critical synthesis of the reviewed literature. Its practical applications for real-world educational settings are outlined, and its empirical validation constitutes the next phase of our ongoing research project.
... Another one is the split-attention effect (Arevalo-Mercado et al., 2023;de Koning, 2024;S. Zhang et al., 2022), which emphasizes the importance of integrating related information sources; and the modality effect (Cao et al., 2009;Castro-Alonso & Sweller, 2020;Leahy & Sweller, 2011) which leverages the benefits of presenting information through multiple sensory channels. These approaches have demonstrated efficacy across diverse educational contexts, including complex professional training environments. ...
Article
Full-text available
Junior Secondary School (JSS) or middle school education is peculiar as it involves the introduction of a wide array of subjects across the sciences, arts, humanities, business and vocational fields to young learners. This situation can be ovewhelming, resulting in high cognitive load (CL), with consequent poor learning outcomes, and other negative issues including high dropout rates, requiring urgent attention. AI tools have been explored for addressing multiple learning issues. AI chatbots are particularly useful based on their ability to support individualized learning, pre-training, and other concepts that can facilitate CL management. This study evaluated the impact of an AI-based chatbot system for reducing students' CL and improving learning outcomes, attitude and retention among JSS students. A quasi-experimental study with 120 students was conducted over an 8 week period with 24 learning sessions. The experimental group (N=60) learnt using 'iLearnTech', an AI Chatbot developed specifically for the study. The control group (N=60) learnt through the traditional approach with no chatbot. Learning content was based on the JSS Basic Technology education, a precursor to TVET. Data was collected using the Cognitive Load Measure, the Basic Technology Achievement Test, Students' Attitude Survey (SAS), and Students' Retention Test. The experimental group exhibited huge reductions in CL and corresponding improvements in learning outcomes, attitude and retention. The results also confirmed known relationship between the dependent variables and highlights the potential of AI powered educational tools for addressing diverse educational issues including promoting equitable access, and sustainable education in developing nations and resource-constrained environments. This work contributes to ongoing discussions on AI applications in education. Its novelty lies in its exploration of AI technology in addressing CL issues in the context of junior secondary education. Implications for educational policy and practice, particularly curriculum design and e-learning integration are highlighted.
... An analysis of Alexa use among older adults in long-term care suggest that nearly 22% of the interactions with Alexa relate to well-being, as residents get to know Alexa, ask for advice, and call or message others [35]. Higher levels of internet use among older adults can predict higher levels of social support, reduced feelings of loneliness, and better psychological well-being and life satisfaction [36]. Thus, internet-based interactive communication technologies (ICTs) may increase perceived social support and decrease loneliness [24,26]. ...
Article
Full-text available
This study examines the potential of AI-powered personal voice assistants (PVAs) in reducing loneliness and increasing social support among older adults. With the aging population rapidly expanding, innovative solutions are essential. Prior research has indicated the effectiveness of various interactive communication technologies (ICTs) in mitigating loneliness, but studies focusing on PVAs, particularly considering their modality (audio vs. video), are limited. This research aims to fill this gap by evaluating how voice assistants, in both audio and video formats, influence perceived loneliness and social support. This study examined the impact of voice assistant technology (VAT) interventions, both audio-based (A-VAT) and video-based (V-VAT), on perceived loneliness and social support among 34 older adults living alone. Over three months, participants engaged with Amazon Alexa™ PVA through daily routines for at least 30 min. Using a hybrid natural language processing framework, interactions were analyzed. The results showed reductions in loneliness (Z = −2.99, p < 0.01; pre-study loneliness mean = 1.85, SD = 0.61; post-study loneliness mean = 1.65, SD = 0.57), increases in social support post intervention (Z = −2.23, p < 0.05; pre-study social support mean = 5.44, SD = 1.05; post-study loneliness mean = 5.65, SD = 1.20), and a correlation between increased social support and loneliness reduction when the two conditions are combined (ρ = −0.39, p < 0.05). In addition, V-VAT was more effective than A-VAT in reducing loneliness (U = 85.50, p < 0.05) and increasing social support (U = 95, p < 0.05). However, no significant correlation between changes in perceived social support and changes in perceived loneliness was observed in either intervention condition (V-VAT condition: ρ = −0.24, p = 0.37; A-VAT condition: ρ = −0.46, p = 0.06). This study’s findings could significantly contribute to developing targeted interventions for improving the well-being of aging adults, addressing a critical global issue.
Article
Full-text available
Background. The rapid integration of digital tools in education has transformed classroom environments, creating new opportunities and challenges for instructional design. One key area of focus is the management of cognitive load, which refers to the mental effort required to process information during learning. Cognitive Load Theory (CLT) offers insights into how instructional materials can be optimized to improve learning outcomes. In digital classrooms, the effective design of instructional content becomes even more critical due to the increased multimedia elements and potential for cognitive overload Purpose. This study aims to explore the implications of Cognitive Load Theory (CLT) for instructional design in digital classrooms. It examines how digital tools, such as multimedia content and interactive activities, impact learners' cognitive load and suggests strategies for reducing extraneous cognitive load to enhance learning efficiency and effectiveness. Method. A mixed-methods approach was used, combining quantitative surveys to assess students' cognitive load during digital learning activities and qualitative interviews with instructors to understand their perspectives on instructional design challenges. The study was conducted across several digital learning environments in higher education. Results. The findings indicate that digital learning environments often lead to high cognitive load, particularly when multimedia content is poorly integrated. However, using principles from CLT, such as segmenting information and reducing unnecessary complexity, can significantly lower cognitive load and improve student learning outcomes. Both students and instructors reported that well-designed digital content led to better engagement and more efficient learning. Conclusion. The study concludes that applying Cognitive Load Theory to instructional design in digital classrooms can enhance learning by minimizing cognitive overload. Educators should be mindful of cognitive load when creating digital learning experiences to improve student performance and engagement.
Article
Full-text available
Intonation is an essential skill to master when learning a musical instrument, especially one with variable intonation, such as a brass or bowed string instrument. A common problem is that beginner students often struggle with the intonation of these instruments during their learning process. As such, they tend to dedicate a large amount of time to practice, while the teaching staff spend a lot of time correcting related errors. This validation study is part of a research project of a broader scope that has designed and implemented a piece of software called Plectrus, for the real-time assessment of intonation in bowed string and brass instrument learning. To understand whether this software met the detected educational needs, a statistical evaluation study was conducted using a questionnaire filled out by teachers from music schools and conservatories in Spain and Chile (n=41). The questionnaire included three dimensions of assessment (technical, educational, and overall). Dimensions on the use of digital educational technology, professional experience, gender, age, and professional training acted as covariates to analyze their influence on the dimensions of assessment. The quantitative results were triangulated with qualitative data collected from two focus groups (n=10) with the purpose of ensuring the credibility of the interpretation of the results. The results indicate that Plectrus was received very positively by teaching staff in the three dimensions of assessment; positive correlations between the three dimensions were also found, as well as a positive correlation between the use of educational digital technology and the evaluation of the software. The results of this evaluation of the Plectrus software support its effectiveness and usefulness for the development of intonation in brass and bowed string instruments at the initial level of musical studies.
Thesis
Case management as an approach requires professionals and users to work together. This in turn requires users to be made aware of the options, requirements and responsibilities involved, as well as their owns rights. This research project therefore sees information as an essential step towards empowerment. Informed users are better able to express their needs and expectations of the case management approach. For this to happen, however, comprehensible information about the concept is essential. Therefore, the aim of this research project is to develop information material on case management with comprehensible content. This should contribute to increasing knowledge of case management among (potential) users. Thus, potential users will be able to make an informed decision to participate in the case management process, while users will be able to reflect on their experiences with the process. The guiding question of this project is, how to develop information-and reflection instruments to foster exploration of case management experiences. This research project represents the concept of case management in several different forms: In addition to a text-based approach, visual and interactive methods were chosen to explain the concept. Furthermore, the project sought to develop case management information materials in a participatory manner. These materials would then be discussed with (potential) users of case management and serve as a stimulus for reflection on the professional implementation of case management. In the course of the research process, people with disabilities, juveniles and people with a migration background were selected as target groups for the information materials. Together with experts and practitioners, they were involved in the participatory research process. The results of this work confirm the validity of a multifaceted approach to the research question. Information texts in simple language, explanatory videos and a workshop concept were developed. The analysis and reflection of the findings should serve as a starting point for the development of applicable and target-group appropriate (multimedia) information material in the future. In addition to discussing the practical implications of the research findings, an outlook on possible future research topics is also provided. --------------------LINK: https://permalink.obvsg.at/AC16865651 -------- Poster and Animation: https://research.fhstp.ac.at/projekte/erfahrungen-von-case-management-nutzer-innen-fuer-theorie-und-praxis-nutzbar-machen
Article
Full-text available
This systematic review with narrative synthesis aimed to examine the available evidence on the relationship between neurodiversity and cognitive load in online learning. Despite the known relationship between working memory impairments and neurodiversity, there has been a lack of systematic investigation into how cognitive load impacts neurodivergent students in online learning environments. This review, which includes 90 studies conducted in 21 countries, reveals that the majority (92%) did not consider neurodiversity as a potential factor influencing cognitive load in online learning. The few that did suggest distinct patterns of cognitive load in online learning for neurodivergent students, suggesting a need for more research in this area. The results also point to the potential for methodological advancements in measuring cognitive load in online learning, highlighting that 80% of studies with only neurotypical participants and 60% of studies including neurodivergent participants relied solely on subjective measures of cognitive load. This review contributes to the field by offering a comprehensive overview of our current knowledge of the relationship between neurodiversity and cognitive load in online learning, identifying significant gaps and suggesting directions for future studies.
Article
Full-text available
Studies comparing the instructional effectiveness of dynamic versus static visualizations have produced mixed results. In this work, we investigated whether gender imbalance in the participant samples of these studies may have contributed to the mixed results. We conducted a meta-analysis of randomized experiments in which groups of students learning through dynamic visualizations were compared to groups receiving static visualizations. Our sample focused on tasks that could be categorized as either biologically secondary tasks (science, technology, engineering, and mathematics: STEM) or biologically primary tasks (manipulative–procedural). The meta-analysis of 46 studies (82 effect sizes and 5474 participants) revealed an overall small-sized effect (g+ = 0.23) showing that dynamic visualizations were more effective than static visualizations. Regarding potential moderators, we observed that gender was influential: the dynamic visualizations were more effective on samples with less females and more males (g+ = 0.36). We also observed that educational level, learning domain, media compared, and reporting reliability measures moderated the results. We concluded that because many visualization studies have used samples with a gender imbalance, this may be a significant factor in explaining why instructional dynamic and static visualizations seem to vary in their effectiveness. Our findings also support considering the gender variable in research about cognitive load theory and instructional visualizations.
Chapter
Full-text available
Studying and pursuing careers of Science, Technology, Engineering, and Mathematics (STEM) fields demand spatial ability. Completing a university degree in biology is no exception. The aim of this study is to summarize key findings showing that there is a two-way relation between university biology education and spatial ability. The first aspect of this relation is the most investigated: spatial ability facilitates learning biology. However, the other aspect is also possible: learning biology may improve spatial ability. We present empirical evidence to support both possibilities. The focus is on university biology, and the spatial abilities of mental rotation and mental folding (spatial visualization). We present findings showing that these spatial abilities affect university biology learning and achievement from textual and visual materials. We also present correlational studies and experiments showing that university biology learning positively affects mental rotation and mental folding.
Article
Full-text available
Depletion of limited working memory resources may occur following extensive mental effort resulting in decreased performance compared to conditions requiring less extensive mental effort. This “depletion effect” can be incorporated into cognitive load theory that is concerned with using the properties of human cognitive architecture, especially working memory, when designing instruction. Two experiments were carried out on the spacing effect that occurs when learning that is spaced by temporal gaps between learning episodes is superior to identical, massed learning with no gaps between learning episodes. Using primary school students learning mathematics, it was found that students obtained lower scores on a working memory capacity test (Experiments 1 and 2) and higher ratings of cognitive load (Experiment 2) after massed than after spaced practice. The reduction in working memory capacity may be attributed to working memory resource depletion following the relatively prolonged mental effort associated with massed compared to spaced practice. An expansion of cognitive load theory to incorporate working memory resource depletion along with instructional design implications, including the spacing effect, is discussed.
Article
Full-text available
Within the framework of cognitive load theory, the element interactivity and the expertise reversal effects usually are not treated as closely related effects. We argue that the two effects may be intertwined with the expertise reversal effect constituting a particular example of the element interactivity effect. Specifically, the element interactivity effect relies on changes in element interactivity due to changes in the type of material being learned, while the expertise reversal effect also relies on changes in relative levels of element interactivity but in this case, due to changes in relative levels of expertise. If so, both effects rely on equivalent changes in element interactivity with the changes induced by different factors. Empirical evidence is used to support this contention.
Article
Cognitive Load Theory John Sweller, Paul Ayres, Slava Kalyuga Effective instructional design depends on the close study of human cognitive architecture—the processes and structures that allow people to acquire and use knowledge. Without this background, we might recognize that a teaching strategy is successful, but have no understanding as to why it works, or how it might be improved. Cognitive Load Theory offers a novel, evolutionary-based perspective on the cognitive architecture that informs instructional design. By conceptualizing biological evolution as an information processing system and relating it to human cognitive processes, cognitive load theory bypasses many core assumptions of traditional learning theories. Its focus on the aspects of human cognitive architecture that are relevant to learning and instruction (particularly regarding the functions of long-term and working memory) puts the emphasis on domain-specific rather than general learning, resulting in a clearer understanding of educational design and a basis for more effective instructional methods. Coverage includes: • The analogy between evolution by natural selection and human cognition. • Categories of cognitive load and their interactions in learning. • Strategies for measuring cognitive load. • Cognitive load effects and how they lead to educational innovation. • Instructional design principles resulting from cognitive load theory. Academics, researchers, instructional designers, cognitive and educational psychologists, and students of cognition and education, especially those concerned with education technology, will look to Cognitive Load Theory as a vital addition to their libraries.
Article
Instructional dynamic pictures (animations and videos) contain transient visual information. Consequently, when learning from dynamic pictures, students must process in working memory the current images while trying to remember the images that left the screen. This additional activity in working memory may lead dynamic pictures to be less suitable instructional materials than comparable static pictures, which are more permanent. In order to directly show the influence of transient visual information on dynamic learning environments, we designed a well-matched comparison between a permanent and a transient presentation of an abstract-symbol memory task on the computer. In the task, 104 university students (50% females) had to memorize the type, color, and position of the symbols in a rectangular configuration. In addition, an embodied cognition factor was included where the symbols in the task were either shown with a precision grasping static hand or not. We also assessed how individual characteristics (spatial ability, spatial memory span, and gender) influenced performance. Results showed that (a) permanent outperformed transient presentations, (b) observing hands hindered learning, and (c) high spatial ability and high spatial memory span were beneficial, but gender did not affect performance.
Article
Multimedia learning research has shown that presenting the same words as spoken text and as written text to accompany graphical information hinders learning (i.e., redundancy effect). However, recent work showed that a “condensed” form of written text (i.e., on-screen labels) that overlaps with the spoken text, and thus is only partially redundant, can actually foster learning. This study extends this line of research by focusing on the usefulness of on-screen labels in an animation explaining a procedural task (i.e., first-aid procedure). The experiment had a 2 × 2 × 2 between-subject design (N = 129) with the factors spoken text (yes vs. no), written text (yes vs. no), and on-screen labels (yes vs. no). Learning outcomes were measured as retention accuracy and behavioral performance accuracy. Results showed that on-screen labels improved retention accuracy (but not behavioral performance accuracy) of the procedure, especially when presented together with spoken text. So, on-screen labels appear to be promising for learning from procedural animations.
Chapter
The redundancy principle (or redundancy effect) suggests that redundant material interferes with rather than facilitates learning. Redundancy occurs when the same information is presented concurrently in multiple forms or is unnecessarily elaborated. According to cognitive load theory, coordinating redundant information with essential information increases working memory load, which may interfere with learning. Eliminating such redundant information removes the requirement to coordinate multiple sources of information. Accordingly, instructional designs that eliminate redundant material can be superior to those that include redundancy. This chapter summarizes research and theory concerned with the effect of processing redundant information in multimedia learning, a history of research in instructional redundancy, the conditions of applicability of this principle, and its instructional implications.
Chapter
The capacity limitations of working memory are a major impediment when students are required to learn new material. Furthermore, those limitations are relatively inflexible. Nevertheless, there is one technique that can effectively expand working memory capacity. Under certain well-defined conditions, presenting some information in visual mode and other information in auditory mode can expand effective working memory capacity and so reduce the effects of an excessive cognitive load. This effect is called the modality effect or modality principle. It is an instructional principle that can substantially increase learning. This chapter discusses the theory and data that underpin the principle and the instructional implications that flow from the principle.