Conference PaperPDF Available

The Effect of Reading Code Aloud on Comprehension: An Empirical Study with School Students


Abstract and Figures

In recent times, programming is increasingly taught to younger students in schools. While learning programming is known to be difficult, we can lighten the learning experience of this age group by adopting pedagogies that are common to them, but not as common in CS education. One of these pedagogies is Reading Aloud (RA), a familiar strategy when young children and beginners start learning how to read in their natural language. RA is linked with a better comprehension of text for beginner readers. We hypothesize that reading code aloud during introductory lessons will lead to better code comprehension. To this end, we design and execute a controlled experiment with the experimental group participants reading the code aloud during the lessons. The participants are 49 primary school students between 9 and 13 years old, who follow three lessons in programming in Python. The lessons are followed by a comprehension assessment based on Bloom's taxonomy. The results show that the students of the experimental group scored significantly higher in the Remembering-level questions compared to the ones in the control group. There is no significant difference between the two groups in their answers to the Understanding-level questions. Furthermore, the participants in both groups followed some of the instructed vocalizations more frequently such as the variable's assignment (is). Vocalizing the indentation spaces in a for -loop was among the least followed. Our paper suggests that using RA for teaching programming in schools will contribute to improving code comprehension with its effect on syntax remembering.
Content may be subject to copyright.
The Eect of Reading Code Aloud on Comprehension:
An Empirical Study with School Students
Alaaeddin Swidan and Felienne Hermans
Delft University of Technology
Delft, The Netherlands
In recent times, programming is increasingly taught to younger
students in schools. While learning programming is known to be
dicult, we can lighten the learning experience of this age group
by adopting pedagogies that are common to them, but not as com-
mon in CS education. One of these pedagogies is Reading Aloud
(RA), a familiar strategy when young children and beginners start
learning how to read in their natural language. RA is linked with a
better comprehension of text for beginner readers. We hypothesize
that reading code aloud during introductory lessons will lead to
better code comprehension. To this end, we design and execute a
controlled experiment with the experimental group participants
reading the code aloud during the lessons. The participants are 49
primary school students between 9 and 13 years old, who follow
three lessons in programming in Python. The lessons are followed
by a comprehension assessment based on Bloom’s taxonomy. The
results show that the students of the experimental group scored
signicantly higher in the Remembering-level questions compared
to the ones in the control group. There is no signicant dierence
between the two groups in their answers to the Understanding-level
questions. Furthermore, the participants in both groups followed
some of the instructed vocalizations more frequently such as the
variable’s assignment (is). Vocalizing the indentation spaces in a for
-loop was among the least followed. Our paper suggests that using
RA for teaching programming in schools will contribute to improv-
ing code comprehension with its eect on syntax remembering.
Reading Aloud (RA), Programming Education, Primary School,
Bloom’s Taxonomy
ACM Reference Format:
Alaaeddin Swidan and Felienne Hermans. 2019. The Eect of Reading Code
Aloud on Comprehension: An Empirical Study with School Students. In
ACM Global Computing Education Conference 2019 (CompEd ’19), May 17–
19, 2019, Chengdu, Sichuan, China. ACM, New York, NY, USA, 7 pages.
Permission to make digital or hard copies of all or part of this work for personal or
classroom use is granted without fee provided that copies are not made or distributed
for prot or commercial advantage and that copies bear this notice and the full citation
on the rst page. Copyrights for components of this work owned by others than the
author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or
republish, to post on servers or to redistribute to lists, requires prior specic permission
and/or a fee. Request permissions from
CompEd ’19, May 17–19, 2019, Chengdu, Sichuan, China
©2019 Copyright held by the owner/author(s). Publication rights licensed to ACM.
ACM ISBN 978-1-4503-6259-7/19/05.. .$15.00
Programming is increasingly taught to younger students, in some
countries as part of the curriculum of primary and secondary
schools [
]. We know, however, that learning programming is
dicult [
]. The question arises on how do we make learn-
ing programming less dicult for younger students? One way could
be applying pedagogies we know work for this age group but are
uncommon in programming education.
Young children start to learn how to read by learning the connec-
tion between symbols, one or more letters in this case, and sounds
and then combining them into words and sentences. Reading text
aloud is encouraged for beginners since it focuses thoughts, help
memorization and improves comprehension of text [
]. Also
in mathematics, the same approach to reading aloud can be noticed
in vocalizing simple operations and equations, or when introducing
a new symbol [14, 35].
Although in later development stages and adulthood silent read-
ing becomes the norm, our brains seem to be always ready for
reading aloud. Studies have shown that the brain sends signals
to the primary motor cortex, controlling the lips and the mouth,
during silent reading [
]. This brain activity is called subvocal-
ization, which is used in particular when learners face long and new
words. In programming education, educators seem to spend little
eort on reading code aloud to, or with the students. The lack of
this phonology knowledge leaves students with an extra cognitive
load when reading code to understand functionality. In this regard,
one study measured the subvocalization of experienced developers
during programming tasks and showed that the subvocalization sig-
nals could dierentiate the diculty of the programming task [
Therefore, we hypothesize that training students in reading code
aloud will lead them to spend less cognitive eort on the reading
mechanics and thus improve their comprehension of code.
Therefore, the purpose of this paper is a rst quantication of
the eect of reading code aloud during lessons on school students’
comprehension of basic programming concepts. Furthermore, we
investigate how students benet from the practice of Reading Aloud
(RA) by following it as a sort of a guideline later.
To this end, we design and execute a controlled experiment in
which 49 primary school students receive three lessons of program-
ming in Python. The students are divided into two groups which
get the same teaching materials and times. The students in the
experimental group, however, are asked to repeat reading the code
aloud following the instructor. We assess students’ learning based
on Bloom’s taxonomy. Since the participants are absolute beginners
in programming, we focus our assessment on the rst two levels of
the taxonomy: the Remembering-level and the Understanding-level.
In this paper, we answer the following research questions:
Paper Session: Pre-college Part 2
CompEd ’19, May 17–19, 2019, Chengdu, Sichuan, China
What is the eect of reading code aloud on the perfor-
mance of students in the Remembering-level questions?
What is the eect of reading code aloud on the perfor-
mance of students in the understanding-level questions?
How do students follow the vocalization guideline when
they read code later?
Results show that the students in the experimental group scored
signicantly higher in the Remembering-level questions compared
to the students in the control group. There is no signicant dier-
ence between the two groups in their answers to the Understanding-
level questions. The analysis shows that particular code vocaliza-
tions, such as the variable’s assignment, are common among the
two groups. On the other hand, the participants in both groups
least vocalize the spaces needed for indentation in a for loop and
list brackets. The following sections contain the details of the ex-
periment’s design and results.
We provide an overview of research related to Reading Aloud (RA),
particularly, the RA role in reading education for young students
(Section 2.1) and previous literature involving the use of voice
in programming environments (Section 2.2). We also overview
selected prior research on the use of Bloom’s taxonomy in assessing
programming comprehension (Section 2.3).
2.1 RA and Comprehension: Natural Language
Most psychologists nowadays believe that reading is a process of
sounding out words mentally even for skilled readers [
]. Brain
studies [
] show that the primary motor cortex is active
during reading, “presumably because it is involved with mouth
movements used in reading aloud” [
, p. 90]. Therefore, it becomes
highly important for beginner readers to learn the connection be-
tween sounds and symbols, or phonics. Previous research found
that systematic phonics instruction produces higher achievement
for beginning readers, where they can read many more new words
compared to students following other approaches. For these rea-
sons, in the United States, phonics has been included in reading
programs in schools nationwide [
]. As a verbal approach, reading
aloud (RA) helps in focuses thoughts and transforming it in spe-
cic ways, causing changes in cognition [
]. Takeuchi et al. [
highlight that RA is eective for children language development in
“phonological awareness, print concepts, comprehension, and vocab-
ulary” [
]. Bus et al. [
] reports that reading books aloud brings
young children “into touch with story structures and schemes and
literacy conventions which are prerequisites for understanding texts”.
Several experiments related to comprehension report that students
identied the sounding out of words, or loudly repeating text as
a means to regulate their understanding while reading [
When comparing RA to silent reading, research has found that
students comprehend signicantly more information when they
read aloud versus reading silently [
]. Although other studies
showed opposite results [
], there seems a consensus exists among
researchers that the eects of reading aloud may dier based on the
reading prociency of the students: beginning readers, regardless
of age, beneted from reading aloud rather than silently [
Finally, Santoro et al. [
] stress the importance of careful planning
when reading aloud is aimed at improving the comprehension of
students. RA activities, in this case, should be combined with “ex-
plicit comprehension instruction” and “active, engaging in discussions
about the text”.
2.2 The Role of Voice in Programming and CS
One main use of code vocalization is as an assistive technology that
helps programmers who suer from specic disabilities or stress in-
juries (RSI) to program in an ecient matter [
]. Another
area where code vocalization is essentially practiced is the remote
peer-programming [
]. Vocalizing code can also be an element in
some teaching strategies especially the direct instruction, modeling
and think-aloud [
]. However, in all of these cases, the way
in which people vocalize code is not systematic, standardized, or
agreed upon. In addition, there is some ambiguity over what to
vocalize and on what granularity level: tokens, blocks or compi-
lation units [
]. These factors lead to challenges for professional
programmers and learners alike [
]. For example, [
] mention
the problematic issue of how to vocalize symbols, and when to
speak out or leave specic symbols. Price et al. [
] mention the
eect of natural language’s exibility on the diculty of vocalizing
programming commands, as multiple words could be used to do the
same thing (for example begin class or create a class). Another eect
of natural language is the ambiguity of the meaning of some words
in dierent contexts, for example, add value to a variable and add
a method to a class. These challenges show that the use of natural
language in programming needs more attention from programming
designers and educators. Recent work of Hermans et al. [
] calls
for the programming languages to have phonology guidelines that
specify how a construct should be vocalized. Finally, related is the
work of Parnin [
] who investigated the role of subvocalization
on code comprehension. Subvocalization is the process of the brain
sending electrical signals to the tongue, lips, or vocal cords when
reading silently. Silent reading is a relatively new technique for
humanity. Therefore, when reading, especially the complicated seg-
ments or even words, the brain instructs the lips and the tongue to
perform the read-aloud but without a voice. Their experiment on
code reading showed that measuring the subvocalization signals
can be an indication of the diculty of a programming task.
2.3 Bloom’s Taxonomy in CS Education
When it comes to the assessment of learning processes, Bloom’s
taxonomy is one of the common frameworks educators follow [
]. In this framework [
], Bloom identies six levels of cogni-
tive skills that educators should aim at fullling with their students.
The levels are Remembering, Understanding, Applying, Analyz-
ing, Evaluating, and Creating. These cognitive levels are ordered
from low to high, simple to complex and concrete to abstract, and
each is a prerequisite to the next. This classication is combined
with a practical guideline that educators could use to evaluate the
learning outcome of their students by forming questions with cer-
tain verbs. In this way, it stimulates the cognitive process of the
required level of the taxonomy. In CS education there appear two
main usages of the taxonomy. First, the use of Bloom’s taxonomy
Paper Session: Pre-college Part 2
CompEd ’19, May 17–19, 2019, Chengdu, Sichuan, China
as a tool to measure the learning progress of students and how
they perform in introductory courses in particular. Some research
chose to build the assessment from scratch depending on the tax-
onomy. This includes the work of Whalley et al. [
] who assessed
the reading and comprehension skills of students in introductory
programming courses, creating a set of questions that conform to
Bloom’s taxonomy. The results show that the students performed
consistently with the cognitive diculty levels indicated by the
taxonomy. Similar is the work of Thompson et al. [
] who created
another set of programming questions per the main categories of
the taxonomy, discussing each item and showing educators how
to interpret the results. Both works have been insightful to our
research. Second, other researchers have applied the taxonomy
to evaluate existing programming exams or courses. Lister [
argues that the taxonomy should be used as a framework of assess-
ment, not learning since it provides a reliable tool with a standard
level of assessment. Despite its popularity among researchers, there
seems a consensus that applying the taxonomy to programming
questions is challenging since a programming problem consists of
several building concepts which makes isolating the problem to
one cognitive category a hard job to do [
]. Although the
challenging task to map the assessment items to Bloom’s cognitive
levels will always depend on the interpretation of the educators,
the taxonomy should still provide a valuable tool to explore the
cognitive processes involved in any programming exercise.
The goal of this study is to answer an overarching research question:
how does reading code aloud during lessons aect the students’
learning of programming concepts? To this end, we designed and
ran a controlled experiment with primary school students. In this
section, we describe the setup and design of the experiment in
addition to the theoretical basis we use for the assessment.
3.1 Setup
We provided Python lessons to 49 primary school students in the
Netherlands. We split the participants into a control and an experi-
mental group. Both groups received the same lessons: three lessons
of 1.5 hours each given by the authors of this paper, one lesson
per week. We gave the lessons to the groups subsequently: rst the
experimental group, followed by a break, followed by the control
group. The students knew they were going to learn programming
during the lessons but they were not aware of the experiment’s
goal. We asked the consent of the parents to collect the anonymous
data needed for the research.
3.2 Participants
Participants are 49 students of one primary school in Rotterdam,
the Netherlands. The programming lessons are provided as part
of extracurricular activities arranged by the school, taking place
during school days in a computer lab at the school. As shown in
Figure 1, a total of 49 school children between 9 and 13 years with
an average age of 11.12 years participated in the study. Participants
were 28 boys, 20 girls, and 1 participant who chose not to spec-
ify their gender. The control group consisted of 24 children (age
average=11.167 years, 6 girls - 17 boys - 1 unspecied), while the
experimental group consisted of 25 children (age average=11.08
Figure 1: Age in years versus count of participants per group,
mean=11.12 years. Both groups have equal age means
years, 14 girls, 11 boys). We could not control the split of groups
since they are school classes hence the non-balance in gender.
3.3 Lesson Design and Materials
Each lesson starts by introducing a small working program. One
teacher shows a program on the interactive white-board explain-
ing the code per line and highlighting the concepts included. The
lessons include the following concepts primarily:
Variables Setting and retrieving a variable’s value
Creating lists of integers and strings, accessing and mod-
ifying lists through built-in functions
For-Loops Using loops for repeating certain operations
Function use
Calling built-in functions and using functions
from packages.
During the program explanation, the teachers encourage the
students to express their thoughts on what the code does via inter-
active questions, such as What do you think happens if we change this
value? According to [
], reading text aloud aiming at improving
the comprehension should be combined with “active and engaging
discussions about the text”. Following, the students are instructed
to work in pairs to carry out specic exercises according to the
lessons’ material. During the lessons, an online compiler for Python
(Repl.it1) was used. The nal assessment questions are on-line 2.
3.4 RA Design and Implementation
Understandably, there exists no guideline on how to read code.
When reading code, however, people tend to nd that there are
ambiguous words, symbols, and even punctuation, and vocalizing
them is both challenging and subjective [
]. Consider an example
as simple as the variable assignment a=10, is it vocalized as “a is
ten”,“a equals ten”,“a gets 10” or “set a to ten”. In this experiment,
we follow a similar approach to [
] where the code is read as
if the person is telling another beginner student what to type into
a computer. For both the experimental and the control group, the
instructor read the code aloud to the students during and following
the explanation of a concept within the code, a for-loop for instance.
Only the students in the experimental group, however, were asked
to repeat the reading activity: all-together and aloud. We consis-
tently read all keywords, symbols identier names and punctuation
marks that are essential to the working of a program, for example
quotation marks, brackets, colons and white spaces necessary for
Paper Session: Pre-college Part 2
CompEd ’19, May 17–19, 2019, Chengdu, Sichuan, China
indentation. The full list of what and how we vocalized code during
the lessons are presented in Table 1. We call this list the vocalization
guideline, and we use it later to answer RQ3 which investigates
the extent to which the students follow the taught guideline later
during the assessment. We do this investigation for both groups
since all the students in both groups listened to the teacher vocaliz-
ing the code during the lesson, but only the experimental group’s
participants performed it themselves.
3.5 Assessment
We choose to assess only the two basic levels of Bloom’s taxon-
omy: Remembering and Understanding. According to Lister [
the two categories are sucient for beginners when we want to
assess the eectiveness of their code reading. When relating these
two categories to programming assessment, Thompson et al. [
provide useful insights into how to interpret them into program-
ming assessment terms. Remembering can be related to activities
centered around identifying a programming construct or recalling
the implementation of a concept in a piece of code. For example
by “recall the syntax rules for that construct and use those rules to
recognize that construct in the provided code”[
]. For the Under-
standing category, it includes translating an algorithm from one
form to another, plus explaining or presenting an example of an
algorithm or a design pattern. For example, tracing a piece of code
into its expected output. Multiple choice questions are suitable to
assess these two basic levels for beginners [
]. We developed an
11-questions nal assessment exam: 9 are multiple choice questions,
one is of ll-in type in addition to vocalizing the code snippet, and
one only requires the student to vocalize a code snippet. We aimed
that the questions cover i) all of the programming concepts we
taught (see section 3.3), and ii) for each concept to have a question
assessing the two targeted levels of Bloom’s taxonomy. Table 2
shows the questions and their mappings to Bloom’s levels.
3.5.1 Following the Vocalization Guideline. We ask the students
in both groups to answer two vocalization questions (Question 9
and 11). The students need to write down in words how they would
vocalize a code snippet to another beginner student. Although the
students in the control group did not read the code aloud themselves,
they listened to the instructor performing the RA. Therefore, we
ask both groups to answer these questions. We use the students’
answers to address RQ3.
In this section, we provide the answers to our research questions.
4.1 RQ1: What is the eect of RA on the
To answer this question, we investigate the answers to the questions
in the Remembering-level (7 questions) (see Table 2). The control
group has a mean of 3.58 while the experimental group has a mean
of 4.56. To test the equality of means we use the Mann-Whitney U
Test since the sample size is relatively small and the presence of
some outliers. The results (Table 3) show that the dierence between
the control and experimental groups is signicant (p=0.003). The
eect size r= 0.42 which indicates a large eect [11, 18].
Age factor
: There is no relationship between the student’s age and
the Remembering-level score across the two groups. All age groups
have equal means in the two experimental groups.
4.2 RQ2: What is the eect of RA on the
To answer this question, we investigate the answers to the questions
in the Understanding-level (3 questions) (see Table 2). The control
group has a mean of 0.92 while the experimental group has a mean
of 0.90. Similar to RQ1, we use the Mann-Whitney U Test to check
the equality of the means. The test results (Table 3) show that
the dierence between the control and experimental groups is not
signicant (p= 0.93).
4.3 RQ3: How do students follow the
vocalization guideline when they read code
Figure 2: The vocalization score means by group
To answer this question we analyze students’ answers to the
vocalizing questions (Question 9 and 11 in Table 2), where we
asked students to write down, in the answer paper, how would they
vocalize two small code snippets.
The vocalization guideline is the way we chose to vocalize the
code snippets provided during the lessons. It is summarized in Ta-
ble 1. We grade the student’s answers following the guideline; a
point is given every time the guideline is followed, and the maxi-
mum possible is 14 points.
4.3.1 Following the Guideline: As expected there exists a sig-
nicant dierence between the two groups in following the vo-
calization guideline (see Figure 2). This is expected because of the
intervention we did in the experimental group. The experimental
group who read the code aloud themselves scored an average of
10.20, while the students in the control group, who only listened to
the code being read, scored an average of 6.79. The Mann-Whitney
test suggests the dierence between the two means is signicant
(U=168.5, p= 0.009, r= 0.375 (a medium to large eect)).
4.3.2 Most and Least Followed Vocalizations. We analyzed the
followed vocalization guidelines observed in both groups (Table 4.
We notice that some vocalizations are frequent in both control and
experimental groups especially the variable assignment (is), comma
and single quotation mark. However, the colon in for-loop goes
from one of the most frequent, in the experimental group, to the
one of the least frequent in the control group. This dierence can
be linked to the intervention exercise making a lasting memory for
the participants in the experimental group.
Paper Session: Pre-college Part 2
CompEd ’19, May 17–19, 2019, Chengdu, Sichuan, China
Table 1: The vocalization guideline used during the lessons
Vocalization Item Description Code How Code was Vocalized
V1 Setting a variable value temperature =8temperature is eight
V2 Function-calling with round brackets
for i in range(10):
temperature = temperature + 1
for i in range open round bracket ten close round bracket
V3 For-loop colon colon
V4 For-loop indentation space space space
V5 Plus sign in expressions temperature is temperature plus one
V6 Symbols in identiers (underscore)
healthy_food = ['apple','banana']
healthy underscorefood is
V7 List square bracket (open) open square bracket
V8 Strings single quotation (begin) single quotation apple single quotation
V9 Comma separation between list items comma
V8 (Repeated) Strings single quotation (end) single quotation banana single quotation
V7(Repeated) List square bracket (close) close square bracket
V10 Function use: calling from a package with dot food = random.choice(healthy_food) food is random dot choice open round bracket healthy underscore food close round bracket
Table 2: The list of questions and their corresponding Bloom’s cognitive level
# Concept(s) Bloom’s level Prerequisite Knowledge Student’s Action(s) to Answer
1 List Create/Modify
The syntax to create a list of string literals
Replace syntactically incorrect line by a correct option
2 Variables
The syntax to increase an integer variable’s value
Replace an empty line with a syntactically correct
3 Function use
Remembering The syntax to call a function with a variable parameter
Replace syntactically incorrect line by
a correct option
4 Function use
The syntax to call the print function with a string
Function use & Variables
The correct syntax to print a variable’s value
Sequential execution &
Same indentation for each line of a Python block
Identify/recognize/locate the cause of the error from
7 For-loop
For-loop syntax
Indentation eect on lines being within/outside
a loop
Trace and predict the outcome of a for-loop with a
print statement within, followed by another print
8 For-loop
For-loop syntax
Identify/recognize the syntactically correct for-loop
to get a specic outcome
For-loop & Variables
(with Vocalize)
For-loop syntax
Indentation eect on lines being within/outside
a loop
Trace and predict the outcome of a loop that
increases the value of a variable. Then write in words
how you would vocalize the code.
List Create/Modify &
Function use
Understanding The syntax of List creation, List access & modication
using built-in functions
Trace code and interpret its use in one of low-,medium-
or high natural language descriptions
Vocalize only - Vocalize code as if your are reading it to a friend
Write, in words, on the answer sheet how you would
vocalize the code snippet
Table 3: The dierence by group in the answer score means
to each category of questions.
Remembering-level Score
7 Questions
Understanding-level Score
3 Questions
Mean 4.56 3.58 0.90 0.92
Std. Dev. 1.00 1.28 0.76 0.75
Mann Whitney U156.5 443.5 304.5 295.5
z-score 2.97 0.09
(2-tailed) p0.003 0.93
Signicant Yes (r= 0.42) No
4.3.3 Eect of Following the Guideline: Within one group, we
analyzed whether following the guideline aects the answers to
either Remembering- or Understanding-level questions. Results so
far showed that students in the experimental group are more likely
to score higher in following the vocalization guideline, and at the
same time are more likely to score higher in the Remembering-level
score, than the students in the control group. However, comparing
the students’ score within the experimental group itself does not
show a relationship between following the vocalization guideline
and the score in neither the Remembering- nor Understanding-level
category. The control group, however, reveals a dierent behavior.
Results show that students within the control group who followed
the vocalization guidelines scored higher on the Understanding-
level questions (see Figure 3). According to the ANOVA test, the
vocalization varies across the quantiles of the Understanding-level
score (F=8.232, p=0.002). We highlight again that students in this
group were not instructed to repeat the reading of the code, they
only listened to the instructor reading aloud the code snippets.
5.1 Reection and explanation of the results
The main nding in this study is the signicant eect RA has on
remembering the syntax of the programming constructs taught to
the students. We believe this result encourages teachers in primary
schools to practice code vocalization as pedagogy in their program-
ming classes. While learning how to program is unique and known
to be dicult, it is still a learning process. We can, therefore, use
pedagogies from other domains to help make programming easier
to learn for younger students in particular. With that in mind, we
can explain the eect of RA we observe in this study from two
Paper Session: Pre-college Part 2
CompEd ’19, May 17–19, 2019, Chengdu, Sichuan, China
Figure 3: The variance of following the vocalization guide-
line among the participants in the control group and and the
relation with the score on Understanding-level questions
Table 4: The most and least followed vocalizations following
the guideline in Table 1
Most Followed Least Followed
The Experimental Group (n=25)
V3 - Colon in for-loop (:) 22 V7 - List square brackets 11
V1 - Variable assignment (is) 21 V4 - Space indentation in for-loops 16
V9 - Comma (,)
V8 - String single quotation (’)
V10 - Dot in function call
19 V5 - Plus sign 17
The Control Group (n=24)
V1 - Variable assignment 14.25 V4 - Space indentation in for-loops 5
V6 - Underscore in variable names 14 V7 - List square brackets 6
V9 - Comma (,)
V8 - String single quotation (’) 13
V3 - Colon in for-loop (:)
V2 - Round bracket for function call
V10 - Dot in function call
V5 - Plus sign
angles. First, RA improves the learning environment by utilizing
a familiar technique to young students. This subsequently raises
focus and attention of the students. When attention is gained and
sustained learning can happen as “attention is a prerequisite for
learning” [
, p. 3]. Secondly, RA helps in automating the retrieval
of basic knowledge required for cognitive development [
]. Accord-
ing to the neo-Piagetian theories of cognitive development [
students in their initial phase of learning programming are at the
sensorimotor stage. At that stage, students mostly struggle in in-
terpreting the semantics of the code they read, which aects their
performance in tracing tasks in particular [
]. Practicing RA
could potentially help in reducing the struggle because it automates
the remembering of the language constructs, and helps the student
moving faster to the next development phases.
5.2 RA and the granularity of the vocalization
There is currently no standard guideline specifying how constructs
of Python, or other programming languages, should be vocalized.
As presented in Section 2.2, there are various granularities and
strategies one can read code with. In this study, however, we fol-
low a specic technique to vocalization (Section 3.4) which can be
considered of a low granularity, focusing on syntax rather than
semantics or relations within the code. Nevertheless, when teach-
ing young and novice students, the RA method we follow could
help teachers create a benchmark where students know how to call
all the elements in their programming environment. We see from
the answers of the control group students that there exist some
variances in calling specic symbols. For example, calling the single
(’) a“single quotation”,“apostrophe”, or “upper comma”. This variance
shows the challenges that beginners face to identify symbols in the
rst place. An extra cognitive eort is spent on remembering rather
than on conceptual understanding. We hypothesize that higher
granularities of vocalization of code structures or semantics can be
integrated into the following phases. To determine the best vocal-
ization method teachers start with, however, is out of this study’s
scope and is an opportunity for future research.
5.3 Threats to validity
Our study involves some threats to its validity. First, the split of
the participants into the two groups might have inuenced the
results. The split was introduced by the school structure; i.e., per
class. However, we randomly selected one class as the experimental
group and the other as the control group. The second threat, the
authors being the teachers at the same time could introduce a bias
in favor of the experimental group. To reduce the eect of such
bias we ensured that both groups studied the same materials over
the same amount of time with the same teacher. The main teacher
was accompanied by another teacher who among other things
observed the teaching given to the two groups. By these steps, we
ensured that the only dierence between the two groups would be
the reading-aloud method. Third threat, a wrongful assignment of
a question to one of Bloom’s cognitive levels by the authors. This
is a common challenge for researchers in similar studies [
and future experiments will lead to rening this process. Finally,
a threat to the external validity of our study is the diculty to
generalize its results. This is, however, an inherent issue in similar
studies with small sample size [
]. To overcome this threat we
should replicate the study across dierent participants in the future.
Our paper aims at measuring the eect of reading code aloud during
programming lessons on comprehension. We perform a controlled
experiment with 49 school students aged between 9 and 13. We
assess the students’ comprehension of basic programming concepts
after three Python programming lessons. The assessment is based
on Bloom’s taxonomy and focused on the of remembering and
understanding levels. The results show that the participants in the
experimental group score signicantly higher in remembering-
level questions. However, the two groups perform similarly in
understanding-level questions. Furthermore, we observe that the
participants in both groups vocalize specic constructs more often
than others. For example, the variable’s assignment (is) and punctu-
ation symbols (comma, underscore and quotation mark). Our paper
suggests that using RA for teaching programming in schools will
contribute to improving comprehension among young students. In
particular, it will improve remembering the syntax, paving the way
to spending more cognitive eort on the higher level understanding
of the concepts. For future work, we aim at experimenting with
dierent RA approaches with dierent code granularities to nd
the best approach to improve code comprehension at this age.
Paper Session: Pre-college Part 2
CompEd ’19, May 17–19, 2019, Chengdu, Sichuan, China
Lorin W Anderson and David Krathwohl. 2001. A Taxonomy for Learning, Teach-
ing, and Assessing. New York.
[2] Richard Arends. 2012. Learning to teach. McGraw-Hill.
[3] Craig Barton. 2018. How I wish I’d taught maths. John Catt Educational Ltd.
Andrew Begel. Programming By Voice: A Domain-specic Application of Speech
Recognition. (????).
A. Begel and S.L. Graham. Spoken Programs. 2005 IEEE Symposium on Visual
Languages and Human-Centric Computing (VL/HCC’05) (????).
A. Begel and S.L. Graham. 2006. An Assessment of a Speech-Based Programming
Environment. Visual Languages and Human-Centric Computing (VL/HCC’06)
(2006). DOI:
B.S. Bloom. 1956. Taxonomy of Educational Objectives: The Classication of
Educational Goals. Number v. 1 in Taxonomy of Educational Objectives: The
Classication of Educational Goals. D. McKay.
B. Du Boulay. 1986. Some Diculties of Learning to Program. Journal of Edu-
cational Computing Research 2, 1 (1986), 57–73.
3LFX-9RRF-67T8-UVK9 arXiv:
Adriana G. Bus, Marinus H. van IJzendoorn, and Anthony D. Pellegrini. 1995.
Joint Book Reading Makes for Success in Learning to Read: A Meta-Analysis on
Intergenerational Transmission of Literacy. Review of Educational Research 65, 1
(1995), 1–21. DOI:
G. Canfora, A. Cimitile, and C.A. Visaggio. Lessons learned about distributed
pair programming: what are the knowledge needs to address? WET ICE 2003.
Proceedings. Twelfth IEEE International Workshops on Enabling Technologies: In-
frastructure for Collaborative Enterprises, 2003. (????).
Barry H Cohen. 2008. Explaining Psychological Statistics (3 ed.). John Wiley &
Ryan Deschambault. 2011. Thinking-Aloud as Talking-in-Interaction: Reinter-
preting How L2 Lexical Inferencing Gets Done. Language Learning 62, 1 (2011),
266–301. DOI:
A. Desilets. 2001. VoiceGrip: A Tool for Programming-by-Voice. International
Journal of Speech Technology 4, 2 (01 Jun 2001), 103–116.
Richard Fateman. 1998. How can we speak math. Journal of Symbolic Computation
Sue Fitzgerald, Beth Simon, and Lynda Thomas. 2005. Strategies that students use
to trace code. Proceedings of the 2005 international workshop on Computing edu-
cation research - ICER ’05 (2005).
Joan M Francioni and Ann C Smith. 2002. Computer science accessibility for
students with visual disabilities. In ACM SIGCSE Bulletin, Vol. 34. ACM, 91–95.
Andrea D. Hale, Renee O. Hawkins, Wesley Sheeley, Jennifer R. Reynolds,
Shonna Jenkins, Ara J. Schmitt, and Daniel A. Martin. 2010. An investigation
of silent versus aloud reading comprehension of elementary students using
Maze assessment procedures. Psychology in the Schools 48, 1 (2010), 4–13.
[18] John Hattie. 2009. Visible learning. Routledge.
Felienne Hermans and Efthimia Aivaloglou. 2017. Teaching Software Engineer-
ing Principles to K-12 Students: A MOOC on Scratch. In Proceedings of the 39th
International Conference on Software Engineering: Software Engineering and Ed-
ucation Track (ICSE-SEET ’17). IEEE Press, Piscataway, NJ, USA, 13–22.
Felienne Hermans, Alaaeddin Swidan, and Efthimia Aivaloglou. 2018. Code
phonology. Proceedings of the 26th Conference on Program Comprehension - ICPC
’18 (2018). DOI:
John M. Keller. 1987. Development and use of the ARCS model of instructional
design. Journal of Instructional Development 10, 3 (1987), 2–10.
Sherry Kragler, Linda Martin, and Virginia Schreier. 2015. Investigating Young
Children’s Use of Reading Strategies: A Longitudinal Study. Reading Psychology
36, 5 (2015), 445–472. DOI:
Raymond Lister. 2005. Computer Science Teachers as Amateurs, Students and
Researchers. In In Proceedings of the 5 th Baltic Sea Conference on Computing
Education Research. (Koli. 3–12.
Raymond Lister, Elizabeth S. Adams, Sue Fitzgerald, William Fone, John Hamer,
Morten Lindholm, Robert McCartney, Jan Erik Moström, Kate Sanders, Otto
Seppälä, Beth Simon, and Lynda Thomas. 2004. A Multi-national Study of Reading
and Tracing Skills in Novice Programmers. SIGCSE Bull. 36, 4 (June 2004), 119–150.
Linda E. Martin and Sherry Kragler. 2011. Becoming a Self-Regulated Reader: A
Study of Primary-Grade Students’ Reading Strategies. Literacy Research and In-
struction 50, 2 (2011), 89–104.
Michael F. Mascolo. 2015. Neo-Piagetian Theories of Cognitive Development.
International Encyclopedia of the Social & Behavioral Sciences (2015), 501–510.
DOI: 08-097086- 8.23097-3
R. Steve McCallum, Shannon Sharp, Sherry Mee Bell, and Thomas George. 2004.
Silent versus oral reading comprehension and eciency. Psychology in the Schools
41, 2 (2004), 241–246. DOI:
Chris Parnin. 2011. Subvocalization - Toward Hearing the Inner Thoughts of
Developers. 2011 IEEE 19th International Conference on Program Comprehension
(2011). DOI:
M. Perrone-Bertolotti, L. Rapin, J.-P. Lachaux, M. Baciu, and H. LÅŻvenbruck.
2014. What is that little voice inside my head? Inner speech phenomenology, its
role in cognitive performance, and its relation to self-monitoring. Behavioural
Brain Research 261 (2014), 220–239.
David E Price, DA Dahlstrom, Ben Newton, and Joseph L Zachary. 2002. O to
See the Wizard: using a" Wizard of Oz" study to learn how to design a spoken
language interface for programming. In Frontiers in Education, 2002. FIE 2002.
32nd Annual, Vol. 1. IEEE, T2G–T2G.
Suzanne M Prior and Katherine A Welling. 2001. " Read in Your Head": A
Vygotskian Analysis of the Transition from Oral to Silent Reading. Reading
Psychology 22, 1 (2001), 1–15.
Katherine E. Purswell and Dee C. Ray. 2014. Research With Small Samples.
Counseling Outcome Research and Evaluation 5, 2 (2014), 116–126.
Keith Rayner, Barbara R. Foorman, Charles A. Perfetti, David Pesetsky, and
Mark S. Seidenberg. 2002. How Should Reading be Taught? Scientic American
286, 3 (2002), 84–91. DOI:camerican0302-84
Lana Edwards Santoro, David J. Chard, Lisa Howard, and Scott K. Baker. 2008.
Making the Very Most of Classroom Read-Alouds to Promote Comprehension
and Vocabulary. The Reading Teacher 61, 5 (2008), 396–408.
Marcel Schmeier and Ruud Bijman. 2017. Eectief rekenonderwijs op de basisschool
(1st edition ed.). Uitgeverij Pica.
Lindsey Snell and Mr Jim Cunningham. 2000. An investigation into programming
by voice and development of a toolkit for writing voice-controlled applications.
Osamu Takeuchi, Maiko Ikeda, and Atsushi Mizumoto. 2012. Reading Aloud
Activity in L2 and Cerebral Activation. RELC Journal 43, 2 (2012), 151–167.
Donna Teague and Raymond Lister. 2014. Longitudinal Think Aloud Study
of a Novice Programmer. In Proceedings of the Sixteenth Australasian Comput-
ing Education Conference - Volume 148 (ACE ’14). Australian Computer Society,
Inc, Darlinghurst, Australia, Australia, 41–50.
Errol Thompson, Andrew Luxton-Reilly, Jacqueline L. Whalley, Minjie Hu, and
Phil Robbins. 2008. Bloom’s Taxonomy for CS Assessment. In Proceedings of the
Tenth Conference on Australasian Computing Education - Volume 78 (ACE ’08).
Australian Computer Society, Inc., Darlinghurst, Australia, Australia, 155–161.
Jacqueline Whalley and Nadia Kasto. 2014. A qualitative think-aloud study of
novice programmers’ code writing strategies. Proceedings of the 2014 conference
on Innovation & technology in computer science education - ITiCSE ’14 (2014).
Jacqueline L. Whalley, Raymond Lister, Errol Thompson, TonyClear, Phil Robbins,
P. K. Ajith Kumar, and Christine Prasad. 2006. An Australasian Study of Reading
and Comprehension Skills in Novice Programmers, Using the Bloom and SOLO
Taxonomies. In Proceedings of the 8th Australasian Conference on Computing
Education - Volume 52 (ACE ’06). Australian Computer Society, Inc., Darlinghurst,
Australia, Australia, 243–252.
Paper Session: Pre-college Part 2
CompEd ’19, May 17–19, 2019, Chengdu, Sichuan, China
... In all the activities that people carry out daily, reading comprehension is immersed, and even more so in this time when technology is developing with great speed due to the pandemic and education needs it to continue developing, in that sense they have found several studies that show the importance of reading comprehension. Swidan & Hermans (2019) proposed an evaluation at the beginning of the research based on Bloom's taxonomy, which focuses on the levels of memory and comprehension, after applying the instruments it was obtained as a result that the participants in the experimental group score significantly higher in the memory level questions, however, the two groups performed similarly in the comprehension level questions, that is, there was no significant difference. However, Ceyhan & Yıldız (2021) differ from the previous research, they carried out an analysis and they revealed that practices based on the skill of interactive reading aloud increased the reading comprehension levels of second grade students, Teachers modeled reading for students by reading at the appropriate pace, pronouncing accurately, and observing intonation and punctuation marks, and with an appropriate pause, they let the students repeat the words they had difficulty pronouncing. ...
Full-text available
The purpose of this review article is to analyze publications made on reading comprehension strategies in primary school students in a pandemic. To carry out this work, 36 articles that were found in the databases were analyzed: Scopus, ProQuest, Ebsco and Scielo in English and Spanish, between the years 2017 and 2021, to carry out the search of the present investigation, the following were used Equations in the databases indicated above: strategies AND reading comprehension AND primary school students, pandemic strategies AND reading comprehension AND elementary school students AND pandemic, a double entry table was used to organize important information from each article found. The results showed that the knowledge and development of reading comprehension strategies promote reading competence.
... Especially block-based programming languages are currently being used in K-12 programming education extensively, most commonly using Scratch (Resnick et al., 2009) andAlice (Conway et al., 1994). In addition to blocks-based languages, robots (Kazakoff, Sullivan & Bers, 2013;Ludi, Bernstein & Mutch-Jones, 2018;Swidan & Hermans, 2017) and textual programming languages (Hermans, 2020;Price & Barnes, 2015;Swidan & Hermans, 2019) are also frequently used. Other approaches use no computers at all, referred to as unplugged computing (Hermans & Aivaloglou, 2017). ...
Full-text available
Computer science education (CSEd) research within K-12 makes extensive use of empirical studies in which children participate. Insight in the demographics of these children is important for the purpose of understanding the representativeness of the populations included. This literature review studies the demographics of subjects included in K-12 CSEd studies. We have manually inspected the proceedings of three of the main international CSEd conferences: SIGCSE, ITiCSE and ICER, of five years (2014–2018), and selected all papers pertaining to K-12 CSEd experiments. This led to a sample of 134 papers describing 143 studies. We manually read these papers to determine the demographic information that was reported on, investigating the following categories: age/grade, gender, race/ethnic background, location, prior computer science experience, socio-economic status (SES), and disability. Our findings show that children from the United States, boys and children without computer science experience are included most frequently. Race and SES are frequently not reported on, and for race as well as for disabilities there appears a tendency to report these categories only when they deviate from the majority. Further, for several demographic categories different criteria are used to determine them. Finally, most studies take place within schools. These insights can be valuable to correctly interpret current knowledge from K-12 CSEd research, and furthermore can be helpful in developing standards for consistent collection and reporting of demographic information in this community.
... Programming instruction is improved by reading code out loud (Swidan & Hermans 2019), so it stands to reason that statistics programming instruction could be similarly improved. ...
Full-text available
Computing makes up a large and growing component of data science and statistics courses. Many of those courses, especially when taught by faculty who are statisticians by training, teach R as the programming language. A number of instructors have opted to build much of their teaching around the use of the tidyverse. The tidyverse, in the words of its developers, "is a collection of R packages that share a high-level design philosophy and low-level grammar and data structures, so that learning one package makes it easier to learn the next" (Wickham et al. 2019). The shared principles have led to the widespread adoption of the tidyverse ecosystem. No small part of this usage is because the tidyverse tools have been intentionally designed to ease the learning process and cognitive load for users as they engage with each new piece of the larger ecosystem. Moreover, the functionality offered by the packages within the tidyverse spans the entire data science cycle, which includes data import, visualisation, wrangling, modeling, and communication. We believe the tidyverse provides an effective and efficient pathway to data science mastery for students at a variety of different levels of experience. In this paper, we introduce the tidyverse from an educator's perspective, touching on the what (a brief introduction to the tidyverse), the why (pedagogical benefits, opportunities, and challenges), the how (scoping and implementation options), and the where (details on courses, curricula, and student populations).
... In accordance with this, teaching end-user computing focuses on tools, and on navigating the interface. However, recently, more and more teachers have been recognizing the importance of real-world computer problem-solving and of building and recalling algorithms and schemata in end-user computing, as well (Pólya 1954;Hubwieser 2004;Kahneman 2011;Sestoft 2011;Sweller et al. 2011;Csernoch et al. 2015;Csernoch and Dani 2017;Hermans 2019;Swidan and Hermans 2019). We are convinced that concept-based problem-solving approaches, which have been proved effective and efficient in other school subjects, should be introduced in all topics of informatics education, and not exclusively in programming. ...
Full-text available
In Hungary, K-12 informatics/computer science education focuses on mostly surface-based methods. This approach can be observed in the teaching of several topics in the subject, of which we focus on spreadsheet management. This is further emphasized by regulatory documents – the Hungarian National Core Curriculum and Hungarian Curriculum Frameworks –, where handling algorithms, calling schemata, and problem-solving in general are only assigned to the programming topic. In the process of fulfilling the requirements of the school curricula and the various tool-centered exams, students become familiar with the software interfaces and how to navigate them, instead of developing computational thinking skills and learning how to approach and solve real-world problems. Our educational system is based on a spiral teaching approach; therefore, spreadsheet management is taught throughout several grades in a small number of lessons. Prior research shows that students learning spreadsheet management with surface-approach methods do not build up a reliable knowledge structure. These students cannot solve problems in contexts differing to the ones in which they learned the topic and cannot use their surface navigation abilities in different software environments. Our research group focuses on spreadsheeting with an algorithm-building and problem-solving method at the center of the teaching-learning process. For this purpose, we have developed and introduced the Sprego (Spreadsheet Lego) methodology. Sprego is based on Pólya’s four-step concept-based problem-solving approach, and its efficiency has already been proved compared to traditional low-mathability surface-approach methods. In the comparison of the low- and high-mathability approaches, several further questions arise, and amongst them one crucial aspect is how the different methods support the schema-construction and knowledge built up in long-term memory. In this paper we discuss this question using a delayed post-test that was carried out one year after the treatment period. We focused on the students’ achievement both in the experimental (Sprego) and control (traditional surface-approaches) groups based on the methods used one year prior to the administration of the delayed post-test. The results show that students who learned the spreadsheet management topic with Sprego achieved significantly better scores on the delayed tests than those students who used low-mathability approaches.
Full-text available
The paper presents the details of a four-year project to test the effectiveness of teaching spread-sheeting with spreadsheet programming, instead of the traditional, widely accepted surface approach methods. The novel method applied in the project, entitled Sprego (Spreadsheet Lego), is a concept-based problem-solving approach adapted from the didactics of other sciences and computer programming. In the experimental group contextualized, real-world programming problems are presented in a spreadsheet environment. A semi-unplugged data-driven analysis is carried out based on each problem, which is followed by the building of a feasible algorithm, expressed by natural language expressions. The coding is completed in the following step by applying a limited number of spreadsheet (Sprego) functions, multilevel, and array formulas. The final steps of the process are discussion and debugging. On the other hand, classical, tool-centered approaches are applied in the control groups. Our research reveals that the traditional surface approach methods for teaching spreadsheeting do not provide long lasting, reliable knowledge which would provide students and end-users with effective problem-solving strategies, while Sprego does. Beyond this finding, the project proves that Sprego supports schema construction and extended abstraction, which is one of the major hiatus points of traditional surface navigation methods. The project also reveals that developing computational thinking skills should not be downgraded, and the misconceptions of self-taught end-users and user-friendly applications should be reconsidered, especially their application in educational environments. Gaining effective computer problem-solving skills and knowledge-transfer abilities is not magic, but a time-consuming process which requires consciously developed and effective methods, and teachers who accept the incremental nature of the sciences.
Full-text available
Neo-Piagetian theories of cognitive development emerged as attempts to preserve core theoretical and empirically supported aspects of Jean Piaget's seminal theory of intellectual development while addressing criticisms leveled against the theory. Neo-Piagetian theories preserve three basic ideas from Piaget's theory: (1) the unit of cognitive analysis is the scheme or psychological structure; (2) psychological structures undergo qualitative transformation over time; and (3) higher order structures develop through the differentiation and coordination of lower level structures. After a brief discussion of similarities and differences among prominent neo-Piagetian theories, one representative approach (dynamic skill theory) is discussed in depth. The discussion concludes with a description of recent advances in neo-Piagetian systems theory.
Conference Paper
Full-text available
This paper presents part of a larger long term study into the cognitive aspects of the early stages of learning to write computer programs Tasks designed to trigger learning events were used to provide the opportunity to observe student learning, in terms of the development and modification of cognitive structures or schemata, during think aloud sessions. A narrative analysis of six students' attempts to solve these tasks is presented. The students' progression in learning and attitudinal approaches to learning is examined and provides some insight into the cognitive processes involved in learning computer programming.
Conference Paper
Full-text available
Recent research from within a neo-Piagetian perspective proposes that novice programmers pass through the sensorimotor and preoperational stages before being able to reason at the concrete operational stage. However, academics traditionally teach and assess introductory programming as if students commence at the concrete operational stage. In this paper, we present results from a series of think aloud sessions with a single student, known by the pseudonym “Donald”. We conducted the sessions mainly over one semester, with an additional session three semesters later. Donald first manifested predominately sensorimotor reasoning, followed by preoperational reasoning, and finally concrete operational reasoning. This longitudinal think aloud study of Donald is the first direct observational evidence of a novice programmer progressing through the neo-Piagetian stages.
Full-text available
This article explores the cerebral mechanism of reading aloud activities in L2 learners. These activities have been widely used in L2 learning and teaching, and its effect has been reported in various Asian L2 learning contexts. However, the reasons for its effectiveness have not been examined. In order to fill in this gap, two studies using a brain-imaging technique, near-infrared spectroscopy, were conducted in order to determine a cerebral basis for the effectiveness of reading aloud activities. Study 1 investigated learners with high L2 proficiency to show the difference in cerebral activation between L2 and L1 learners as they read a passage aloud. The effect of material difficulty was also examined in this study. Study 2 then examined learners with both high and low L2 proficiency to show the effect of material difficulty vis-à-vis the learners’ L2 proficiency. The effect of repeated reading aloud activities was also investigated in this study. These studies show that: Reading aloud in L2 results in a higher degree of cerebral activation than reading aloud in L1. Reading material beyond learners’ L2 ability aloud results in low brain activation. Repetition of the same normal reading aloud activity in L2 does not necessarily increase (or decrease) the level of cerebral activation. However, including a repetitive cognitively demanding reading aloud activity does cause high brain activation. On the basis of these findings, this article provides a cerebral basis for the effectiveness of reading aloud activities in L2 learning.
Conference Paper
When children learn to read, they almost invariably start with oral reading: reading the words and sentences out loud. Experiments have shown that when novices read text aloud, their comprehension is better then when reading in silence. This is attributed to the fact that reading aloud focuses the child's attention to the text. We hypothesize that reading code aloud could support program comprehension in a similar way, encouraging novice programmers to pay attention to details. To this end we explore how novices read code, and we found that novice programmers vocalize code in different ways, sometimes changing vocalization within a code snippet. We thus believe that in order to teach novices to read code aloud, an agreed upon way of reading code is needed. As such, this paper proposes studying code phonology, ultimately leading to a shared understanding about how code should be read aloud, such that this can be practiced. In addition to being valuable as an educational and diagnostic tool for novices, we believe that pair programmers could also benefit from standardized communication about code, and that it could support improved tools for visually and physically disabled programmers.
Single case designs (SCDs) and randomized small group (RSG) designs are two options for researchers who have limited resources and who would like to demonstrate the experimental effect of an intervention. The authors address considerations for internal and external validity in each design and provide an overview of the strengths and limitations of the various statistical analyses in each design. Effective researchers are well-informed regarding research design and match small-n participant designs to appropriate research questions. Examples of research questions and research design are discussed.
In this three-year study, the authors examined the reading strategy use of 16 primary-grade students as they read fiction and expository text. The students (9 boys and 7 girls) represented low, average, and above-average readers. Structured interviews and observational data were collected each year as they progressed from first to third grade. Even though students dropped some strategies while beginning to use others, statistical analyses of the interview data revealed few statistical significant differences in the students’ reported strategy use. The data indicated there was alignment between students’ reported and observed strategy use. The analyses of the observational data also revealed the students had high accuracy percentages but low retelling scores.