ArticlePublisher preview available

Foundations of intuitive power analyses in children and adults

Authors:
To read the full-text of this research, you can request a copy directly from the authors.

Abstract and Figures

Decades of research indicate that some of the epistemic practices that support scientific enquiry emerge as part of intuitive reasoning in early childhood. Here, we ask whether adults and young children can use intuitive statistical reasoning and metacognitive strategies to estimate how much information they might need to solve different discrimination problems, suggesting that they have some of the foundations for ‘intuitive power analyses’. Across five experiments, both adults (N = 290) and children (N = 48, 6–8 years) were able to precisely represent the relative difficulty of discriminating populations and recognized that larger samples were required for populations with greater overlap. Participants were sensitive to the cost of sampling, as well as the perceptual nature of the stimuli. These findings indicate that both young children and adults metacognitively represent their own ability to make discriminations even in the absence of data, and can use this to guide efficient and effective exploration. Adults and children can represent the relative difficulty of discriminating two populations and recognize that larger samples are required for populations with greater overlap. This suggests that they have foundations for ‘intuitive power analyses’.
This content is subject to copyright. Terms and conditions apply.
Articles
https://doi.org/10.1038/s41562-022-01427-2
Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA. e-mail: lschulz@mit.edu
In science, we use power analyses to estimate how much evidence
is needed to test the hypotheses we are considering. For instance,
if we suspect that a drug will cure 90% of patients while a placebo
will work 30% of the time, we will conclude that a relatively small
sample of patients will enable us to test the drug’s efficacy; by con-
trast, if we believe the drug will cure only 50% of patients, we know
we will need a larger sample to show an effect. Exact power analy-
ses, of course, require formal training in statistics and an estimate
of the size of the effect the researcher aims to demonstrate. But the
intuitive foundation of these analyses—that it takes more evidence
to distinguish some hypotheses than others—may be part of com-
mon sense reasoning more broadly. Decades of research suggest
that some of the epistemic practices that support scientific enquiry
emerge as part of intuitive reasoning in early childhood. Here we
look at whether lay adults and young children can use these intui-
tive metacognitive abilities to represent how much information they
need to distinguish between populations depending on the degree
to which the populations overlap.
This question is grounded in a long tradition of work suggest-
ing that children are sensitive to statistics in the environment and
engage in selective exploration to maximize expected information
gain. Infants attend to the transitional probability of events in both
auditory1 and visual2 stimuli and can infer abstract rules from pat-
terns of data (for example, the difference between ABA and ABB
patterns3). Infants can also infer probable outcomes simply from
the number of modal possibilities; thus, for instance, if three blue
objects and one red one are spinning in an open container, infants
look longer if the red object falls out than if a blue one does4.
Additionally, infants are sensitive to the relationship between sam-
ples and populations: If most of the objects in a box are red, babies
expect most of the objects pulled from the box to be red but suspend
this inference if the objects are drawn from somewhere other than
the box (that is, the experimenter’s lap) or are drawn selectively by
an experimenter searching through the box rather than randomly5.
Infants also actively explore to reduce uncertainty. Seven- and
8-month-olds are more likely to look away from events that are
very predictable or unpredictable relative to moderately surpris-
ing events6 and 12-month-olds not only look longer at events that
violate their intuitive theories, but intervene to explore the viola-
tions (for example, dropping objects that appeared to violate grav-
ity; banging objects that appeared to violate solidity7).
Childrens sensitivity to the relationship between hypotheses
and evidence becomes increasingly sophisticated from toddlerhood
to middle childhood. Toddlers are able to integrate observed data
with their previous beliefs, and their inferences are not only sensi-
tive to the content of a sample, but also whether that sample was
drawn randomly or selectively8. When preschoolers observe events
that cannot be explained by a known cause, they posit unobserved
variables to explain the event9,10, and selectively explain and explore
events that violate their causal theories11,12. Preschoolers can also
use the base rate of events to distinguish more and less probable
hypotheses13,14, isolate variables in a causal system to distinguish
candidate causes11, and integrate the evidence they observe with the
testimony they hear from knowledgeable sources15.
However, it is less clear to what extent young children have an
explicit understanding of the relationship between the evidence
they observe and the knowledge they will gain. Some studies sug-
gest that at least some precursors to metacognition emerge early.
Preschoolers correctly distinguish objects they can and cannot
name16, spend more time considering response options given unin-
formative versus informative task instructions17, selectively with-
hold answers1822, ask for help23 on items they struggle to remember
and show more pupillary dilation and give higher confidence rat-
ings to remembered items24,25, see also refs. 18,22,2628. However, young
children may know when they are more or less certain about infor-
mation without using this knowledge to increase opportunities for
learning. In self-paced learning tasks, 5-, 6- and 7-year-olds are more
confident about correctly than incorrectly remembered items but
only 6- and 7-year-olds accurately anticipate which items they will
have trouble learning and dedicate more study time to these items29.
Consistent with this, school-aged children often struggle with meta-
cognitive tasks in the context of test-taking: children are overcon-
fident in their memory30,31, and choose randomly when given the
chance to restudy test items rather than choosing on the basis of
their previous performance32. Introspective self-reports of knowl-
edge also improve from early school age to adolescence3337. Thus,
Foundations of intuitive power analyses in
children and adults
Madeline C. Pelz, Kelsey R. Allen, Joshua B. Tenenbaum and Laura E. Schulz  ✉
Decades of research indicate that some of the epistemic practices that support scientific enquiry emerge as part of intuitive
reasoning in early childhood. Here, we ask whether adults and young children can use intuitive statistical reasoning and meta-
cognitive strategies to estimate how much information they might need to solve different discrimination problems, suggesting
that they have some of the foundations for ‘intuitive power analyses’. Across five experiments, both adults (N= 290) and chil-
dren (N= 48, 6–8 years) were able to precisely represent the relative difficulty of discriminating populations and recognized
that larger samples were required for populations with greater overlap. Participants were sensitive to the cost of sampling, as
well as the perceptual nature of the stimuli. These findings indicate that both young children and adults metacognitively rep-
resent their own ability to make discriminations even in the absence of data, and can use this to guide efficient and effective
exploration.
NATURE HUMAN BEHAVIOUR | VOL 6 | NOVEMBER 2022 | 1557–1568 | www.nature.com/nathumbehav 1557
Content courtesy of Springer Nature, terms of use apply. Rights reserved
... In particular, research on decision making and education suggests that systematic, efficient, and effective information search and scientific reasoning emerge late in development, around age 10 or even later, in adolescence (e.g., Betsch et al., 2018;Davidson, 1996;Mata et al., 2011). In contrast, research from cognitive, developmental, and computational psychology has provided evidence that, when tested with age-appropriate methods, toddlers and preschoolers are capable of efficient and adaptive exploration and information search, and of designing informative interventions for causal learning (e.g., Cook et al., 2011;Pelz et al., 2022;Ruggeri et al., 2019;Swaboda et al., 2022). (For the sociodemographic characteristics of the studies reviewed herein, see Table S1.) ...
Article
Research on the development of active learning and information search behaviors has been growing rapidly, drawing interest from multiple disciplines, from developmental psychology to cognitive science and artificial intelligence. These different perspectives can open pathways to understanding how preschool‐age children grow into adaptive and efficient active learners. However, the lack of a shared vocabulary, operationalizations, and research paradigms has led to limited cross‐talk and some conflicting findings. In this article, we advocate for using a shared operationalization of a “good” information‐search strategy, as a function of its efficiency and effectiveness within a given ecology, based on the information‐theoretic measure of expected information gain and observed behavioral outcomes, respectively. We also discuss factors that should be considered when designing experiments that examine children's information‐search competence, specifically, using formal models as performance benchmarks and accounting for children's prior knowledge, assumptions, and self‐generated goals.
... Nevertheless, people generally have intuition and the ability to reason about data distributions of combinatoric contexts that they might never experience. In fact, cognitive science research shows that intuitive reasoning about statistical power analysis begins early in childhood [42]. ...
Preprint
Full-text available
Across a wide variety of domains, there exists a performance gap between machine learning models' accuracy on dataset benchmarks and real-world production data. Despite the careful design of static dataset benchmarks to represent the real-world, models often err when the data is out-of-distribution relative to the data the models have been trained on. We can directly measure and adjust for some aspects of distribution shift, but we cannot address sample selection bias, adversarial perturbations, and non-stationarity without knowing the data generation process. In this paper, we outline two methods for identifying changes in context that lead to distribution shifts and model prediction errors: leveraging human intuition and expert knowledge to identify first-order contexts and developing dynamic benchmarks based on desiderata for the data generation process. Furthermore, we present two case-studies to highlight the implicit assumptions underlying applied machine learning models that tend to lead to errors when attempting to generalize beyond test benchmark datasets. By paying close attention to the role of context in each prediction task, researchers can reduce context shift errors and increase generalization performance.
Article
Intuitive statistical inferences refer to making inferences about uncertain events based on limited probabilistic information, which is crucial for both human and non-human species’ survival and reproduction. Previous research found that 7- and 8-year-old children failed in intuitive statistical inference tasks after heuristic strategies had been controlled. However, few studies systematically explored children’s heuristic strategies of intuitive statistical inferences and their potential numerical underpinnings. In the current research, Experiment 1 (N = 81) examined 7- to 10-year-olds’ use of different types of heuristic strategies; results revealed that children relied more on focusing on the absolute number strategy. Experiment 2 (N = 99) and Experiment 3 (N = 94) added continuous-format stimuli to examine whether 7- and 8-year-olds could make genuine intuitive statistical inferences instead of heuristics. Results revealed that both 7- and 8-year-olds and 9- and 10-year-olds performed better in intuitive statistical inference tasks with continuous-format stimuli, even after focusing on the absolute number strategy had been controlled. The results across the three experiments preliminarily hinted that the ratio processing system might rely on the approximate number system. Future research could clarify what specific numerical processing mechanism may be used and how it might support children’s statistical intuitions.
Preprint
Full-text available
Background Efficient information sampling is crucial for human inference and decision-making, even for young children. Information sampling is also closely associated with the core symptoms of autism spectrum disorder (ASD), since both the social interaction difficulties and repetitive behaviors suggest that autistic people may sample information from the environment distinctively. Previous research on information sampling in ASD focused mainly on adolescents and adults, and on whether they over- or under-sample. The specific ways in which autistic children sample information, especially when facing explicit costs and adapting to environmental changes, remain unclear. Methods We employed an adapted bead task to investigate the sampling behavior of 24 autistic and 41 neurotypical children, matched for age and IQ. In each trial of our experiment, children gathered information about an unknown target isle by drawing samples from it and then guessed the target between two isles based on their samples. In conditions where sampling was costly, children needed to weigh the benefits of information against the costs of acquiring additional samples. Through computational modeling and intricate behavioral measures, we revealed how the two groups of children differed in sampling decisions and underlying cognitive mechanisms. Results Under conditions involving costs, autistic children showed less efficient sampling than their neurotypical peers. This inefficiency was due to their increased variability in the number of samples taken across trials rather than a systematic bias. Computational models indicated that while both groups shared a similar decision process, autistic children’s sampling decisions were less influenced by dynamic changes and more driven by recent evidence, thus leading to their increased sampling variation and reduced efficiency. Limitations To refine ASD subtyping and correlate symptom severity with behavioral characteristics and computational findings, future research may need larger participant groups and more comprehensive clinical assessments. Conclusions This study reveals an inefficiency of autistic children in information sampling and tracks down this inefficiency to their increased sampling variability, primarily due to their cognitive preference for more local and static information. These findings are consistent with several influential behavioral theories of ASD and highlight the needs of a multi-level understanding of cognitive flexibility in ASD.
Article
Full-text available
Effective curiosity-driven learning requires recognizing that the value of evidence for testing hypotheses depends on what other hypotheses are under consideration. Do we intuitively represent the discriminability of hypotheses? Here we show children alternative hypotheses for the contents of a box and then shake the box (or allow children to shake it themselves) so they can hear the sound of the contents. We find that children are able to compare the evidence they hear with imagined evidence they do not hear but might have heard under alternative hypotheses. Children (N = 160; mean: 5 years and 4 months) prefer easier discriminations (Experiments 1-3) and explore longer given harder ones (Experiments 4-7). Across 16 contrasts, children’s exploration time quantitatively tracks the discriminability of heard evidence from an unheard alternative. The results are consistent with the idea that children have an “intuitive psychophysics”: children represent their own perceptual abilities and explore longer when hypotheses are harder to distinguish. People can infer unobserved causes of perceptual data (e.g. the contents of a box from the sound made by shaking it). Here the authors show that children compare what they hear with what they would have heard given other causes, and explore longer when the heard and imagined sounds are hard to discriminate.
Article
Full-text available
It is often unclear which course of action gives the best outcome. We can reduce this uncertainty by gathering more information; but gathering information always comes at a cost. For example, a sports player waiting too long to judge a ball’s trajectory will run out of time to intercept it. Efficient samplers must therefore optimize a trade-off: when the costs of collecting further information exceed the expected benefits, they should stop sampling and start acting. In visually guided tasks, adults can make these trade-offs efficiently, correctly balancing any reductions in visuomotor uncertainty against cost factors associated with increased sampling. To investigate how this ability develops during childhood, we tested 6-11 year-olds, adolescents, and adults on a visual localization task in which the costs and benefits of sampling were formalized in a quantitative framework. This allowed us to compare participants to each other, and to an ideal observer who maximizes expected reward. Visual sampling became substantially more efficient between 6-11 years, converging onto adult performance in adolescence. Younger children systematically under-sampled information relative to the ideal observer and varied their sampling strategy more. Further analyses suggested that young children used a suboptimal decision rule that insufficiently accounted for the chance of task failure, in line with a late developing ability to compute with probabilities and costs. We therefore propose that late development of efficient information sampling, a crucial element of real-world decision-making under risk, may form an important component of sub-optimality in child perception, action, and decision-making
Article
Full-text available
When participants studied a list of paired associates for several study-test cycles, their judgments of learning (JOLs) exhibited relatively good calibration on the 1st cycle, with a slight overconfidence. However, a shift toward marked underconfidence occurred from the 2nd cycle on. This underconfidence-with-practice (UWP) effect was very robust across several experimental manipulations, such as feedback or no feedback regarding the correctness of the answer, self-paced versus fixed-rate presentation, different incentives for correct performance, magnitude and direction of associative relationships, and conditions producing different degrees of knowing. It was also observed both in item-by-item JOLs and in aggregate JOLs. The UWP effect also occurred for list learning and for the memory of action events. Several theoretical explanations for this counterintuitive effect are discussed.
Article
Full-text available
Prior research suggests that young children selectively inform others depending on others' knowledge states. Yet, little is known whether children selectively inform others depending on their own knowledge states. To explore this issue, we manipulated 3- to 4-year-old children's knowledge about the content of a box and assessed the impact on their decisions to inform another person. Moreover, we assessed the presence of uncertainty gestures while they inform another person in light of the suggestions that children's gestures reflect early developing, perhaps transient, epistemic sensitivity. Finally, we compared children's performance in the informing context to their explicit verbal judgment of their knowledge states to further confirm the existence of a performance gap between the two tasks. In their decisions to inform, children tend to accurately assess their ignorance, whereas they tend to overestimate their own knowledge states when asked to explicitly report them. Moreover, children display different levels of uncertainty gestures depending on the varying degrees of their informational access. These findings suggest that children's implicit awareness of their own ignorance may be facilitated in a social, communicative context.
Article
Full-text available
Given the overwhelming quantity of information available from the environment, how do young learners know what to learn about and what to ignore? We found that 11-month-old infants (N = 110) used violations of prior expectations as special opportunities for learning. The infants were shown events that violated expectations about object behavior or events that were nearly identical but did not violate expectations. The sight of an object that violated expectations enhanced learning and promoted information-seeking behaviors; specifically, infants learned more effectively about objects that committed violations, explored those objects more, and engaged in hypothesis-testing behaviors that reflected the particular kind of violation seen. Thus, early in life, expectancy violations offer a wedge into the problem of what to learn. Copyright © 2015, American Association for the Advancement of Science.
Article
Full-text available
Given the overwhelming quantity of information available from the environment, how do young learners know what to learn about and what to ignore? We found that 11-month-old infants (N = 110) used violations of prior expectations as special opportunities for learning. The infants were shown events that violated expectations about object behavior or events that were nearly identical but did not violate expectations. The sight of an object that violated expectations enhanced learning and promoted information-seeking behaviors; specifically, infants learned more effectively about objects that committed violations, explored those objects more, and engaged in hypothesis-testing behaviors that reflected the particular kind of violation seen. Thus, early in life, expectancy violations offer a wedge into the problem of what to learn.
Article
Full-text available
Little is known about the mechanisms underlying a ubiquitous behavior in preschoolers, help-seeking. We tested the hypothesis that preschoolers' awareness of their own uncertainty is associated with help-seeking. Three-, 4-, and 5-year-olds (N = 125) completed a perceptual identification task twice: once independently and once when they could request help from a confederate whose competence level was manipulated. Consistent with our hypothesis, participants sought help more frequently on trials for which, when required to answer independently, they expressed lower confidence. Children in the bad-helper condition were slower to respond after receiving help than those in the good-helper condition. Finally, females and children with more advanced theory of mind were more likely to seek help, identifying additional factors that relate to help-seeking. © 2014 John Wiley & Sons Ltd.
Article
Many decisions are based on beliefs concerning the likelihood of uncertain events such as the outcome of an election, the guilt of a defendant, or the future value of the dollar. Occasionally, beliefs concerning uncertain events are expressed in numerical form as odds or subjective probabilities. In general, the heuristics are quite useful, but sometimes they lead to severe and systematic errors. The subjective assessment of probability resembles the subjective assessment of physical quantities such as distance or size. These judgments are all based on data of limited validity, which are processed according to heuristic rules. However, the reliance on this rule leads to systematic errors in the estimation of distance. This chapter describes three heuristics that are employed in making judgments under uncertainty. The first is representativeness, which is usually employed when people are asked to judge the probability that an object or event belongs to a class or event. The second is the availability of instances or scenarios, which is often employed when people are asked to assess the frequency of a class or the plausibility of a particular development, and the third is adjustment from an anchor, which is usually employed in numerical prediction when a relevant value is available.
Article
A fundamental task of language acquisition is to extract abstract algebraic rules. Three experiments show that 7-month-old infants attend longer to sentences with unfamiliar structures than to sentences with familiar structures. The design of the artificial language task used in these experiments ensured that this discrimination could not be performed by counting, by a system that is sensitive only to transitional probabilities, or by a popular class of simple neural network models. Instead, these results suggest that infants can represent, extract, and generalize abstract algebraic rules.