Ansgar D. Endress’s research while affiliated with University of London and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (48)


Table 3 continued
Demographics of the full sample and the restricted sample, where those participants were excluded whose accuracy on word vs. part-words trials did not exceed 50%
Design of the actual words and phantom-words Phantom-word: ABC
Descriptives of accuracy scores and difference scores for the main sample
Transitional probabilities outweigh frequency of occurrence in statistical learning of simultaneously presented visual shapes
  • Article
  • Full-text available

December 2024

·

2 Reads

Memory & Cognition

Ansgar D Endress

Statistical learning is a mechanism for detecting associations among co-occurring elements in many domains and species. A key controversy is whether it leads to memory for discrete chunks composed of these associated elements, or merely to pairwise associations among elements. Critical evidence for the mere-association view comes from the “phantom-word” phenomenon, where learners recognize statistically coherent but unattested items better than actually presented items with weaker internal associations, suggesting that they prioritize pairwise associations over memories for discrete units. However, this phenomenon has only been demonstrated for sequentially presented stimuli, but not for simultaneously presented visual shapes, where learners might prioritize discrete units over pairwise associations. Here, I ask whether the phantom-word phenomenon can be observed with simultaneously presented visual shapes. Learners were familiarized with scenes combining two triplets of visual shapes (hereafter “words”). They were then tested on their recognition of these words vs. part-words (attested items with weaker internal associations), of phantom-words (unattested items with strong internal associations) vs. part-words, and of words vs. phantom-words. Learners preferred both words and phantom-words over part-words and showed no preference for words over phantom-words. This suggests that, as for sequentially input, statistical learning in simultaneously presented shapes leads primarily to pairwise associations rather than to memories for discrete chunks. However, as, in some analyses, the preference for words over part-words was slightly higher than for phantom-words over part-words, the results do not rule out that, for simultaneous presented items, learners might have some limited sensitivity to frequency of occurrence.

Download

Hebbian learning can explain rhythmic neural entrainment to statistical regularities

February 2024

·

31 Reads

·

2 Citations

In many domains, learners extract recurring units from continuous sequences. For example, in unknown languages, fluent speech is perceived as a continuous signal. Learners need to extract the underlying words from this continuous signal and then memorize them. One prominent candidate mechanism is statistical learning, whereby learners track how predictive syllables (or other items) are of one another. Syllables within the same word predict each other better than syllables straddling word boundaries. But does statistical learning lead to memories of the underlying words—or just to pairwise associations among syllables? Electrophysiological results provide the strongest evidence for the memory view. Electrophysiological responses can be time‐locked to statistical word boundaries (e.g., N400s) and show rhythmic activity with a periodicity of word durations. Here, I reproduce such results with a simple Hebbian network. When exposed to statistically structured syllable sequences (and when the underlying words are not excessively long), the network activation is rhythmic with the periodicity of a word duration and activation maxima on word‐final syllables. This is because word‐final syllables receive more excitation from earlier syllables with which they are associated than less predictable syllables that occur earlier in words. The network is also sensitive to information whose electrophysiological correlates were used to support the encoding of ordinal positions within words. Hebbian learning can thus explain rhythmic neural activity in statistical learning tasks without any memory representations of words. Learners might thus need to rely on cues beyond statistical associations to learn the words of their native language. Research Highlights Statistical learning may be utilized to identify recurring units in continuous sequences (e.g., words in fluent speech) but may not generate explicit memory for words. Exposure to statistically structured sequences leads to rhythmic activity with a period of the duration of the underlying units (e.g., words). I show that a memory‐less Hebbian network model can reproduce this rhythmic neural activity as well as putative encodings of ordinal positions observed in earlier research. Direct tests are needed to establish whether statistical learning leads to declarative memories for words.


In defense of epicycles: Embracing complexity in psychological explanations

December 2022

·

12 Reads

Mind & Language

Is formal simplicity a guide to learning in humans, as simplicity is said to be a guide to the acceptability of theories in science? Does simplicity determine the difficulty of various learning tasks? I argue that, similarly to how scientists sometimes preferred complex theories when this facilitated calculations, results from perception, learning and reasoning suggest that formal complexity is generally unrelated to what is easy to learn and process by humans, and depends on assumptions about available representational and processing primitives. “Simpler” hypotheses are preferred only when they are also easier to process. Historically, “simpler”, easier‐to‐process, scientific theories might also be preferred if they are transmitted preferentially. Empirically viable complexity measures should build on the representational and processing primitives of actual learners, even if explanations of their behaviour become formally more complex.


Hebbian, correlational learning provides a memory-less mechanism for Statistical Learning irrespective of implementational choices: Reply to Tovar and Westermann (2022)

October 2022

·

17 Reads

·

2 Citations

Cognition

Statistical learning relies on detecting the frequency of co-occurrences of items and has been proposed to be crucial for a variety of learning problems, notably to learn and memorize words from fluent speech. Endress and Johnson (2021) (hereafter EJ) recently showed that such results can be explained based on simple memory-less correlational learning mechanisms such as Hebbian Learning. Tovar and Westermann (2022) (hereafter TW) reproduced these results with a different Hebbian model. We show that the main differences between the models are whether temporal decay acts on both the connection weights and the activations (in TW) or only on the activations (in EJ), and whether interference affects weights (in TW) or activations (in EJ). Given that weights and activations are linked through the Hebbian learning rule, the networks behave similarly. However, in contrast to TW, we do not believe that neurophysiological data are relevant to adjudicate between abstract psychological models with little biological detail. Taken together, both models show that different memory-less correlational learning mechanisms provide a parsimonious account of Statistical Learning results. They are consistent with evidence that Statistical Learning might not allow learners to learn and retain words, and Statistical Learning might support predictive processing instead.



Fig. 1 Likelihood ratio in favor of the hypothesis that a behavior is representative of a population (rather than an exception) as a function of the prior frequency of exceptions (í µí¼–), the type frequency (T) and the ratio α/β (facets)
Fig. 2 First-party acceptability as a function of valency and type frequency. The contours represent the distribution of responses, the dots the sample averages, and the error bars the standard deviations
Fig. 4 Behavior prevalence as a function of valency, type frequency, and question generality (gendered vs. general question). The contours represent the distribution of responses, the dots the sample averages, and the error bars the standard deviations
Generic learning mechanisms can drive social inferences: The role of type frequency

April 2022

·

20 Reads

Memory & Cognition

How do we form opinions about typical and morally acceptable behavior in other social groups despite variability in behavior? Similar learning problems arise during language acquisition, where learners need to infer grammatical rules (e.g., the walk/walk-ed past-tense) despite frequent exceptions (e.g., the go/went alternation). Such rules need to occur with many different words to be learned (i.e., they need a high type frequency ). In contrast, frequent individual words do not lead to learning. Here, we ask whether similar principles govern social learning. Participants read a travel journal where a traveler observed behaviors in different imaginary cities. The behaviors were performed once by many distinct actors (high type frequency) or frequently by a single actor (low type frequency), and could be good, neutral or bad. We then asked participants how morally acceptable the behavior was (in general or for the visited city), and how widespread it was in that city. We show that an ideal observer model estimating the prevalence of behaviors is only sensitive to the behaviors’ type frequency, but not to how often they are performed. Empirically, participants rated high type frequency behaviors as more morally acceptable more prevalent than low type frequency behaviors. They also rated good behaviors as more acceptable and prevalent than neutral or bad behaviors. These results suggest that generic learning mechanisms and epistemic biases constrain social learning, and that type frequency can drive inferences about groups. To combat stereotypes, high type frequency behaviors might thus be more effective than frequently appearing individual role models.


Socio-Cultural Values Are Risk Factors for COVID-19-Related Mortality

February 2022

·

16 Reads

·

6 Citations

Cross-Cultural Research

To assess whether socio-cultural values are population-level risk factors for health, I sought to predict COVID-19-related mortality between 2 weeks and 6 months after the first COVID-19-related death in a country based on values extracted from the World Values Survey for different country sets, after controlling for various confounding variables. COVID-19-related mortality was increased in countries endorsing political participation but decreased in countries with greater trust in institutions and materialistic orientations. The values were specific to COVID-19-related mortality, did not predict general health outcomes, and values predicting increased COVID-19-related mortality predicted decreased mortality from other outcomes (e.g., environmental-related mortality).


Memory and Proactive Interference for spatially distributed items

February 2022

·

39 Reads

·

7 Citations

Memory & Cognition

Our ability to briefly retain information is often limited. Proactive Interference (PI) might contribute to these limitations (e.g., when items in recognition tests are difficult to reject after having appeared recently). In visual Working Memory (WM), spatial information might protect WM against PI, especially if encoding items together with their spatial locations makes item-location combinations less confusable than simple items without a spatial component. Here, I ask (1) if PI is observed for spatially distributed items, (2) if it arises among simple items or among item-location combinations, and (3) if spatial information affects PI at all. I show that, contrary to views that spatial information protects against PI, PI is reliably observed for spatially distributed items except when it is weak. PI mostly reflects items that appear recently or frequently as memory items, while occurrences as test items play a smaller role, presumably because their temporal context is easier to encode. Through mathematical modeling, I then show that interference occurs among simple items rather than item-location combinations. Finally, to understand the effects of spatial information, I separate the effects of (a) the presence and (b) the predictiveness of spatial information on memory and its susceptibility to PI. Memory is impaired when items are spatially distributed, but, depending on the analysis, unaffected by the predictiveness of spatial information. In contrast, the susceptibility to PI is unaffected by either manipulation. Visual memory is thus impaired by PI for spatially distributed items due to interference from recent memory items (rather than test items or item-location combinations).


When forgetting fosters learning: A neural network model for statistical learning

February 2021

·

30 Reads

·

32 Citations

Cognition

Learning often requires splitting continuous signals into recurring units, such as the discrete words constituting fluent speech; these units then need to be encoded in memory. A prominent candidate mechanism involves statistical learning of co-occurrence statistics like transitional probabilities (TPs), reflecting the idea that items from the same unit (e.g., syllables within a word) predict each other better than items from different units. TP computations are surprisingly flexible and sophisticated. Humans are sensitive to forward and backward TPs, compute TPs between adjacent items and longer-distance items, and even recognize TPs in novel units. We explain these hallmarks of statistical learning with a simple model with tunable, Hebbian excitatory connections and inhibitory interactions controlling the overall activation. With weak forgetting, activations are long-lasting, yielding associations among all items; with strong forgetting, no associations ensue as activations do not outlast stimuli; with intermediate forgetting, the network reproduces the hallmarks above. Forgetting thus is a key determinant of these sophisticated learning abilities. Further, in line with earlier dissociations between statistical learning and memory encoding, our model reproduces the hallmarks of statistical learning in the absence of a memory store in which items could be placed.


Statistical learning and memory

November 2020

·

66 Reads

·

9 Citations

Cognition

Learners often need to identify and remember recurring units in continuous sequences, but the underlying mechanisms are debated. A particularly prominent candidate mechanism relies on distributional statistics such as Transitional Probabilities (TPs). However, it is unclear what the outputs of statistical segmentation mechanisms are, and if learners store these outputs as discrete chunks in memory. We critically review the evidence for the possibility that statistically coherent items are stored in memory and outline difficulties in interpreting past research. We use Slone and Johnson's (2018) experiments as a case study to show that it is difficult to delineate the different mechanisms learners might use to solve a learning problem. Slone and Johnson (2018) reported that 8-month-old infants learned coherent chunks of shapes in visual sequences. Here, we describe an alternate interpretation of their findings based on a multiple-cue integration perspective. First, when multiple cues to statistical structure were available, infants' looking behavior seemed to track with the strength of the strongest one — backward TPs, suggesting that infants process multiple cues simultaneously and select the strongest one. Second, like adults, infants are exquisitely sensitive to chunks, but may require multiple cues to extract them. In Slone and Johnson's (2018) experiments, these cues were provided by immediate chunk repetitions during familiarization. Accordingly, infants showed strongest evidence of chunking following familiarization sequences in which immediate repetitions were more frequent. These interpretations provide a strong argument for infants' processing of multiple cues and the potential importance of multiple cues for chunk recognition in infancy.


Citations (40)


... For a critical discussion of the evidence supporting the memory view as well as alternative interpretations thereof, see Endress and de Seyssel (under review) and Endress et al. (2020). Support for the mere-association view comes from several key observations, including computational modeling of behavioral and electrophysiological statistical learning results with memory-less Hebbian mechanisms (Endress & Johnson, 2021;Endress, 2024), and an almost complete inability to consciously recall statistical defined items such as words even when their statistical structure has been demonstrably learned (Batterink, 2020;Endress & de Seyssel, under review). ...

Reference:

Transitional probabilities outweigh frequency of occurrence in statistical learning of simultaneously presented visual shapes
Hebbian learning can explain rhythmic neural entrainment to statistical regularities
  • Citing Article
  • February 2024

... Regularity extraction, i.e., the learning processes underlying MMN elicitation include many facets. One of the facets is statistical learning, which is a diverse topic on its own that is covered by theories from cognitive psychology on various cognitive abilities in which the statistics of (co-)occurrences of events matter such as language acquisition (Saffran et al., 1996;Thiessen, 2017;Frost et al., 2019;Conway, 2020;Endress and Johnson, 2023). The Markovian view on the MMN we discuss in the present paper naturally is a simplification and it disregards many aspects of statistical learning. ...

Hebbian, correlational learning provides a memory-less mechanism for Statistical Learning irrespective of implementational choices: Reply to Tovar and Westermann (2022)
  • Citing Article
  • October 2022

Cognition

... 3,13,14,15 Several studies indicate that social distancing is one of the most effective public health measures that was highly recommended to avoid contracting COVID-19 at its inception before the discovery of vaccines. 20 This study is in line with several other social and epidemiological studies, which show that social networks and social contact patterns have a major influence on the swift spread of contagious diseases, as in the case of the COVID-19 pandemic. 1,21,22 Sociocultural practices, whose precondition is sociality and collective action, become the determining factor in whether public health measures to prevent the spread of COVID-19 are successful or not. ...

Socio-Cultural Values Are Risk Factors for COVID-19-Related Mortality
  • Citing Article
  • February 2022

Cross-Cultural Research

... This enables our students to store item-location combinations, giving them spatial cues for memory items in addition to the semantic cues. With practice, they can potentially bind all items together in a configuration and reduce proactive interference-when older memories interfere with the retrieval of newer, similar information-for other recently studied items [62]. Location and gaze direction can be an additional cue to memories when multiple memories compete [62][63][64]. ...

Memory and Proactive Interference for spatially distributed items

Memory & Cognition

... For a critical discussion of the evidence supporting the memory view as well as alternative interpretations thereof, see Endress and de Seyssel (under review) and Endress et al. (2020). Support for the mere-association view comes from several key observations, including computational modeling of behavioral and electrophysiological statistical learning results with memory-less Hebbian mechanisms (Endress & Johnson, 2021;Endress, 2024), and an almost complete inability to consciously recall statistical defined items such as words even when their statistical structure has been demonstrably learned (Batterink, 2020;Endress & de Seyssel, under review). ...

When forgetting fosters learning: A neural network model for statistical learning
  • Citing Article
  • February 2021

Cognition

... However, the question of whether statistical learning truly facilitates the memorization of these units is controversial. An alternative view proposes that statistical learning primarily supports the formation of pairwise associations among co-occurring elements (e.g., syllables) rather than the memorization of units (Endress & de Seyssel, under review;Endress, Slone, & Johnson, 2020). ...

Statistical learning and memory
  • Citing Article
  • November 2020

Cognition

... For instance, many explanations for visual WM capacity limits appeal to competition in sensory networks with strong spatial and/or feature organization (e.g., Schneegans, Taylor, & Bays, 2020). Analogous behavioral constraints in modalities that lack such organization inspire questions as to which structural features impose capacity limits and whether WM in chemical senses might show the same flexibility as other modalities-for instance, in sensitivity to factors like chunking, presentation format, or serial biases (Chung, Brady, & Störmer, 2024;Wang & Alais, 2024;Endress & Szabó, 2020;Lau & Maus, 2019). Comparisons between modalities also strike at broader questions about the extent to which processing limits arise via modalityspecific competitive interactions or shared central bottlenecks (Pashler, 1984;Baddeley & Hitch, 1974;Broadbent, 1958) and how this issue impacts current theoretical explanations of WM capacity limits (e.g., Wennberg & Serences, 2024;Bouchacourt & Buschman, 2019). ...

Sequential Presentation Protects Working Memory From Catastrophic Interference
  • Citing Article
  • May 2020

Cognitive Science A Multidisciplinary Journal

... For example, when typically adult readers were exposed to a grammar containing sequential dependencies vs. patterns of repeating elements, they succeeded in generalizing the rules learned to new items only when grammar contained repetition-based rules and failed when the sequences contained nonrepetitive patterns (Gomez, Gerken, & Schvaneveldt, 2000; see also Kahta & Schiff, 2019). These pieces of evidence further suggested that the ability to detect and generalize repetition-based rules, such as ABB and ABA (also known as mini-grammars), constitutes a building block of the ability to identify more complex rule-based patterns, like those required to acquire the grammar structures of language (Endress, 2020;Endress, Dehaene-Lambertz, & Mehler, 2007;Lotz & Kinder, 2006). ...

A Simple, Biologically Plausible Feature Detector for Language Acquisition

... Finally, we consider it essential to note that a dual-process account faces some limitations and reasonable criticisms. First, it must be stressed that dual-process theory as a framework aims to explain human behavior in terms of two processes and their interactions at a domain-general level, with no specific predictions for particular domains (Endress, 2019). It remains an open question whether dual-process theory can explain human behavior in situations that require domain-specific decisions, like those in the context of sport. ...

Duplications and Domain-Generality

... Recent studies have examined the effects of MOT training on working memory utilizing diverse experimental paradigms and assessment methods. Comparisons between a MOT training group and a control group led to the discovery that MOT training significantly improves the capacity and precision of working memory [46,47]. The ability of subjects' working memory can be effectively enhanced through MOT training. ...

Category-based grouping in working memory and multiple object tracking

Visual Cognition