Simon W. Townsend’s research while affiliated with University of Zurich and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (95)


Figure 3: Transition network based on the concurrence of the different time-windows (i.e., 434 transitions that are significant in at least three out of four time-windows) including cluster 435 assignment based on 'optimal clustering' algorithm and loops. 436 437 When comparing the similarity of which transitions were significant in the different time-438
Figure 4: Network of significant transitions that are shared between the rapid-fire sequences 446 and solitary gesture sequences (left, red), occur only in the rapid-fire sequences (centre, green) 447 or only in the solitary-gestures-with-waiting sequences (right, blue). Significant transitions are 448 marked by edges between gesture actions. Loops indicate that a gesture action significantly 449 follows itself. 450 451 From Figure 4, we see that those transitions that were specific to rapid-fire sequences were 452
Figure 5: Entropy ratio values of conditional probabilities with antecedents of one unit. Colours 491 indicate the time-window. Entropy ratios (observed/expected entropy) closer to 0 are considered 492 more predictable than random assignment, 1 or above are random distributions. 493 494
Figure 6: Accuracy values of Naive Bayes classifier (correct predictions/all predictions), 514 predicting the consequent units based on antecedents of one unit. Colours indicate the time-515 window. 516 517
Figure 9: Comparison of unit-level prediction accuracy for the Naive Bayes classifier for each 578 gesture -prediction based on the single antecedent B in blue, for the two-antecedent AB 579 combination in red, with the difference on the right. Where only one dot is visible, the predictive 580 power was the same. 581 582 583 584 585 586 587 588 589 590 591

+3

Predictability of next elements in chimpanzee gesture sequences
  • Preprint
  • File available

November 2024

·

43 Reads

Alexander Mielke

·

·

·

[...]

·

Catherine Hobaiter

Recent research has produced evidence for basic combinatorial abilities in the vocal systems of different animal species. Here, we investigate the structure of gesture sequences in Eastern chimpanzees ( Pan troglodytes schweinfurthii ) to detect whether gestural communication shows non-random combinations and how combinatorial rules influence predictability. Gesture, as compared to vocalization, offers greater flexibility in how signals are combined—for example overlapping in time — and as the parsing of signals into sequences is dependent on researcher decisions, we employ a multiverse approach, considering four different definitions of what constitutes a ‘sequence’ based on varying time thresholds. Our results indicate that sequences tend to be short (even with the most liberal time-window) and that transitions between some gesture types occur more frequently than expected by chance, with some transitions showing significant association across all time-windows. These transitions often involve repetition, suggesting persistence as a key aspect of chimpanzee gestural sequences. Information about previous gestures reduced uncertainty in predicting subsequent gestures. The order of gestures within sequences appears to be less critical than their cooccurrence, challenging assumptions based on the linear patterning derived from vocal communication. Our findings highlight the importance of methodological choices in sequence definition and suggest that chimpanzee gestural communication is characterised by a mix of predictability and flexibility, with implications for understanding the evolution of complex communication systems.

Download

Beyond bigrams: call sequencing in the common marmoset (Callithrix jacchus) vocal system

November 2024

·

12 Reads

Over the last two decades, an emerging body of research has demonstrated that non-human animals exhibit the ability to combine context-specific calls into larger sequences. These structures have frequently been compared with language’s syntax, whereby linguistic units are combined to form larger structures, and leveraged to argue that syntax might not be unique to language. Currently, however, the overwhelming majority of examples of call combinations are limited to simple sequences comprising just two calls which differ dramatically from the open-ended hierarchical structuring of the syntax found in language. We revisit this issue by taking a whole-repertoire approach to investigate combinatoriality in common marmosets (Callithrix jacchus). We use Markov chain models to quantify the vocal sequences produced by marmosets providing evidence for structures beyond the bigram, including three-call and even combinations of up to eight or nine calls. Our analyses of these longer vocal sequences are suggestive of potential further internal organization, including some amount of recombination, nestedness and non-adjacent dependencies. We argue that data-driven, whole-repertoire analyses are fundamental to uncovering the combinatorial complexity of non-human animals and will further facilitate meaningful comparisons with language’s combinatoriality.


Stimuli and experimental design
(A) The pitch contours of 6 acoustic categories. (B) Experimental design. In the Learning phase, standard triplets were repeated 60 times. In the Test phase, 2 types of target trials (Correct and Incorrect conditions) were separated by a jittering baseline condition in which the same standard triplets with the Learning phase were presented. (C) Measurement phases for 2 experiments.
The statistically significant effects in the channel-by-channel permutation t test analysis
(A) Experiment 1: neonates. (B) Experiment 2: 6- to 7-month-olds. Left, middle, and right panels show brain regions that were activated significantly for 3 condition contrasts of Correct vs. Baseline, Incorrect vs. Baseline, and Incorrect vs. Correct, respectively. The circles denote significant channels (solid circles: permutation p < 0.01; dotted circles: permutation p < 0.05). The data underlying this figure can be found at https://osf.io/84yu9/.
The FC patterns in Experiment 1: Neonates
(A) The most significantly increased FCs for 3 phases against a respective zero baseline. Different colors and line thickness indicate different thresholds. Both darker color and thicker line indicate the more stringent threshold. (B) The FCs with significant changes for the contrasts of Learning phase vs. Pre-Rest phase, and Post-Rest phase vs. Pre-Rest phase. The data underlying this figure can be found at https://osf.io/84yu9/. FC, functional connectivity.
Functional brain network underlying NAD learning in Experiment 1: Neonates
(A) Relations between prefrontal activation during the Test phase (learning success) and FCs with significant changes from the Pre-Rest phase to the Learning phase (FCs during learning). Seed channels are indicated by circles with 6 different colors and non-seed channels indicated by black circles. The lines with 6 different colors represent FCs displaying significantly negative correlation with activations in seed channels. (B) The negative correlation between activation in Ch 34 and FC strength (Ch 9–Ch 34; Ch 2–Ch 9). The blue dots represent data from 15 neonates. (C) 2D and 3D maps of the learning-related brain network, where the strength of all FCs was negatively correlated with activation in the 6 seed channels. (D) Relations between FC strength during the Pre-Rest phase and FC strength changes from the Pre-Rest phase to the Learning phase (Learning minus Pre-Rest). Blue lines indicate negative correlations. (E) The scatter plots from 2 canonical negative correlations in Fig 4D. The blue dots represent data from 15 neonates. The data underlying this figure can be found at https://osf.io/84yu9/. FC, functional connectivity; NAD, nonadjacent dependency.
Functional reorganization of brain regions supporting artificial grammar learning across the first half year of life

October 2024

·

53 Reads

Pre-babbling infants can track nonadjacent dependencies (NADs) in the auditory domain. While this forms a crucial prerequisite for language acquisition, the neurodevelopmental origins of this ability remain unknown. We applied functional near-infrared spectroscopy in neonates and 6- to 7-month-old infants to investigate the neural substrate supporting NAD learning and detection using tone sequences in an artificial grammar learning paradigm. Detection of NADs was indicated by left prefrontal activation in neonates while by left supramarginal gyrus (SMG), superior temporal gyrus (STG), and inferior frontal gyrus activation in 6- to 7-month-olds. Functional connectivity analyses further indicated that the neonate activation pattern during the test phase benefited from a brain network consisting of prefrontal regions, left SMG and STG during the rest and learning phases. These findings suggest a left-hemispheric learning-related functional brain network may emerge at birth and serve as the foundation for the later engagement of these regions for NAD detection, thus, providing a neural basis for language acquisition.


Seeds of language-like generativity in bird call combinations

Language is unbounded in its generativity, enabling the flexible combination of words into novel sentences. Critically, these constructions are intelligible to others due to our ability to derive a sentence’s compositional meaning from the semantic relationships among its components. Some animals also concatenate meaningful calls into compositional-like combinations to communicate more complex information. However, these combinations are structurally highly stereotyped, suggesting a bounded system of holistically perceived signals that impedes the processing of novel variants. Using long-term data and playback experiments on pied babblers, we demonstrate that, despite production stereotypy, they can nevertheless process structurally modified and novel combinations of their calls, demonstrating a capacity for deriving meaning compositionally. Furthermore, differential responses to artificial combinations by fledglings suggest that this compositional sensitivity is acquired ontogenetically. Our findings demonstrate animal combinatorial systems can be flexible at the perceptual level and that such perceptual flexibility may represent a precursor of language-like generativity.


Raw data illustrating variation in the number of NVBs (non-vocal behaviours) produced at significantly above chance frequencies with the different call types analysed in this study. Boxes indicate the inter quartile range (IQR), the central line depicts the median and the whiskers extend to the maximum and minimum values excluding outliers. Circles represent outliers, and crosses represent means for each call type
Full list of NVBs (non-vocal behaviour) annotated in this study with corresponding behavioural description used to assign NVBs. The term “specific individual” refers to the individual who is closest to the signaler
Illustration of procedure for entering each communication event into a suitable dataset for implementing the multiple-NVBs collocation analysis
List of 31 single NVBs (non-vocal behaviour) and 7 call types included in this analysis. Colour codes denote strength of attraction/repulsion between NVBs and each call type: darkest green = strongest attraction, darkest red = strongest repulsion. All values above 1.3 represent co-occurrence at above-chance level with 95% confidence interval, while values below − 1.3 represent significant repulsion between components
All combinations of vocal and visual components that were found to co-occur more frequently than expected by chance
Vocal-visual combinations in wild chimpanzees

October 2024

·

76 Reads

Behavioral Ecology and Sociobiology

Living organisms throughout the animal kingdom habitually communicate with multi-modal signals that use multiple sensory channels. Such composite signals vary in their communicative function, as well as the extent to which they are recombined freely. Humans typically display complex forms of multi-modal communication, yet the evolution of this capacity remains unknown. One of our two closest living relatives, chimpanzees, also produce multi-modal combinations and therefore may offer a valuable window into the evolutionary roots of human communication. However, a currently neglected step in describing multi-modal systems is to disentangle non-random combinations from those that occur simply by chance. Here we aimed to provide a systematic quantification of communicative behaviour in our closest living relatives, describing non-random combinations produced across auditory and visual modalities. Through recording the behaviour of wild chimpanzees from the Kibale forest, Uganda we generated the first repertoire of non-random combined vocal and visual components. Using collocation analysis, we identified more than 100 vocal-visual combinations which occurred more frequently than expected by chance. We also probed how multi-modal production varied in the population, finding no differences in the number of visual components produced with vocalisations as a function of age, sex or rank. As expected, chimpanzees produced more visual components alongside vocalizations during longer vocalization bouts, however, this was only the case for some vocalization types, not others. We demonstrate that chimpanzees produce a vast array of combined vocal and visual components, exhibiting a hitherto underappreciated level of multi-modal complexity.


The locations of the Kokolopori and Luikotale field sites on a map of the Democratic Republic of Congo
Spectograms depicting whistle-high hoots (W + HHs) produced the Kokolopori field site (left) and the LuiKotale field site (right)
The proportion of whistle-high hoot (W + HH) call combinations produced in each of four contexts (arrival, feed, rest, and travel) among two populations of bonobos
Cross-population variation in usage of a call combination: evidence of signal usage flexibility in wild bonobos

August 2024

·

25 Reads

Animal Cognition

The arbitrary relationship between signifier and signified is one of the features responsible for language’s extreme lability, adaptability, and expressiveness. Understanding this arbitrariness and its emergence is essential in any account of the evolution of language. To shed light on the phylogeny of the phenomenon, comparative data examining the relationship between signal form and function in the communication systems of non-humans is central. Here we report the results of a study on the production and usage the whistle-high hoot call combination (W + HH) from two distant populations of wild bonobos (Pan paniscus): Lui Kotale, DRC, and Kokolopori, DRC. We find that the context in which bonobos produce the W + HHs varies systematically between populations. Our results suggest that variation in W + HH production may represent an example of signal-adjustment optionality, a key component of arbitrariness.



Call combination production is linked to the social environment in Western Australian magpies (Gymnorhina tibicen dorsalis)

May 2024

·

50 Reads

·

1 Citation

It has recently become clear that some language-specific traits previously thought to be unique to humans (such as the capacity to combine sounds) are widespread in the animal kingdom. Despite the increase in studies documenting the presence of call combinations in non-human animals, factors promoting this vocal trait are unclear. One leading hypothesis proposes that communicative complexity co-evolved with social complexity owing to the need to transmit a diversity of information to a wider range of social partners. The Western Australian magpie (Gymnorhina tibicen dorsalis) provides a unique model to investigate this proposed link because it is a group-living, vocal learning species that is capable of multi-level combinatoriality (independently produced calls contain vocal segments and comprise combinations). Here, we compare variations in the production of call combinations across magpie groups ranging in size from 2 to 11 birds. We found that callers in larger groups give call combinations: (i) in greater diversity and (ii) more frequently than callers in smaller groups. Significantly, these observations support the hypothesis that combinatorial complexity may be related to social complexity in an open-ended vocal learner, providing an important step in understanding the role that sociality may have played in the development of vocal combinatorial complexity. This article is part of the theme issue ‘The power of sound: unravelling how acoustic communication shapes group dynamics’.


Fig 1. Stimuli and experimental design. (A) The pitch contours of six acoustic categories. (B) Experimental design. In the Learning phase, standard triplets were repeated 60 times. In the Test phase, 2 types of target trials (Correct and Incorrect conditions) were separated by a jittering baseline condition in which the same standard triplets with the Learning phase were presented. (C) Measurement phases for two experiments.
Fig 3. The FC patterns in Experiment 1: neonates. (A) The most significantly increased FCs for three phases against a respective zero baseline. Different colors and line thickness indicate different thresholds. Both darker color and thicker line indicate the more stringent threshold. (B) The FCs with significant changes for the contrasts of Learning phase vs. Pre-Rest phase, and Post-Rest phase vs. Pre-Rest phase.
Fig 4. Functional brain network underlying NAD learning in Experiment 1: neonates. (A) Relations between prefrontal activation during the test phase (learning
Functional reorganization of brain regions supporting non-adjacent dependency learning across the first half year of life

April 2024

·

70 Reads

Pre-babbling infants can track nonadjacent dependencies (NADs) in the auditory domain. While this forms a crucial prerequisite for language acquisition, the neurodevelopmental origins of this ability remain unknown. We applied functional nearinfrared spectroscopy in neonates and 6-7-month-old infants to investigate the neural substrate supporting NAD learning using tone sequences in an artificial grammar learning paradigm. Detection of NADs was indicated by left prefrontal activation in neonates while by left supramarginal gyrus (SMG), superior temporal gyrus (STG), and inferior frontal gyrus activation in 6-7-month-olds. Functional connectivity analyses further indicated that the neonate activation pattern during the test phase benefited from a brain network consisting of prefrontal regions, left SMG and STG during the rest and learning phases. These findings suggest a left-hemispheric learning-related functional brain network may emerge at birth and be strengthened by complex auditory input across the first half year of life, providing a neural basis for language acquisition.


Vocal-visual combinations in wild chimpanzees

May 2023

·

58 Reads

Human communication is strikingly multi-modal, relying on vocal utterances combined with visual gestures, facial expressions and more. Recent efforts to describe multi-modal signal production in our ape relatives have shed important light on the evolutionary trajectory of this core hallmark of human language. However, whilst promising, a systematic quantification of primate signal production which filters out random combinations produced across modalities is currently lacking. Here, through recording the communicative behaviour of wild chimpanzees from the Kibale forest, Uganda we address this issue and generate the first repertoire of non-random combined vocal and visual components. Using collocation analysis, we identify more than 100 vocal-visual combinations which occur more frequently than expected by chance. We also probe how multi-modal production varies in the population, finding no differences between individuals as a function of age, sex or rank. The number of visual components exhibited alongside vocalizations was, however, associated with vocalization type and duration. We demonstrate that chimpanzees produce a vast array of combined vocal and visual components, exhibiting a hitherto underappreciated level of combinatorial complexity. We conclude that a multi-modal approach is crucial to accurately representing the communicative abilities of non-human primates.


Citations (67)


... Walsh et al. [47] study the link between vocal and social complexity. The Social Complexity Hypothesis for Communicative Complexity states that social and vocal complexity should coevolve, and higher vocal complexity should therefore be found in more complex social environments [17]. ...

Reference:

The power of sound: unravelling how acoustic communication shapes group dynamics
Call combination production is linked to the social environment in Western Australian magpies (Gymnorhina tibicen dorsalis)

... Thus, if Minimal Compositionality were to replace Rizzi's '1-merge', it would give rise to a slightly more liberal system, with non-recursive combinations of multiple calls, not necessarily limited to two. On the other hand, if Minimal Compositionality is added to Rizzi's '1-merge' (as a semantic system can be added to a syntax), the result will be nonrecursive combinations of two calls only [note that there might be independent reasons to block the processing of three calls, as argued by Watson et al. (2023) for the chestnut-crowned babbler, Pomatostomus ruficeps]. A clear conclusion of this discussion is that the existence of three-call combinations should be investigated. ...

Cognitive constraints on vocal combinatoriality in a social bird
  • Citing Article
  • May 2023

iScience

... Another line of research explores compositionality in animal communication. A body of studies have shown that some primates and birds produce combinations of sounds, and that these combinations elicit different responses than if the sounds consist of single elements or other combinations [93][94][95][96][97][98][99][100][101][102][103][104]. The unlimited ability to make new combinations of meaningful building blocks, that was claimed to be unique for human communication [105,106], is not observed in any of these studies. ...

Call combinations and compositional processing in wild chimpanzees

... [43]), are another promising method, with the advantage of using biologically relevant hypotheses based on the genetic distance from the common ancestor of all dog breeds (e.g. [44,45]). At the same time, we think that in terms of ecological relevance, investigations that are based on functional breed selection (e.g. ...

Genetic distance from wolves affects family dogs’ reactions towards howls

Communications Biology

... species capable of continually acquiring/modifying vocalizations through life) may also be capable of combining calls (e.g. some elephant [19], cetacean [20] and songbird [21] species). In light of this evidence, it seems that repertoire size constraint should not be the only factor considered when investigating the potential factors influencing the occurrence of combinatoriality. ...

Multi-level combinatoriality in magpie non-song vocalizations

... From an evolutionary point of view, our baseline expectation should be that humans probably inherited some form of dominance psychology from our shared ancestry with chimpanzees and bonobos, whose social life is strongly shaped by dominance hierarchies [8,19,[70][71][72][73] Dominance rank is associated among them with both shorter-term social influence, including access to food and mating opportunities [5][6][7]74], and longer-term outcomes such mating opportunities and fitness [35,75]. Alongside directed movements like chasing and biting as well as cues like peering [8,9], both of our closest relatives acknowledge their place in a stable hierarchy by signalling dominance or submission using arbitrary displays such as pant-grunts [8,19,71,72,76]. Strikingly, aspects of human dominance appear evolutionarily continuous with those in apes, such as attentional mechanisms [77] and some cross-culturally conserved and probably reliably developing ethological displays, which include the pride display (homologous with the bluff display in chimpanzees; [78]) relevant for dominance when accompanying hubristic pride [49], and the shame display (with elements in a variety of primate submission displays, such as crouching or a lowered body posture), which can signal pure subordination, especially in non-western, educated, industrialized, rich and democratic (WEIRD) societies [78][79][80]. ...

Between-group variation in production of pant-grunt vocalizations by wild bonobos (Pan paniscus)

Behavioral Ecology and Sociobiology

... The fact that infants show a significantly different looking pattern than both human adults and apes suggests that proficiency in language is not driving the observed looking pattern. Earlier explanations that relied on human-animal morphological differences in vocal tracts [44], lack of declarative communication [45], or lack of call composition [46] no longer stand. Our results add to the shared cognitive foundations of language by suggesting that event decomposition, a foundation of syntax, evolved before language, on par with signal combinations [47], theory of mind [48], and joint commitment [49]. ...

Declarative referential gesturing in a wild chimpanzee ( Pan troglodytes )

Proceedings of the National Academy of Sciences

... Faces in and of themselves, including neutral faces, are a gateway to a better understanding of multisensorial combinatorial and compositional communicative behavior. Spiess et al. (2022) turn our attention to birds and write on "Syntax-like Structures in Maternal Contact Calls of Chestnut-Crowned Babblers (Pomatostomus ruficeps)" Chestnut-crowned babblers (Pomatostomus ruficeps) are a bird species found inland of southeastern Australia. They communicate socially through a series of calls, around 18 in total. ...

Syntax-like Structures in Maternal Contact Calls of Chestnut-Crowned Babblers (Pomatostomus ruficeps)

International Journal of Primatology

... More generally, our method provides new perspectives on studying and comparing interaction systems, which might significantly contribute to better understand their evolutionary origins. A growing number of studies are addressing complexity in communication sequences by looking for context-specific combinations, compositionality and combinatoriality in the communication systems of NHP (Arnold & Zuberbühler, 2008;Bosshard et al., 2022;Engesser et al., 2016;Leroux & Townsend, 2020;Ouattara et al., 2009; for reviews see: Nowak et al., 2000;Zuberbühler, 2018). As explained in Amici et al. (2024), the vast majority of studies in this field has been conducted on single modalities of communication, and the methods vary greatly according to the species. ...

From collocations to call-ocations: using linguistic methods to quantify animal call combinations

Behavioral Ecology and Sociobiology

... Fröhlich 2017; Heesen et al. 2019;Watson et al. 2020;Safryghin et al. 2022). Here, building on work from Leroux et al. (2022), we investigated chimpanzee signals using collocation analysis. This method quantifies whether signal combinations are non-random, defined here as the co-occurrence of signal components with a frequency that is statistically above chance. ...

Call combinations in chimpanzees: a social tool?
  • Citing Article
  • August 2022

Behavioral Ecology