About
252
Publications
61,912
Reads
4,254
Citations
Publications (252)
The increasing availability of cross-linguistic databases dedicated to documenting morphosyntactic, lexical and phonological features has led to a proliferation in the use of such data for studies on language evolution and human history. However, most of these databases were not designed to ensure independence of features, such that it is not valid to jointly u...
When speakers of different languages are in contact, they often borrow features like sounds, words, or syntactic patterns from one language to the other, but the lack of historical data has hampered estimation of this effect at a global scale. We break out of this impasse by using genetic admixture as a proxy for population contact. We find that la...
Human population history, as traced by our genome, has shaped the distribution of languages around the world. While case studies suggest that population history can also affect specific structures of languages, such as patterns in their sound systems and grammars, results are conflicting, and it remains unknown whether such effects hold globally. H...
Macro-rhythm is a parameter that distinguishes the prosodic profiles of languages at the intonational level. However, no conclusive quantification of macro-rhythm differences between languages has been proposed and there are to date no dedicated studies investigating the regularity and variability of macro-rhythm within a single language. This stud...
Human language relies on a rich cognitive machinery, partially shared with other animals. One key mechanism, however, decomposing events into causally linked agent–patient roles, has remained elusive with no known animal equivalent. In humans, agent–patient relations in event cognition drive how languages are processed neurally and expressions stru...
Most animals interact with conspecifics through acoustic signals that are modulated in frequency and rhythm. While small animals vocalize at higher pitch than large ones due to the smaller size of their vocal apparatus, the rules governing vocalization rhythms throughout the animal kingdom remain unknown. Vocal rhythms serve as a natural informatio...
Both areal and phylogenetic affiliation have been discussed as driving factors of the distribution of word order in the languages of the world. However, disentangling the interaction of these two factors is challenging. Here we take Indo-European as a test case. Word order in this family is largely homogeneous both within areas and within branches,...
One of the most specific features of the human language faculty is its intrinsic spatio-temporal dynamic, as reflected in languages' characteristic mode of extra-genomic evolution. Understanding what has emerged in the hominin lineage therefore requires capturing this dynamic and the diversity of languages and structures that it generates. In this...
It is an unresolved question to what extent the current frequency distributions of linguistic features inform us about what is representative of the language faculty and does not instead result from historical contingencies. We probe this question by leveraging unique data from the oldest attested languages, those preserved through writing from up...
Human evolutionary history in Central Africa reflects a deep history of population connectivity. However, Central African hunter-gatherers (CAHGs) currently speak languages acquired from their neighbouring farmers. Hence it remains unclear which aspects of CAHG cultural diversity result from long-term evolution preceding agriculture and which refl...
When conversing, humans instantaneously predict meaning from fragmentary and ambiguous speech, long before utterance completion. They do this by integrating priors (initial assumptions about the world) with contextual evidence to rapidly decide on the most likely meaning. One powerful prior is attentional preference for agents, which biases senten...
Human evolutionary history in Central Africa reflects a deep history of population connectivity. However, Central African hunter-gatherers (CAHGs) currently speak languages acquired from their neighbouring farmers. Hence it remains unclear which aspects of CAHG cultural diversity result from long-term evolution preceding agriculture, and which ref...
Human language relies on a rich cognitive machinery, partially shared with other animals. One key mechanism, decomposing events into causally-linked agent-patient roles, however, has remained elusive with no known animal equivalent. In humans, agent-patient relations in event cognition drive how languages are processed neurally and expressions stru...
In this work we present a database structure to encode the phenomenon of differential possession across languages, considering noun possession classes and possessive constructions as independent but linked. We show how this structure can be used to study different dimensions of possession: semantics, noun valence, and possessive construction...
The language comprehension system preferentially assumes that agents come first during incremental processing. While this might reflect a biologically fixed bias, shared with other domains and other species, the evidence is limited to languages that place agents first, and so the bias could also be learned from usage frequency. Here, we probe the b...
Language models based on artificial neural networks increasingly capture key aspects of how humans process sentences. Most notably, model-based surprisals predict event-related potentials such as N400 amplitudes during parsing. Assuming that these models represent realistic estimates of human linguistic experience, their success in modelling langua...
Comprehenders across languages tend to interpret role-ambiguous arguments as the subject or the agent of a sentence during parsing. However, the evidence for such a subject/agent preference rests on the comprehension of transitive, active-voice sentences where agents/subjects canonically precede patients/objects. The evidence is thus potentially c...
In syntactic change, it remains an open issue whether word orders are more conservative or innovative in subordinate clauses compared with main clauses. Using 47 dependency-annotated corpora and Bayesian phylogenetic inference, we explore the evolution of S/V, V/O, and S/O orders across main and subordinate clauses in Indo-European. Our results rev...
Agreement markers that refer to the same feature or argument tend to be found in the same position (e.g., all subject agreement markers as suffixes, all object agreement markers as prefixes). However, little is known about the exceptions to this trend: cases where different values of the same feature are marked in different positions in the word (i...
A central aspect of human experience and communication is understanding events in terms of agent (“doer”) and patient (“undergoer” of action) roles. These event roles are rooted in general cognition and prominently encoded in language, with agents appearing as more salient and preferred over patients. An unresolved question is whether this preferen...
Human history is written in both our genes and our languages. The extent to which our biological and linguistic histories are congruent has been the subject of considerable debate, with clear examples of both matches and mismatches. To disentangle the patterns of demographic and cultural transmission, we need a global systematic assessment of match...
Morphological systems often reuse the same forms in different functions, creating what is known as syncretism. While syncretism varies greatly, certain cross-linguistic tendencies are apparent. Patterns where all syncretic forms share a morphological feature value (e.g., first person, or plural number) are most common cross-linguistically, and this...
Morphological systems often reuse the same forms in different functions, creating what is known as syncretism. While syncretism varies greatly, certain cross-linguistic tendencies are apparent. Patterns where all syncretic forms share a morphological feature value (e.g., first person, or plural number) are most common cross-linguistically, and this...
Agreement markers that refer to the same feature or argument tend to be found in the same position (e.g., all subject agreement markers as suffixes, all object agreement markers as prefixes). However, little is known about the exceptions to this trend: cases where different values of the same feature are marked in different positions in the word (i...
Morphological complexity metrics like entropy, and notions like the Paradigm Cell-Filling Problem, have recently (re)gained popularity for the synchronic analysis of inflectional systems. The potential of these quantitative approaches, however, remains largely untapped with respect to diachronic research. This paper constitutes a first exploration...
A prominent principle in explaining a range of word-order regularities is dependency locality, which minimizes the linear distances (dependency lengths) between a head and its dependents. However, it remains unclear to what extent language users in fact observe locality when producing sentences under diverse conditions of cross-categorical harmony...
Emerging data in a range of non-human animal species have highlighted a latent ability to combine certain pre-existing calls together into larger structures. Currently, however, the quantification of context-specific call combinations has received less attention. This is problematic because animal calls can co-occur with one another simply through...
Languages differ in how they mark the dependencies between verbs and arguments, e.g., by case. An eye tracking and EEG picture description study examined the influence of case marking on the time course of sentence planning in Basque and Swiss German. While German assigns an unmarked (nominative) case to subjects, Basque specifically marks agent ar...
Languages tend to encode events from the perspective of agents, placing them first and in simpler forms than patients. This agent bias is mirrored by cognition: Agents are more quickly recognized than patients and generally attract more attention. This leads to the hypothesis that key aspects of language structure are fundamentally rooted in a cogn...
Spoken language, as we have it, requires specific capacities: at its most basic, advanced vocal control and complex social cognition. In humans, vocal control is the basis for speech, achieved through coordinated interactions of larynx activity and rapid changes in vocal tract configurations. Most likely, speech evolved in response to early humans pe...
Inflectional affixes expressing the same grammatical category (e.g., subject agreement) tend to appear in the same morphological position in the word. We hypothesize that this cross‐linguistic tendency toward category clustering is at least partly the result of a learning bias, which facilitates the transmission of morphology from one generation to...
A guide to principles and methods for the management, archiving, sharing, and citing of linguistic research data, especially digital data.
“Doing language science” depends on collecting, transcribing, annotating, analyzing, storing, and sharing linguistic research data. This volume offers a guide to linguistic data management, engaging with current...
As efficient systems of communication, languages are usually expected to map meanings to forms in a one-to-one way, using for example the same affix form (e.g., -s in English) every time a particular meaning is intended (e.g., plural number), and placing affixes with the same meaning consistently in the same position (e.g., always suffixal). Forms...
Romance verb stem alternations (e.g., Spanish tengo ‘I have’ vs. tienes ‘you have’) constitute seemingly unnecessary but highly inheritable morphological traits. Using novel phylogenetic methods, we assess the impact of frequency and alternation patterns on properties of their evolution, specifically on the speed of change and the long term prefere...
Previous work suggests that when speakers linearize syntactic structures, they place longer and more complex dependents further away from the head word to which they belong than shorter and simpler dependents, and that they do so with increasing rigidity the longer expressions get, for example, longer objects tend to be placed further away from the...
The way infants learn language is a highly complex adaptive behavior. This behavior chiefly relies on the ability to extract information from the speech they hear and combine it with information from the external environment. Most theories assume that this ability critically hinges on the recognition of at least some syntactic structure. Here, we s...
When speakers of different languages interact, they are likely to influence each other: contact leaves traces in the linguistic record, which in turn can reveal geographical areas of past human interaction and migration. However, other factors may contribute to similarities between languages. Inheritance from a shared ancestral language and univers...
Culture evolves in ways that are analogous to, but distinct from, genomes. Previous studies examined similarities between cultural variation and genetic variation (population history) at small scales within language families, but few studies have empirically investigated these parallels across language families using diverse cultural data. We repor...
Emerging data in a range of non-human animal species have highlighted a latent ability to combine certain pre-existing calls together into larger structures. Currently, however, there exists no objective quantification of call combinations. This is problematic because animal calls can co-occur with one another simply through chance alone. One commo...
Some languages around the Pacific have multiple possessive classes of alienable constructions using appositive nouns or classifiers. This pattern differs from the most common kind of alienable/inalienable distinction, which involves marking, usually affixal, on the possessum, and has only one class of alienables. The Japanese language isolate Ainu...
In his target article “General linguistics must be based on universals (or nonconventional aspects of language)”, Martin Haspelmath invites us to reconsider the distinction between general linguistics and the linguistics of a particular language. The distinction is clearly useful to demarcate foci of interest and job descriptions. Also, it helps cl...
When speakers of two or more languages interact, they are likely to influence each other: contact leaves traces in the linguistic record, which in turn can reveal geographic areas of past human interaction and migration. However, the complex, multi-dimensional nature of contact has hindered the development of a rigorous methodology for detecting its...
Words in utterance-final positions are often pronounced more slowly than utterance-medial words, as previous studies on individual languages have shown. This paper provides a systematic cross-linguistic comparison of relative durations of final and penultimate words in utterances in terms of the degree to which such words are lengthened. The study...
Planning to speak is a challenge for the brain, and the challenge varies between and within languages. Yet, little is known about how neural processes react to these variable challenges beyond the planning of individual words. Here, we examine how fundamental differences in syntax shape the time course of sentence planning. Most languages treat ali...
Bayesian phylogeography has been used in historical linguistics to reconstruct homelands and expansions of language families, but the reliability of these reconstructions has remained unclear. We contribute to this discussion with a simulation study where we distinguish two types of spatial processes: migration, where populations or languages leave...
Morphological complexity is expected to decrease under mass admixture from adult second language speakers. While this has been chiefly shown for morphological richness, an unresolved question is whether the effect extends to aspects of morphological boundedness. Here we report a case study of Sino-Tibetan verbs, contrasting verbal expressions of tw...
This paper investigates the origins of sortal numeral classifiers in the Indo-Iranian languages. While these are often assumed to result from contact with non-Indo-European languages, an alternative possibility is that classifiers developed as a response to the rise of optional plural marking. This alternative is in line with the so-called Greenber...
Communication plays a vital role in the social lives of many species and varies greatly in complexity. One possible way to increase communicative complexity is by combining signals into longer sequences, which has been proposed as a mechanism allowing species with a limited repertoire to increase their communicative output. In mammals, most studies...
Morphological complexity is expected to decrease under mass admixture from adult second language speakers. While this has been chiefly shown for morphological richness, an unresolved question is whether the effect extends to aspects of morphological boundedness. Here we report a case study of Sino-Tibetan verbs, contrasting verbal expressions of tw...
The way infants manage to extract meaning from the speech stream when learning their first language is a highly complex adaptive behavior. This behavior chiefly relies on the ability to extract information from speech they hear and combine it with the external environment they encounter. However, little is known about the underlying distribution of...
Huijbregts’s commentary about our paper (Blasi et al. 2019) gives us the opportunity to highlight an ongoing problem in the investigation of language evolution that has hindered its research since the 19th century: the lack of engagement with empirical data. This problem stems partially from the issue of how language is defined, and consequently,...
Bayesian phylogeography aims to reconstruct migrations in evolutionary processes. This methodological framework has been used for the reconstruction of homelands and historical expansions of various language families, but its reliability for language diversification research has remained unclear. We contribute to this discussion with a simulation s...
The Language of Hunter-Gatherers, edited by Tom Güldemann, February 2020
In Blasi et al. (2019) we have shown, through a series of statistical analyses and models, that human sound systems have been affected by a transition in bite configuration starting from the Neolithic. Tarasov and Uyeda (2020) (henceforth T&U) raise a number of observations in relation to our article. We appreciate T&U’s engagement with our work an...
Recent research has revealed several languages (e.g. Chintang, Rarámuri, Tagalog, Murrinhpatha) that challenge the general expectation of strict sequential ordering in morphological structure. However, it has remained unclear whether these languages exhibit random placement of affixes or whether there are some underlying probabilistic principles th...
This paper explores the application of quantitative methods to study the effect of various factors on phonetic word duration in ten languages. Data on most of these languages were collected in fieldwork aiming at documenting spontaneous speech in mostly endangered languages, to be used for multiple purposes, including the preservation of cultural h...
Morphological complexity is expected to decrease under mass admixture from adult second language speakers. While this has been chiefly shown for morphological richness, an unresolved question is whether the effect extends to aspects of morphological boundedness. Here we report a case study of Sino-Tibetan verbs, contrasting verbal expressions of tw...
Recent research has revealed several languages (e.g. Chintang, Rarámuri, Tagalog, Murrinhpatha) that challenge the general expectation of strict sequential ordering in morphological structure. However, it has remained unclear whether these languages exhibit random placement of affixes or whether there are some underlying probabilistic principles...
The first fricatives
In 1985, the linguist Charles Hockett proposed that the use of teeth and jaws as tools in hunter-gatherer populations makes consonants produced with lower lip and upper teeth (“f” and “v” sounds) hard to produce. He thus conjectured that these sounds were a recent innovation in human language. Blasi et al. combined paleoanthrop...
Culture evolves in ways that are analogous to, but distinct from, genetic evolution. Previous studies have demonstrated correlations between genetic and cultural diversity at small scales within language families, but few studies have empirically investigated parallels between genetic and cultural evolution across multiple language families using a...
Linguistic diversity is a key aspect of human population diversity and shapes much of our social and cognitive lives. To a considerable extent, the distribution of this diversity is driven by environmental factors such as climate or coast access. An unresolved question is whether the relevant factors have remained constant over time. Here, we addre...
A key step in understanding the evolution of human language involves unravelling the origins of language’s syntactic structure. One approach seeks to reduce the core of syntax in humans to a single principle of recursive combination, merge, for which there is no evidence in other species. We argue for an alternative approach. We review evidence tha...
Significance
When we speak, we unconsciously pronounce some words more slowly than others and sometimes pause. Such slowdown effects provide key evidence for human cognitive processes, reflecting increased planning load in speech production. Here, we study naturalistic speech from linguistically and culturally diverse populations from around the wo...
Some languages constrain the recursive embedding of NPs to some specific morphosyntactic types, allowing it for example only with genitives but not with bare juxtaposition. In Indo-European, every type of NP embedding — genitives, adjectivizers, adpositions, head marking, or juxtaposition — is unavailable for syntactic recursion in at least one att...
Polysynthesis presupposes the existence of 'words', a domain or unit of phonology and syntax that is extremely variable within and across languages: what behaves as a 'word' with respect to one phonological or syntactic rule or constraint may not behave as such with respect to other rules or constraints. Here we develop a system of variables that a...
This handbook offers an extensive cross-linguistic and cross-theoretical survey of polysynthetic languages, in which single multi-morpheme verb forms can express what would be whole sentences in English. These languages and the problems they raise for linguistic analyses have long featured prominently in language descriptions, and yet the essence o...
This handbook offers an extensive cross-linguistic and cross-theoretical survey of polysynthetic languages, in which single multi-morpheme verb forms can express what would be whole sentences in English. These languages and the problems they raise for linguistic analyses have long featured prominently in language descriptions, and yet the essence o...
In explanations of how linguistic structures are distributed in the world, the pendulum has swung back from an emphasis on universals, which dominated the second half of the twentieth century, to a renewed emphasis on local developments and areal diffusion. This shift in emphasis started over twenty years ago with Dryer (1989), who dre...
Language's intentional nature has been highlighted as a crucial feature distinguishing it from other communication systems. Specifically, language is often thought to depend on highly structured intentional action and mutual mindreading by a communicator and recipient. Whilst similar abilities in animals can shed light on the evolution of intention...