Sam Passmore’s research while affiliated with Australian National University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (59)


Map of 719 Cantometrics societies (represented by 5242 songs)
Each point is a society, sized by the number of songs recorded for that society. All points are used to estimate latent variables. 121 societies (represented by 923 songs) are matched to both genetic and linguistic data and are coloured red. Societies without matching genetic and linguistic data are in grey. See Fig. S2 and S3 for maps of the 10 or more-song sample, and the SCCS sample. Maps made with Natural Earth.
Scatter plots between Ornamentation and Rhythm latent variables for 5242 songs
A Each point is a song, and labels within the graph refer to exemplar songs described in S2.2: [A] Song with Xylophone—Burmese, [B] Djokobo—Mbendjele, [C] Caravan Song—Tibet, [D] Alima Song—Mbuti. B The same scatter plot as (A), with topographical gridlines showing the density of points in Fig. 2a, overlaid with points from the Atlantic-Congo language family (red) and the Sino-Tibetan language family (blue). Squares show songs from the Atlantic-Congo society Ubangi (red), and the Sino-Tibetan society Burmese (blue). The dispersal of squares reflects within-society diversity.
A variogram showing the spatial autocorrelation coefficients (r) as a function of distance for society-level pairwise measures of Genetic FST distances, phylogenetic distance from the global language phylogeny, and Musical PhiST distances
White shapes indicate significant autocorrelation and black shapes indicate non-significant autocorrelation. Error bars show the 95% confidence intervals for each distance. See Fig. S7 for the same graph for the individual musical metrics. See Fig. S8 and S9 for the 10 or more-song sample and the SCCS sample. See Supplementary Data 2–4 for detailed statistical information.
Heat map of the Adjusted R² from partial RDA tests for each aspect of musical diversity
RDA tests show the amount of variation explained by Genetic, Linguistic, or Spatial distances while controlling (ctrl) for a second process. From top-left, clockwise, Global distances, within Africa, within Europe, and within Southeast Asia. All correlations are rounded to two decimal places.
Global musical diversity is largely independent of linguistic and genetic histories
  • Article
  • Full-text available

May 2024

·

166 Reads

·

3 Citations

Sam Passmore

·

·

Chiara Barbieri

·

[...]

·

Music is a universal yet diverse cultural trait transmitted between generations. The extent to which global musical diversity traces cultural and demographic history, however, is unresolved. Using a global musical dataset of 5242 songs from 719 societies, we identify five axes of musical diversity and show that music contains geographical and historical structures analogous to linguistic and genetic diversity. After creating a matched dataset of musical, genetic, and linguistic data spanning 121 societies containing 981 songs, 1296 individual genetic profiles, and 121 languages, we show that global musical similarities are only weakly and inconsistently related to linguistic or genetic histories, with some regional exceptions such as within Southeast Asia and sub-Saharan Africa. Our results suggest that global musical traditions are largely distinct from some non-musical aspects of human history.

Download

Convergent evolution in a large cross-cultural database of musical scales

December 2023

·

119 Reads

·

5 Citations

Scales, sets of discrete pitches that form the basis of melodies, are thought to be one of the most universal hallmarks of music. But we know relatively little about cross-cultural diversity of scales or how they evolved. To remedy this, we assemble a cross-cultural database (Database of Musical Scales: DaMuSc) of scale data, collected over the past century by various ethnomusicologists. Statistical analyses of the data highlight that certain intervals (e.g., the octave, fifth, second) are used frequently across cultures. Despite some diversity among scales, it is the similarities across societies which are most striking: step intervals are restricted to 100-400 cents; most scales are found close to equidistant 5- and 7-note scales. We discuss potential mechanisms of variation and selection in the evolution of scales, and how the assembled data may be used to examine the root causes of convergent evolution.


Global and regional prevalence of group singing. (a) Location of 247 societies with at least 10 recordings in the Global Jukebox (GJB). Societies with the same Ethnographic Atlas ID are merged. (b) Location of 49 societies with at least 10 texts in the Natural History of Song Ethnography (NHS). (c) Proportion of group singing in every geographical region. Circles are sized based on regional sample sizes, displayed in parentheses (number of GJB songs, number of NHS texts).
Relationship between group singing and social context. Probability of group singing for each text/recording as predicted by social context and society random effects, for Natural History of Song (a) and Global Jukebox (b). For each context, the number of societies and number of texts are listed in parentheses. Datapoints shown below ridge.
Relationship between group singing and community size. Proportion of group singing and local community size in societies with a sample of at least 10 entries in GJB (a), NHS (b) and both (c). Circle size weighted by number of entries; colour gradient indicates proportion of group singing; shaded areas indicate 95% CI.
Relationship between sociovocal styles and social differentiation. Each facet shows the probabilities of songs belonging to the specified type of singing as predicted by social differentiation and random society effects. Data from societies with at least 10 songs are shown (641 songs, 50 societies). Left-hand panel shows the skewed predictions for the interlocked response category, attributable to just two societies. Points are jittered (0.02).
Group singing is globally dominant and associated with social context

September 2023

·

165 Reads

·

6 Citations

Music is an interactive technology associated with religious and communal activities and was suggested to have evolved as a participatory activity supporting social bonding. In post-industrial societies, however, music's communal role was eclipsed by its relatively passive consumption by audiences disconnected from performers. It was suggested that as societies became larger and more differentiated, music became less participatory and more focused on solo singing. Here, we consider the prevalence of group singing and its relationship to social organization through the analysis of two global song corpora: 5776 coded audio recordings from 1024 societies, and 4709 coded ethnographic texts from 60 societies. In both corpora, we find that group singing is more common than solo singing, and that it is more likely in some social contexts (e.g. religious rituals, dance) than in others (e.g. healing, infant care). In contrast, relationships between group singing and social structure (community size or social differentiation) were not consistent within or between corpora. While we cannot exclude the possibility of sampling bias leading to systematic under-sampling of solo singing, our results from two large global corpora of different data types provide support for the interactive nature of music and its complex relationship with sociality.


Group singing is globally dominant and associated with social context

August 2023

·

21 Reads

·

3 Citations

Music is an interactive technology associated with religious and communal activities and was suggested to have evolved as a participatory activity supporting social bonding. In post-industrial societies, however, music’s communal role was eclipsed by its relatively passive consumption by audiences disconnected from performers. It was suggested that as societies became larger and more differentiated, music became less participatory and more focused on solo singing. Here, we consider the prevalence of group singing and its relationship to social organization through the analysis of two global song corpora: 5,776 coded audio recordings from 1,024 societies, and 4,709 coded ethnographic texts from 60 societies. In both corpora, we find that group singing is more common than solo singing, and that it is more likely in some social contexts (e.g. religious rituals, dance) than in others (e.g. healing, infant care). In contrast, relationships between group singing and social structure (community size or social differentiation) were not consistent within or between corpora. While we cannot exclude the possibility of sampling bias leading to systematic under-sampling of solo singing, our results from two large global corpora of different data types provide support for the interactive nature of music and its complex relationship with sociality.


Figure 1 Map of 993 societies represented in Cantometrics, sized by the number of songs (a total of 5,484 songs, the maximum number within one society is 72, and the minimum is 1). The inset histogram shows the global distribution of song unusualness, with a dotted line showing the top 3% of unusual songs.
Figure 2 The six steps for calculating Unusualness for any particular song are as follows. 1) Identify a region of interest (South America). 2) Identify a society of interest (Canela). 3) For all songs, excluding songs by Canela performers, calculate the probability of each state within each feature (Note the values in the figure are for illustrative purposes only). 4) For each Canelo song, take the state probability for the value coded within that song and sum it with all other state probabilities. For example: in song 3240, the Vocal Organisation variable is coded as 13 (Polyphony, red). This occurs in 10% of all South American songs, or 0.1. For interval width, song 3240 is coded as a 4 (Narrow intervals), this occurs in 0.2 of all other South American songs (blue). Nasality is coded as 10 (Slight nasalization), which occurs in 0.1 of all other South American songs. We sum these state probabilities together and take the log of the summed value to calculate unusualness (the log-likelihood). 5) Repeat from step 2 for all societies in South America. 6) Repeat from step 1 once all South American societies have been calculated.
Figure 3 Boxplots showing the spread of unusualness within the top and bottom 10 unusual societies (from societies represented by 5 songs or more).
Figure 5 A) Posterior distributions of the regression coefficient estimates for the four predictor variables. Blue fill indicates 89% confidence intervals. Only the Nearest phylogenetic neighbour and Kinship unusualness show 89% confidence intervals not containing zero. B) Scatter plot between the unusualness score for a particular song, and the average unusualness score for all other songs within a society (Leave One Out). Scales are reversed so that further to the right of the x-axis, or higher up the y-axis is more unusual. Quantitatively smaller values are more unusual.
The Exceptions and the Rules in Global Musical Diversity

August 2023

·

81 Reads

·

3 Citations

Journal of Cognition

Global music diversity is a popular topic for both scientific and humanities researchers, but often for different reasons. Scientific research typically focuses on the generalities through measurement and statistics, while humanists typically emphasize exceptions using qualitative approaches. But these two approaches need not be mutually exclusive. Using a quantitative approach to identify musical outliers and a qualitative discussion of the most unusual songs, we can combine scientific and humanities approaches to unite knowledge on musical diversity. Objectively defining unusual music is a delicate task, having historically been subject to Eurocentric approaches. Using the Global Jukebox, a dataset containing almost 6,000 songs from over 1,000 societies coded on 37 "Cantometric" variables of musical style, we designate the unusualness of a song as the frequency of its coded variables relative to their regional frequency. Using quantitative metrics to identify outliers in musical diversity, we present a qualitative discussion of some of the most unusual individual songs (from a Panpipe ensemble from Kursk, Russia), and a comparison of unusual repertoires from Malay, Kel Aïr, and Moroccan Berber musical cultures. We also ask whether unusual music is the result of unusual social organisation or isolation from other groups. There is weak evidence that the unusualness of music is predicted by kinship organisation and cultural isolation, but these predictors are heavily outweighed by the finding that unusual songs are best predicted by knowing the society they come from - evidence that quantitatively supports the existence of musical style.


Fig. 1. The global distribution of fusion and informativity scores. The scores with a minimum of 0 (absence of all metric features) and a maximum of 1 (presence of all metric features) have been standardized to a mean of 0 and a variance of 1. The hotspots of low fusion are located in West Africa and Southeast Asia. Many Austronesian languages also rank low on fusion. The geographic patterns of informativity scores are less clear compared to fusion. Among lower-scoring languages are those spoken in West Africa, Southeast Asia, many Uralic languages, and languages spoken in India (Indo-Aryan and Dravidian).
Fig. 3. The scores of fusion and informativity on the global tree. The scores with a minimum of 0 (absence of all metric features) and a maximum of 1 (presence of all metric features) have been standardized to a mean of 0 and a variance of 1. We detect many patterns of closely related languages scoring similarly, which might indicate the faithful transmission of grammatical complexity from ancestor languages to their descendants rather than large-scale adaptations of grammatical complexity to changes in sociodemographic factors. Similar to geographic distribution, we see that fusion scores follow a more defined pattern of phylogenetic clustering compared to informativity scores.
Societies of strangers do not speak less complex languages

August 2023

·

345 Reads

·

15 Citations

Science Advances

Many recent proposals claim that languages adapt to their environments. The linguistic niche hypothesis claims that languages with numerous native speakers and substantial proportions of nonnative speakers (societies of strangers) tend to lose grammatical distinctions. In contrast, languages in small, isolated communities should maintain or expand their grammatical markers. Here, we test these claims using a global dataset of grammatical structures, Grambank. We model the impact of the number of native speakers, the proportion of nonnative speakers, the number of linguistic neighbors, and the status of a language on grammatical complexity while controlling for spatial and phylogenetic autocorrelation. We deconstruct "grammatical complexity" into two separate dimensions: how much morphology a language has ("fusion") and the amount of information obligatorily encoded in the grammar ("informativity"). We find several instances of weak positive associations but no inverse correlations between grammatical complexity and sociodemographic factors. Our findings cast doubt on the widespread claim that grammatical complexity is shaped by the sociolinguistic environment.


Locations of languages in Kinbank
Each point indicates a unique language variety and is centered on the geographical center-point of the area where the speakers live when the data were recorded, but may also indicate a historical location, the demographic center-point or some other representative point. Colored points indicate languages from the 7 language families mentioned in text: Austronesian (light blue; n = 377); Pama-Nyungan (red; n = 105); Indo-European (yellow; n = 106), Bantu (orange; n = 113), Uralic (purple; n = 25), Tupian (maroon; n = 29); Cariban (pale blue; n = 7) and other languages (grey).
Probability of consonant and vowel combinations used in words referring to mother
Each row shows the probability density estimate from the model posterior. The colored sections and annotated numbers show 89% highest probability density intervals. Intervals that contain 0.5 have no statistical effect (since there is a 50:50 chance the sound refers to mother or father). Results show words with a root syllable of [ŋa] and [na] are more likely to refer to mother and [pa] and [ta] to refer to father. [ma] words are predicted to be used equally between mother and father words.
Maximum clade credibility tree of Bantu languages displaying four possible states: No cross-cousin marriage (CM) and no bifurcate-merging (BM) of kinterms (black), no cross-cousin marriage with bifurcate-merging (yellow), cross-cousin marriage and no bifurcate merging (red), and both cross-cousin marriage and bifurcate merging are present (light blue), with their likely probabilities from a dependent model of evolution
Languages marked with grey show where kinterms data were present, but social data were imputed by BayesTraits. Pie-graphs indicate the probability of states at each node, which were calculated using the BayesTrait command "RecNode". Deeper into the phylogeny there is much uncertainty as to the relationship between these two traits, indicated by the almost equal probability of all four states.
Descriptions of the tables used in Kinbank and the data they contain
An extended description with additional structuring files is given in S3 Table in S1 File.
Kinbank: A global database of kinship terminology

May 2023

·

504 Reads

·

7 Citations

For a single species, human kinship organization is both remarkably diverse and strikingly organized. Kinship terminology is the structured vocabulary used to classify, refer to, and address relatives and family. Diversity in kinship terminology has been analyzed by anthropologists for over 150 years, although recurrent patterning across cultures remains incompletely explained. Despite the wealth of kinship data in the anthropological record, comparative studies of kinship terminology are hindered by data accessibility. Here we present Kinbank, a new database of 210,903 kinterms from a global sample of 1,229 spoken languages. Using open-access and transparent data provenance, Kinbank offers an extensible resource for kinship terminology, enabling researchers to explore the rich diversity of human family organization and to test longstanding hypotheses about the origins and drivers of recurrent patterns. We illustrate our contribution with two examples. We demonstrate strong gender bias in the phonological structure of parent terms across 1,022 languages, and we show that there is no evidence for a coevolutionary relationship between cross-cousin marriage and bifurcate-merging terminology in Bantu languages. Analysing kinship data is notoriously challenging; Kinbank aims to eliminate data accessibility issues from that challenge and provide a platform to build an interdisciplinary understanding of kinship.


Fig. 1. Variance explained by phylogeny and geography. Each point is a Grambank feature. The panels represent different domains of grammar that the features are associated with: (A) clausal, (B) nominal domain, (C) pronominal domain, and (D) verbal domain. A high value indicates that a large part of the variance is explained by either space (y axis) or phylogeny (x axis). The ellipses represent the standard deviation of the joint posterior, tilted for the covariance.
Fig. 2. Grammatical similarity in the Grambank sample of languages. The color coding represents the distribution of languages according to the first three principal components (PCs) mapped onto RGB color space (PC1, red; PC2, green; PC3, blue). Similarity in color indicates similarity in grammatical structure on the first three dimensions. See fig. S15 for loading of Grambank features on the first two components and fig. S16 for correlation with theoretical metrics.
Fig. 3. Distribution of the 12 largest families in our dataset in Grambank design space. The x axis represents the first principal component (PC1), and the y axis represents the second principal component (PC2). All languages are plotted, and for each facet, one family is highlighted in a different color. Austronesian languages, which are known for lacking gender and having little morphology, are found on the far left.
Grambank reveals the importance of genealogical constraints on linguistic diversity and highlights the impact of language loss

April 2023

·

1,060 Reads

·

61 Citations

Science Advances

While global patterns of human genetic diversity are increasingly well characterized, the diversity of human languages remains less systematically described. Here, we outline the Grambank database. With over 400,000 data points and 2400 languages, Grambank is the largest comparative grammatical database available. The comprehensiveness of Grambank allows us to quantify the relative effects of genealogical inheritance and geographic proximity on the structural diversity of the world's languages, evaluate constraints on linguistic diversity, and identify the world's most unusual languages. An analysis of the consequences of language loss reveals that the reduction in diversity will be strikingly uneven across the major linguistic regions of the world. Without sustained efforts to document and revitalize endangered languages, our linguistic window into human history, cognition, and culture will be seriously fragmented.


Independent histories underlie global musical, linguistic, and genetic diversity

March 2023

·

29 Reads

·

4 Citations

Music is a universal, diverse cultural trait shaped by cultural and biological evolution. The extent to which global musical diversity traces the historical movements of people and their cultures is unresolved, with regional studies producing mixed results. Using a global musical dataset of 5,242 songs and 719 societies we identify five axes of musical diversity and show that musical traits contain geographically constrained patterns of between-society diversity. We pair musical data to genetic and linguistic datasets spanning 121 societies containing 981 songs, 1,296 individual genetic profiles, and 121 languages, showing that musical traditions contain similar, albeit weaker, patterns of spatial decay to linguistic diversity and genetic diffusion. However, the structure of musical similarity is different to linguistic or genetic histories. Musical relationships correlate with genetic and linguistic relationships within some regions, but not globally. Our results suggest that global musical traditions are distinct from non-musical aspects of human history.


Figure 2. The coefficients and 95% credible intervals for fixed effects in 6 bivariate regression models and one multivariable model (L1 combined & L2 combined) for fusion and informativity, without and with spatiophylogenetic random effects (dashed and solid lines respectively). The linear regression coefficients of fixed effects representing exotericity in the spatiophylogentic models are depicted with the error bars. Error bars in black cross zero, whereas the bars colored red and blue indicate robust positive and negative relationships. All effects that appear influential (colored in red or blue) in models that do not control for random effects of genealogy and geography (dashed error bars) disappear after we control for these sources of non-independence (solid error bars). The exceptions are the weak positive effects of L1 speakers on fusion and
Figure 4. The distribution of fusion scores in Southern Africa (A) and informativity scores in Eurasia (B.1) with the focus on Uralic languages and the phylogeny of Uralic languages included in the global tree (B.2). The difference in fusion scores between two Bantu languages -Tsonga (low) and Southern Sotho (high) -can be explained from the perspective of phylogenetic relatedness, with Tsonga resembling the scores of other low-scoring outliers in Southern Africa from other language families than Bantu and to lesser extent its neighboring sister language Tswa (A). Higher informativity scores in Uralic languages show a clear pattern of phylogenetic clustering (B.2) and are found in Samoyedic (Nganasan, Selkup, Tundra Nenets, and Forest Enets), Ugric (Hungarian), and Permian (Komi-Permyak and Komi-Zyrian), which diverged earlier from the rest of the languages. Higher informativity scores in Uralic languages can also be ascribed to contact with Indo-European languages (B.1): Hungarian is surrounded by higherscoring languages spoken in Europe and the speakers of Tundra Nenets and Komi-Permyak are typically bilingual in Russian.
WAIC values and quantiles (0.025, 0.5, and 0.975) of estimates of models fitting only random effects and intercept in models predicting fusion.
Societies of strangers do not speak grammatically simpler languages

February 2023

·

338 Reads

·

7 Citations

Many recent proposals claim that languages adapt to their environments. The Linguistic Niche hypothesis claims that languages with numerous native speakers and substantial proportions of non-native speakers (societies of strangers) will tend to lose grammatical distinctions. In contrast, languages in small, isolated communities should maintain or expand their range of grammatical markers. Here, we test such claims using a new global dataset of grammatical structures - Grambank. We model the impact of the number of native speakers, the proportion of non-native speakers, the number of linguistic neighbors, and the status of a language on grammatical complexity while controlling for spatial and phylogenetic autocorrelation. We deconstruct "grammatical complexity" into two separate dimensions: (i) how much morphology a language has ("fusion"), and (ii) the amount of information obligatorily encoded in the grammar ("informativity"). We find several instances of weak positive associations but no inverse correlations between grammatical complexity and sociodemographic factors. Our findings cast doubt on the widespread assumption that grammatical complexity is shaped by the sociolinguistic environment.


Citations (22)


... We suggest that future research should aim to further characterize voice preferences in a more integrative way, by measuring shared preferences for songs and vocalizations spanning different geographic, linguistic and cultural contexts, as seen in recent cross-cultural studies [15][16][17]21,88]. In addition to the cultural factors previously discussed, future studies could also explore a larger range of vocalizations. ...

Reference:

Appreciation of singing and speaking voices is highly idiosyncratic
Global musical diversity is largely independent of linguistic and genetic histories

... 8,9 Despite cross-cultural variation, scales across the world tend to be more similar than chance, and certain scales (e.g., the minor pentatonic) repeatedly appear in far-flung corners of the world. [10][11][12] This suggests that there is some selection process or conscious innovation common to different groups of humans that leads to the use of similar scales (SI Section S1). [12][13][14][15][16][17][18][19][20][21][22][23] In this study, we compare and contrast theories of how and why scales evolved, using a diverse, global dataset of scales. ...

Convergent evolution in a large cross-cultural database of musical scales

... Although the relative importance of these different environmental influences remains unknown, in general, richer musical and singing environments are considered to foster greater singing participation and ability. While group singing is prevalent in most cultures (Shilton et al., 2023), children raised in cultures where singing is a normative social behavior generally learn to sing at a faster rate (Kreuntzer, 2001). Similarly, sociocultural beliefs such as stereotypes around singing may influence how likely individuals are to take up singing (Harrison, 2007). ...

Group singing is globally dominant and associated with social context

... Being able to address this issue based not only on a handful of languages but based on evidence from hundreds to thousands of them is a major recent development in the field. Some examples of new large-scale resources include the Database of Cross-Linguistic Colexifications (Rzymski et al., 2020), registering the way meanings are expressed in over 3,000 languages; or Kinbank, Database of Kinship Terminology (Passmore et al., 2023) which collects data on how languages express kinship (e.g., whether they have a word for uncle; or if they differentiate between paternal and maternal grandfather with different words). To be clear, while these resources are new, they build on the fundamental field work of linguists who went -and still go -into the world to document languages. ...

Kinbank: A global database of kinship terminology

... The languages are evaluated as more similar if their phoneme distributions are alike. In the typology-based similarity assessment, we examine how similar the typological features are using the Grambank dataset [20], which numerically records the typological characteristics of languages. In this study, we primarily utilize corpus-based similarity assessment, while typology-based similarity evaluation is employed as a supplementary method to examine how well it aligns with the similarity evaluation of data within the same language family. ...

Grambank reveals the importance of genealogical constraints on linguistic diversity and highlights the impact of language loss

Science Advances

... The influence of autocorrelation due to historical relationships is somewhat mitigated by using a stratified sample (the SCCS), originally designed to avoid this problem [24]. More importantly, recent work suggests musical styles are weakly correlated with genetic, linguistic and spatial relationships [44]. We therefore have good reasons to consider GJB data as suitable for this kind of study, notwithstanding the inherent tradeoffs one must take into account between sample size and quality. ...

Independent histories underlie global musical, linguistic, and genetic diversity
  • Citing Preprint
  • March 2023

... Also other domains of language, such as phonology (Blaxter, 2017), phonotactics (Baumann & Matzinger, 2021;Napoleão de Souza & Sinnemäki, 2022) and syntax (Benítez-Burraco, S. Chen, Gil, Gaponov, et al., 2024) have been shown to be influenced by societal factors, such as language contact. At the same time, there is still discussion if and how language contact causes morphological simplification, from both experimental (Cuskley et al., 2015;De Smet, Rosseel, & Van De Velde, 2022) and quantitative cross-linguistic studies (Kauhanen, Einhaus, & Walkden, 2023;Koplenig, 2019;Lupyan & Raviv, 2024;Shcherbakova et al., 2023). In any case, instead of just looking at correlations between proportions of L2 speakers and morphological complexity, it is worthwhile to study which specific language-internal and sociodemographic factors mediate the relationship between social and language structure (Sinnemäki, 2020;Sinnemäki & Di Garbo, 2018). ...

Societies of strangers do not speak grammatically simpler languages

... Speech and song are produced by the same vocal tract, yet each makes distinct demands on musculature, breathing, and motor control mechanisms 13,14 , raising the possibility that certain acoustical cues could serve as markers of each category 15 . However, even though people readily distinguish speech and song, the cues underlying the categories, even within cultures, are far from clear 11,[16][17][18][19] , so that such a claim is difficult to address. Indeed, even if speech and singing reliably exist as separate, recognizable entities, their cognitive representation could depend mostly on learned regularities that are particular to each cultural group. ...

The Global Jukebox: A public database of performing arts and culture

... The results decisively vindicated Darwin's proposal of human races and languages evolving in concert following a tree-like history of splits and isolation (Darwin 1860) at the global level; however, it did not align with the SFE model with Africa as the center of origin, with it instead being more inclined toward a Eurasian-centered model. A more recent and novel analysis, which covered a cultural layer adjacent to language, namely, music, was carried out on a dataset of 152 societies (containing 1,054 songs from the public database The Global Jukebox in the form of raw coded Cantometrics data, 1,719 genomic profiles, and 152 languages); the analysis demonstrated weak links between music and language (R 2 <= 0.05), as well as with genetic distance and geographic proximity, in contrast to the much stronger relationships found between genes and geography: the results suggest that genes and culture are surprisingly decoupled (Passmore et al. 2022). For the Indo-European family, Bouckaert et al. (2012) used Bayesian phylogeographic approaches with a dataset of basic vocabulary term lists from 103 ancient and contemporary Indo-European languages to model the expansion of the family, finding decisive support for an Anatolian origin over a steppe origin, with both the inferred timing and root location of the Indo-European language trees fitting with an agricultural expansion from Anatolia beginning 8000 to 9500 years ago. ...

Global relationships between musical, linguistic, and genetic diversity
  • Citing Preprint
  • March 2022