[Show abstract][Hide abstract] ABSTRACT: Mandarin Chinese is characterized by being a tonal language and as such the
pitch (or F0) of its utterances carries considerable linguistic information.
However, speech samples from different individuals are subject to changes in
amplitude and phase which must be accounted for in any analysis which attempts
to provide a typology of the language. A joint model for amplitude, phase and
duration is presented which combines elements from Functional Data Analysis,
Compositional Data Analysis and Linear Mixed Effects Models. By decomposing
functions via a functional principal component analysis, and connecting
registration functions to compositional data analysis, a joint multivariate
mixed effect model can be formulated which gives insights into the relationship
between the different modes of variation as well as their dependence on
linguistic and non-linguistic covariates. The model is applied to the COSPRO-1
data set, a comprehensive database of spoken Taiwanese Mandarin, containing
approximately 50 thousand phonetically diverse sample F0 contours, and reveals
that phonetic information is jointly carried by both amplitude and phase
[Show abstract][Hide abstract] ABSTRACT: A model for fundamental frequency (F0, or commonly pitch) employing a functional principal component (FPC) analysis framework is presented. The model is applied to Mandarin Chinese; this Sino-Tibetan language is rich in pitch-related information as the relative pitch curve is specified for most syllables in the lexicon. The approach yields a quantification of the influence carried by each identified component in relation to original tonal content, without formulating any assumptions on the shape of the tonal components. The original five speaker corpus is preprocessed using a locally weighted least squares smoother to produce F0 curves. These smoothed curves are then utilized as input for the computation of FPC scores and their corresponding eigenfunctions. These scores are analyzed in a series of penalized mixed effect models, through which meaningful categorical prototypes are built. The prototypes appear to confirm known tonal characteristics of the language, as well as suggest the presence of a sinusoid tonal component that is previously undocumented.
The Journal of the Acoustical Society of America 06/2012; 131(6):4651-64. · 1.65 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Fundamental frequency (F0, broadly 'pitch') is an integral part of spoken human language; however, a comprehensive quantitative model for F0 can be a challenge to formulate owing to the large number of effects and interactions between effects that lie behind the human voice's production of F0, and the very nature of the data being a contour rather than a point. The paper presents a semiparametric functional response model for F0 by incorporating linear mixed effects models through the functional principal component scores. This model is applied to the problem of modelling F0 in the tone language Qiang, a language in which relative pitch information is part of each word's dictionary entry. Copyright (c) 2010 Royal Statistical Society.