Conference PaperPDF Available

Applying the Vowel Formant Dispersion (VFD) method to the study of reduced or alterered vowel productions

Authors:
n=1
Introduction
Formant centre frequencies are used as acoustic measures of
vowel quality because they correlate with vowel articulation
A number of measures based on vowel formants have been
derived to capture whole-system effect of vowel production.
Most commonly used is Vowel Space Area
1
(VSA), the area of
the quadrilateral/triangle enclosing the vowels [a], [u], [i], [æ] or
[a], [u], [i] that constitute acoustic extremes as represented by
average formant frequencies.
However, VSA may not be powerful enough to capture small
changes in vowel articulation due to effects of treatments for
patients with Parkinson’s disease (PD). AVS has not been
evaluated in this respect.
Alternative measures , e.g. Vowel Articulation Index
2
(VAI) or
its inverse FCR
3
, and Average Vowel Space
4
(AVS) are more
focused on detecting expansion or reductions in vowel spaces.
-
These metrics do not give detailed insight into the nature of
the change in vowel articulation.
Further, these metrics reduce all obtained vowel formant
measurements into a single metric,
-
all vowel formant measurements are reduced to a single
metric, giving
-
a substantial loss of statistical power
-
a considerable reduction in quality of intra-speaker models of
articulatory proficiency.
Vowel Formant Dispersions (VFD) in Hz
500
1000
1500
Normal Positive treatment effect Negative treatment effect
Hypokinetic dysarthria
Different kinds of articulation
n=1
F
2
F
1
400
600
800
1000 1500 2000 2500
F
2
F
1
400
600
800
1000 1500 2000 2500
+
+
Relocation of the vowel space
w
i
=
tan
1
F
1
i
F
1
c
F
2
i
F
2
c
for F
2
i
> F
2
c
tan
1
F
1
i
F
1
c
F
2
i
F
2
c
+
π
for F
2
i
> F
2
c
and F
1
i
F
1
c
tan
1
F
1
i
F
1
c
F
2
i
F
2
c
π
for F
2
i
< F
2
c
and F
1
i
< F
1
c
π
2
for F
2
i
= F
2
c
and F
1
i
> F
1
c
π
2
for F
2
i
= F
2
c
and F
1
i
< F
1
c
0 for F
2
i
= F
2
c
and F
1
i
= F
1
c
F
2
c
=
1
n
F
2
i
i=1
n
{V (F
2
, F
1
) : F
1
< F
1c
}
F
1
c
=
1
n
F
1
i
i=1
n
Math corner
d
c
= V
c
1
V
c
2

= (F
1
c
1
F
1
c
2
)
2
+ (F
2
c
1
F
2
c
2
)
2
Fredrik Karlsson & Jan van Doorn
Umeå University, Sweden
F
2
F
1
200
400
600
800
500 1000 1500 2000 2500
n=n
Poster presented at the 14th Meeting of the International Clinical Phonetics and Linguistics Association, in Cork, Ireland, June 27–30 2012
F
2
F
1
200
400
600
800
500 1000 1500 2000 2500
Vowel Formant Dispersion (VFD) in Hz
200
400
600
800
1000
1200
[u]-corner [i]-corner [ae]-corner [a]-corner
Vowel Formant Dispersion
A new metric, Vowel Formant Dispersion
5
(VFD) is proposed to
address these issues for clinical groups, to provide
-
more detailed interpretations of speakers’ production proficiency
-
increased reliability of within-speaker estimates of this
proficiency.
A vowel space centre is defined for each speaker as the point placed
at the mean F
1
of all vowels in the system, and the F
2
of those vowels
that have F1< F
1
(center line) i.e. vowels that are more closed.
-
This definition aims at supporting a robust placement of the
center across vowel spaces with differing properties
Each vowel is subsequently viewed as a vector originating from the
center point, each having a length (the VFD length) and an angle
component.
VFD applied to studies of clinical groups
In statistical group comparisons, all measured vowel productions may be
included in the model
A robust placement of vowel space center affords inferences concerning the
direction of changes in vowel articulation range to be drawn.
The VFD lengths further affords interpretations concerning changes in the
variability of speakers in realizing targets.
By investigating the placement of the vowel space center, a descriptive
analysis of whole-system relocation, that may be a desirable treatment
outcome in transexual patients, is also possible.
-
However, it should be kept in mind that this is a point estimate of the
relocation, and therefore not robust to the effect of individual variation.
References
1. Kent, R. D., & Kim, Y.-J. (2003). Toward an acoustic typology of motor speech disorders. Clinical Linguistics & Phonetics, Clinical Linguistics & Phonetics, 17(6), 427–445.
2. Roy, N., Nissen, S. L., Dromey, C., & Sapir, S. (2009). Articulatory changes in muscle tension dysphonia: evidence of vowel space expansion following manual circumlaryngeal therapy Journal of
communication disorders, Journal of communication disorders, 42(2), 124–135.
3. Sapir, S., Ramig, L. O., Spielman, J. L., & Fox, C. (2010). Formant Centralization Ratio: A Proposal for a New Acoustic Measure of Dysarthric Speech. Journal of Speech, Language, and Hearing
Research, 53(1), 114–114.
4. Lane, H., Matthies, M. L., Perkell, J. S., Vick, J., & Zandipour, M. (2001). The Effects of Changes in Hearing Status in Cochlear Implant Users on the Acoustic Vowel Space and CV
Coarticulation. Journal of Speech, Language, and Hearing Research, 44(3), 552–563.
5. Karlsson, F., & van Doorn, J. (In review) Vowel formant dispersion as a measure of articulation proficiency, The Journal of the Acoustical Society of America.
F
2
F
1
200
400
600
800
500 1000 1500 2000 2500
Te
A
Statistics
Applying the Vowel Formant Dispersion (VFD) method to the study of reduced
or alterered vowel productions
måndag 24 september 12
... In light of its varying effectiveness, several alternative measures to the VSA have been suggested and implemented with some success. For investigations of the vowel acoustic space with the corner vowels alone, the Formant Centralization Ratio (FCR; Sapir et al., 2010) and the Vowel Formant Dispersion (VFD; Karlsson & van Doorn, 2012a, 2012b metrics have both demonstrated increased sensitivity in distinguishing speakers with speech disorders and typically developing (TD) controls. The VFD metric has specific applicability to vowel centralization, and not group differentiation alone, as it provides a measure of degree of centralization, as well as the direction (i.e., which of the corner vowels) from which the centralization occurs (Karlsson & van Doorn, 2012b). ...
Article
Full-text available
Purpose Speech production deficits and reduced intelligibility are frequently noted in individuals with Down syndrome (DS) and are attributed to a combination of several factors. This study reports acoustic data on vowel production in young adults with DS and relates these findings to perceptual analysis of speech intelligibility. Method Participants were eight young adults with DS as well as eight age- and gender-matched typically developing (TD) controls. Several different acoustic measures of vowel centralization and variability were applied to tokens of corner vowels (/ɑ/, /æ/, /i/, /u/) produced in common English words. Intelligibility was assessed for single-word productions of speakers with DS, by means of transcriptions from 14 adult listeners. Results Group differentiation was found for some, but not all, of the acoustic measures. Low vowels were more acoustically centralized and variable in speakers with DS than TD controls. Acoustic findings were associated with overall intelligibility scores. Vowel formant dispersion was the most sensitive measure in distinguishing DS and TD formant data. Conclusion Corner vowels are differentially affected in speakers with DS. The acoustic characterization of vowel production and its association with speech intelligibility scores within the DS group support the conclusion of motor control deficits in the overall speech impairment. Implications are discussed for effective treatment planning.
Article
Full-text available
The articulatory range of a speaker has previously been estimated by the shape formed by first and second formant measurements of produced vowels. In a majority of the currently available metrics, formant frequency measurements are reduced to a single estimate for a condition, which has adverse consequences for subsequent statistical testing. Other metrics provide estimates of size of vowel articulation changes only, and do not provide a method for studying the direction of the change. This paper proposes an alternative approach. Vowel formant frequencies are redefined as vectors originating from a defined center point of the vowel space fixed to a basic three-vowel frame. The Euclidean length of the vectors, the vowel formant dispersion (VFD), can be compared across conditions for evidence of articulatory expansions or reductions across conditions or speaker groups. Further, the angle component of the vowel vectors allows for analyses of direction of the reduction or expansion. Based on the range of investigations afforded by the VFD metric, and simulation experiments that compare its statistical properties with those of other proposed metrics, it is argued that the VFD procedure offers an enhanced view of vowel articulation change over rival metrics.
Article
Full-text available
Purpose The vowel space area (VSA) has been used as an acoustic metric of dysarthric speech, but with varying degrees of success. In this study, the authors aimed to test an alternative metric to the VSA—the formant centralization ratio (FCR), which is hypothesized to more effectively differentiate dysarthric from healthy speech and register treatment effects. Method Speech recordings of 38 individuals with idiopathic Parkinson’s disease and dysarthria (19 of whom received 1 month of intensive speech therapy [Lee Silverman Voice Treatment; LSVT LOUD]) and 14 healthy control participants were acoustically analyzed. Vowels were extracted from short phrases. The same vowel-formant elements were used to construct the FCR, expressed as (F2u + F2ɑ + F1i + F1u) / (F2i + F1ɑ), the VSA, expressed as ABS([F1i × (F2ɑ – F2u) + F1ɑ × (F2u – F2i) + F1u × (F2i – F2ɑ)] / 2), a logarithmically scaled version of the VSA (LnVSA), and the F2i/F2u ratio. Results Unlike the VSA and the LnVSA, the FCR and F2 i/F2 u ratio robustly differentiated dysarthric from healthy speech and were not gender sensitive. All metrics effectively registered treatment effects and were strongly correlated with each other. Conclusion Albeit preliminary, the present findings indicate that the FCR is a sensitive, valid, and reliable acoustic metric for distinguishing dysarthric from unimpaired speech and for monitoring treatment effects, probably because of reduced sensitivity to interspeaker variability and enhanced sensitivity to vowel centralization.
Article
Full-text available
Unlabelled: In a preliminary study, we documented significant changes in formant transitions associated with successful manual circumlaryngeal treatment (MCT) of muscle tension dysphonia (MTD), suggesting improvement in speech articulation. The present study explores further the effects of MTD on vowel articulation by means of additional vowel acoustic measures. Pre- and post-treatment audio recordings of 111 women with MTD were analyzed acoustically using two measures: vowel space area (VSA) and vowel articulation index (VAI), constructed using the first (F1) and second (F2) formants of 4 point vowels/ a, i, ae, u/, extracted from eight words within a standard reading passage. Pairwise t-tests revealed significant increases in both VSA and VAI, confirming that successful treatment of MTD is associated with vowel space expansion. Although MTD is considered a voice disorder, its treatment with MCT appears to positively affect vocal tract dynamics. While the precise mechanism underlying vowel space expansion remains unknown, improvements may be related to lowering of the larynx, expanding oropharyngeal space, and improving articulatory movements. Learning outcomes: The reader will be able to: (1) describe possible articulatory changes associated with successful treatment of muscle tension dysphonia; (2) describe two acoustic methods to assess vowel centralization and decentralization, and; (3) understand the basis for viewing muscle tension dysphonia as a disorder not solely confined to the larynx.
Article
Full-text available
In order to examine the role of hearing status in controlling coarticulation, eight English vowels in /bVt/ and /dVt/ syllables, embedded in a carrier phrase, were elicited from 7 postlingually deafened adults and 2 speakers with normal hearing. The deaf adults served in repeated recording sessions both before and up to a year after they received cochlear implants and their speech processors were turned on. Each of the two hearing control speakers served in two recording sessions, separated by about 3 months. Measures were made of second formant frequency at obstruent release and at 25 ms intervals until the final obstruent. An index of coarticulation, based on the ratio of F2 at vowel onset to F2 at midvowel target, was computed. Changes in the amount of coarticulation after the change in hearing status were small and nonsystematic for the /bVt/ syllables; those for the /dVt/ syllables averaged a 3% increase—within the range of reliability measures for the 2 hearing control speakers. Locus equations (F2 at vowel onset vs. F2 at vowel midpoint) and ratios of F2 onsets in point vowels were also calculated. Like the index of coarticulation, these measures tended to confirm that hearing status had little if any effect on coarticulation in the deaf speakers, consistent with the hypothesis that hearing does not play a direct role in regulating anticipatory coarticulation in adulthood. With the restoration of some hearing, 2 implant users significantly increased the average spacing between vowels in the formant plane, whereas the remaining 5 decreased that measure. All speakers but one also reduced vowel duration significantly. Four of the speakers reduced dispersion of vowel formant values around vowel midpoint means, but the other 3 did not show this effect.
Article
Acoustic methods have progressed to the point that an acoustic typology of the motor speech disorders can be constructed from a parameteric assessment of the speech subsystems (e.g., phonation, nasal resonance, vowel articulation, consonant articulation, intonation, and rhythm). The results of this analysis can be interpreted in respect to global functions in speech (e.g., voice quality, intelligibility, and prosody). This paper reviews studies showing that specific acoustic analyses have demonstrated or potential value toward the overall goal of constructing acoustic profiles of dysarthria and apraxia of speech. Several different acoustic measures are relevant to the study of the motor speech disorders, and these are increasingly supported by normative data and by guidelines for clinical application. Examples of these applications are discussed for a variety of specific neurologic diseases or perceptual types of disorder. Acoustic studies are useful in the study of motor speech disorders and recent progress points to a parametric analysis.