VOWEL SPACE, SPEECH RATE AND LANGUAGE SPACE
Beat Siebenhaar and Matthias Hahn
Universität Leipzig, Germany
siebenhaar@uni-leipzig.de, matthias.hahn@uni-leipzig.de
ABSTRACT
Acceleration of speech rate is often said to be corre-
lated with a reduction of the vowel space. However,
a monocausal explanation of the vowel space reduc-
tion by speech rate is surely too simplistic. With our
regionally balanced database of a German text read in
two reading tempi we present geolinguistic maps of
• the different sizes of the vowel space and
• different changes of the sizes of the vowel space
  when comparing normal and accelerated speech
  rates.
These maps for the normal reading tempo show
regional patterns of vowel space sizes for the long and
short vowel system. Accelerating speech rate affects
the vowel space size in regionally specific patterns. In
addition, increasing reading tempo shows a surprising
general effect: initially large vowel spaces are reduced
while initially small vowel spaces are enlarged.
Interestingly, vowel space size and its change due to
accelerating reading tempo reflect traditional dialect
regions only to a limited extent.
Keywords: German, speech rate, tempo, vowel
space, geophonetics
1. INTRODUCTION
Variationist geolinguistic and sociophonetic research
focuses on specific sounds or the relation of sounds
within a linguistic system. Realisations of specific
sounds are correlated to areal, social or interactional
factors. The vowel space is architecturally conceptu-
alized in terms of relative questions like: Is /eː/ more
fronted than /iː/? Where is the position of an /aː/?
Usually, the size of the vowel space itself is not under
investigation. However, comparison of different
studies makes it obvious that the vowel space area is
quite variable. Fig. 1 shows different representations
of the vowel space for the long monophthongs of
Standard German. These studies (except for [13],
which uses data from a word list) represent data from
reading tasks and have been analysed differently, so that
the results are only comparable to a limited extent.
Recordings of [13] were made in Berlin; however,
the authors claim that their students had no dialectal
influence. [15] analyses the Kiel Corpus [5] with
recordings from the northwestern part of Germany.
[11] examines data from Vienna, representing an
Austrian standard, and [14] looks at data from Leipzig
in East Central Germany.
Figure 1: Comparison of the vowel space of long
vowels in Standard-intended German recordings.
This comparison makes it evident that there is
variation in the geometry of the vowel space as well
as in the size of the vowel space. It also suggests that
there is regional variation in Standard-intended German
speech, that is, Standard German as performed
regionally [4, 7, 8]. Moreover, the analyses [5, 11, 12,
14, 15] document stylistic differences and [18] shows
that speech rate affects articulation accuracy in many
ways. In addition to segment elision and an increase
in coarticulation phenomena, vowel undershoot is a
typical feature attributed to higher speech rate. The
hypothesis in [9] postulates that a vowel target may
not be reached by the articulators under the temporal
constraints of an accelerated speech rate, resulting in
formant undershoot and a reduction in the vowel
space size; however, this theory is by no means uni-
versally accepted and findings are inconsistent [12].
With a geographically balanced dataset of German
read speech in two speech rates [1, 2, 3] we will focus
on the following questions:
• Is there regional variation in vowel space size?
• Is there regional variation in how long and short
  vowels pattern in terms of vowel space size?
• How does acceleration of speech rate affect these
  patterns?
2. DATA AND METHODS
2.1. Data
The data are part of the “Deutsch heute” corpus from
the Institute for German Language (IDS) [1, 7]. For
this study the Aesop fable The North Wind and the
Sun was used as reading material. This was recorded
twice per speaker, once in a “normal” reading tempo,
and then in a “fast” reading tempo. The subjects were
two male and two female high school students per
place. They were aged 17–20, local to the area under
investigation, and not professional speakers. The
recording took place during lesson time in school in a
quiet room. The study area covers the whole conti-
guous German-speaking area of Germany, Austria,
Switzerland, Liechtenstein, East Belgium and South
Tyrol, from which 161 evenly distributed locations
were selected. Altogether, there are 644 recordings in
two reading tempi.
2.2. Preprocessing
For forced alignment we used the Munich Automatic
Segmentation tool (MAUS) [6]. Then the first two
formants were measured in the central 60% of the
monophthongs. A script calculated formant values at
30 positions, then the median of these formant values
was taken for each of the 260,000 individual vowels.
In order to equalise gender and individual differences,
the data were z-normalised [10]; that is, they were
transformed so that the centre of all vowels is 0 and
the distribution has a standard deviation of 1. These
z-normalised data were the basis for the measurement
of the vowel space, which is expressed in squared
z-normalised units.
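The preprocessing described above can be illustrated with a short sketch. It is not the script used for the study: the function names and the data layout are hypothetical, normalisation is assumed to be applied per speaker as in Lobanov [10], and the use of the population standard deviation is an assumed choice.

```python
from statistics import mean, median, pstdev


def vowel_token_formants(f1_track, f2_track):
    """Collapse the formant samples of one vowel token to their medians.

    f1_track and f2_track hold the samples taken within the central part
    of the vowel (30 per formant in the setup described in the text).
    """
    return median(f1_track), median(f2_track)


def z_normalise(tokens):
    """z-normalise the tokens of one speaker.

    tokens is a list of (label, F1, F2) tuples. Each formant is centred on
    the speaker's mean over all vowel tokens and divided by the standard
    deviation (population SD here, which is an assumption).
    """
    f1 = [t[1] for t in tokens]
    f2 = [t[2] for t in tokens]
    m1, s1 = mean(f1), pstdev(f1)
    m2, s2 = mean(f2), pstdev(f2)
    return [(label, (a - m1) / s1, (b - m2) / s2) for label, a, b in tokens]
```

After this step every speaker's F1 and F2 values have mean 0 and standard deviation 1, so areas computed from them are in squared z-units and can be compared across speakers.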
2.3. Measuring the vowel space
The vowel space of the German standard comprises
six vowels. However, the North Wind and the Sun
features only one occurrence of /ɛː/ and of /uː/ for
each recording, so the data density for these two
vowels is quite weak. Therefore, we only measure the
vowel space for long vowels in the area between /iː/,
/eː/, /aː/, and /oː/, marked in Fig. 2 in grey.
Figure 2: Vowel space of the Standard German
long vowel system (solid line) and the area used for
this paper (shaded area).
The exclusion of /ɛː/ is not of great concern for this
analysis, since there is a tendency for /ɛː/ and /eː/ to
merge as /eː/ in the north of the German-speaking area
and in Austria [7]. If /ɛː/ were taken into account, the
vowel spaces would no longer be directly compa-
rable. Omitting /uː/ is more problematic due to its
position in the high back corner of the vowel space.
However, the available data are not sufficient for the
analysis of /uː/.
For the vowel space of the short vowels, we use
the corresponding area between /ɪ/, /ɛ/, /a/, and /ɔ/.
To calculate each vowel space, the first two for-
mants of every individual monophthong were mea-
sured. Thirty measurements were taken within the
central 60% of each vowel, and the median of these
was calculated. The data were then Lobanov- or
z-normalised [16], that is, transformed so that the centre
of all vowels received a value of 0 and the distribution
had a standard deviation of 1. This allows a comparison
of speakers with various vocal physiologies [17].
These normalised formant measurements formed the
basis for the calculation of vowel space size. The
vowel space area for every speaker was measured for
long and short vowels separately, each in the normal
and fast reading tempi.
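The area of a four-vowel space can be computed with the shoelace formula for polygons. The following is a minimal sketch under stated assumptions: the text does not say how the corner positions were obtained, so using the mean normalised position per vowel category is an assumption, and all function names are hypothetical.

```python
from statistics import mean


def centroid(points):
    """Mean position of a list of (F2_z, F1_z) points (assumed corner estimate)."""
    xs, ys = zip(*points)
    return mean(xs), mean(ys)


def polygon_area(vertices):
    """Shoelace formula; vertices must be listed in order around the polygon."""
    total = 0.0
    n = len(vertices)
    for i in range(n):
        x1, y1 = vertices[i]
        x2, y2 = vertices[(i + 1) % n]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def vowel_space_area(tokens_by_vowel, corners=("iː", "eː", "aː", "oː")):
    """Area spanned by the corner vowels, in squared z-units.

    tokens_by_vowel maps a vowel label to a list of normalised (F2, F1)
    points. The default corners are the long vowels used in the text;
    for the short system one would pass ("ɪ", "ɛ", "a", "ɔ").
    """
    return polygon_area([centroid(tokens_by_vowel[v]) for v in corners])
```

The corner order matters: listing the vowels in the sequence /iː/, /eː/, /aː/, /oː/ walks around the quadrilateral without crossing edges, which the shoelace formula requires.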
The vowel space values were mapped with ArcGIS
using a local smoothing to level out outliers. This
means that the value mapped for every polygon
represents the median value of this polygon and all its
neighbouring polygons. Thus each area point is
represented by 20 to 32 speakers; the median is used
because it is robust against outliers.
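The smoothing step can be sketched independently of the GIS software. The code below is an illustration only, not the ArcGIS procedure itself: it assumes that the speaker values of a polygon and of its neighbours are pooled before the median is taken (one reading of "20 to 32 speakers" per area point), and the function name and the adjacency dictionary are hypothetical.

```python
from statistics import median


def smoothed_median(speaker_values, neighbours):
    """Median over each place's own speakers and those of adjacent places.

    speaker_values maps a place to the list of its speakers' vowel space
    sizes; neighbours maps a place to the list of adjacent places.
    """
    smoothed = {}
    for place, own in speaker_values.items():
        pooled = list(own)
        for other in neighbours.get(place, []):
            pooled.extend(speaker_values[other])
        smoothed[place] = median(pooled)
    return smoothed
```

Because the median ignores the magnitude of extreme values, a single speaker with an unusually large or small vowel space has little influence on the mapped value.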
3. RESULTS
3.1. Vowel spaces of males and females
Sociolinguistic research often finds differences be-
tween male and female speakers. Therefore, we first
compare them to test whether we can treat the datasets
of males and females as one larger and therefore ge-
nerally more reliable dataset, or whether they are
distinct. Over the whole dataset, the male and female
data correlate at r = 0.45 for the long vowel system
and r = 0.48 for the short vowel system, so it remains
plausible that the datasets of males and females
can be treated as one. Fig. 3 maps the vowel spaces of
long vowels in normal reading tempo. The brown
areas mark larger vowel spaces, the blue areas mark
smaller vowel spaces.
Figure 3: Comparison of the vowel space of long
vowels in normal reading tempo of male (left) and
female (right) speakers. Larger vowel spaces in
brown, smaller vowel spaces in blue.
The comparison for the long vowels in normal rea-
ding tempo shows very similar regional distributions
across genders, with only a few larger differences in
the central western part and around Berlin. The cor-
respondences between male and female speakers for
the fast reading tempo and for the short vowel system
are comparable, but each shows different areal
distributions. Finally, it can be said that the regional
distributions of vowel space size for men and women
are quite similar, though only two men and two
women were recorded per place. Therefore, in the
following analyses we treat data from males and
females in a common dataset.
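The male-female comparison reported in this section is a correlation between two series of values. A minimal sketch of Pearson's r follows; pairing the male and female values by place is an assumption, since the text does not state the unit of comparison, and the function name is hypothetical.

```python
from math import sqrt
from statistics import mean


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)
```

Here xs would hold the male and ys the female vowel space sizes, one pair per place.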
3.2. Ratio of vowel space sizes of the short vowel to
long vowel systems at different reading tempi
Before presenting the vowel spaces for the long and
short vowel systems, we will have a look at the
relationship between the two systems. Fig. 4 shows
the size of the vowel space of the short vowel system
as a proportion of the size of the vowel space of the
long vowel system. Red areas mark big differences in
the size of the vowel spaces of the long and the short
vowel system, while blue areas mark smaller differ-
ences in the size of the two vowel systems. The two
maps in Fig. 4, representing the two reading tempi,
are very consistent. This means that changes in
reading tempo affect long and short vowels in a
similar manner.
Figure 4: Ratio of the vowel space sizes of the short
vowel system to the long vowel system at normal
(left) and fast (right) reading tempi. Small differe-
nces in red, large differences in blue.
3.3. Comparing normal and fast reading tempi
When we compare the vowel space sizes for long
vowels in normal (left) and fast (right) reading tempo
(Fig. 5), we find quite different maps. This means that
change in tempo does not affect the vowel space size
of all regions in the same manner.
Figure 5: Vowel space size of the long vowel sys-
tem at normal (left) and fast (right) reading tempi.
Large vowel spaces in brown, small vowel spaces
in blue.
While the eastern part of Austria and the Ale-
mannic south have quite small vowel space areas in
both tempi, there are striking changes in the transition
zone from Bavarian to Alemannic and in the north-
eastern area of Mecklenburg-Vorpommern.
The results for the short vowel system (Fig. 6)
seem to be as disparate as for the long vowel system.
Here, the south is a bit more consistent than for the
long vowel system. For both normal and fast reading
tempi, the transition zone from Bavarian to Alemannic
and Franconian is separated from the neighbouring
areas. Moreover, in the northern Low German
area and in the Ostfalen area, we find contrary distri-
butions for the two reading tempi.
Figure 6: Vowel space size of the short vowel sys-
tem at normal (left) and fast (right) reading tempi.
Large vowel spaces in brown, small spaces in blue.
Figs. 5 and 6 show that increasing reading tempo
does not universally result in the expected reduction
of the vowel space. In some regions there is even an
enlargement of the vowel space. However, com-
parison of the corresponding maps in Figs. 5 and 6 is
rather confusing. Therefore, the ratio of the vowel
space size of the fast reading tempo to the normal
reading tempo was calculated for the long vowel
system and the short vowel system. The result is
given in Fig. 7, showing in red the reduction of the
vowel space with an increase in reading tempo, and
in blue the expansion of the vowel space with an
increase in reading tempo.
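The ratio described here, and the percent change derived from it, can be written out as a small sketch. The function names are hypothetical, and the inputs are assumed to be the vowel space sizes of one place in squared z-units.

```python
def tempo_ratio(area_normal, area_fast):
    """Vowel space size at fast tempo as a proportion of the size at normal tempo."""
    return area_fast / area_normal


def percent_change(area_normal, area_fast):
    """Change in vowel space size from normal to fast tempo, in percent.

    Negative values mean reduction, positive values mean enlargement.
    """
    return (area_fast / area_normal - 1.0) * 100.0
```

A ratio below 1 (a negative percent change) corresponds to a reduction of the vowel space with increasing tempo, a ratio above 1 to an enlargement.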
Figure 7: Vowel space size change with increasing
tempo of the long vowel system (left) and the short
vowel system (right). Reduction of vowel spaces in
red, enlargements of vowel spaces in blue.
When we compare the changes for the long and
short vowel systems, the two pictures show a very
high correspondence. The areas where the vowel
space is enlarged when people speak faster are almost
identical. So, concerning the adjustments of vowel
space size, strategies for speaking faster are identical
for the long and the short systems.
When comparing the percent change of the vowel
space size with total vowel space size at normal
reading tempo (the left-side maps in Figs. 5 and 6), we
get a quite surprising result. Fig. 8 shows a negative
correlation between vowel space size in normal tempo
and the percent change in vowel space size when
reading faster. The regression models are highly
significant (long vowel system (left): F(1,160) =
271.9169; p < 0.001; short vowel system (right):
F(1,160) = 201.2398; p < 0.001).
So, we find that regions with a relatively large vowel
space in normal reading tempo show reduction of the
vowel space when reading faster. In regions where
people use a relatively small vowel space in normal
reading tempo, they enlarge it when reading faster.
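A regression of this kind can be sketched as an ordinary least-squares fit with one predictor. This is an illustration under assumptions, not the analysis script of the study: the function name is hypothetical, x is assumed to hold the vowel space size at normal tempo per place and y the percent change at fast tempo.

```python
from statistics import mean


def simple_regression(x, y):
    """Least-squares fit y = intercept + slope * x with its F statistic.

    Returns (slope, intercept, f_stat, (df_model, df_residual)).
    """
    n = len(x)
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = my - slope * mx
    fitted = [intercept + slope * xi for xi in x]
    ss_reg = sum((fi - my) ** 2 for fi in fitted)                 # explained variation
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))     # residual variation
    f_stat = ss_reg / (ss_res / (n - 2))
    return slope, intercept, f_stat, (1, n - 2)
```

A negative slope corresponds to the pattern described above: the larger the vowel space at normal tempo, the more it shrinks (or the less it grows) at fast tempo.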
Figure 8: Linear regressions, change of vowel
space size (%) in fast reading tempo by vowel space
size in normal reading tempo by place. Long vowel
system (left) and short vowel system (right).
As the vowel space size shows regional patterning,
we also find a related geolinguistic distribution of the
different strategies when speeding up the speaking
tempo (Fig. 7). The maps show how the strategy of
changing vowel space size with increasing reading
tempo varies by region.
The distribution of the data in the Standard-intended
reading task does not fit the traditional dialect areas,
but it shows a new regional distribution.
4. DISCUSSION AND CONCLUSION
Our results indicate that there are regional patterns of
vowel space size in Standard-intended German.
Because the patterns are consistent across both male
and female voices, the variation seems not to be
idiolectal but rather to have a strong areal imprint.
Accelerating reading tempo affects not only the
temporal domain but also vowel space size. Our
analyses likewise show regionally specific reduction
patterns for the vowel space.
The relationship between the vowel space sizes of
long and short vowels remains fairly stable between
reading tempi. Increasing the reading tempo has two
different effects: large vowel space sizes are reduced,
small vowel space sizes are enlarged.
The geographical distributions of vowel space size
in read speech and change due to reading tempo only
slightly reflect traditional dialect regions. Overall, the
geographical distribution is surprising as the vowel
space is based on the qualities of the vowels. How-
ever, the traditional maps show relatively large,
mostly phonologically motivated differences, whereas
the present analysis is phonetic. Since we do not
have the corresponding geolinguistic data on the vowel
space of the dialects to make a direct comparison,
we have to refer to other dialect classifications based
on the segmental phonetic level and on morphological
structures. Yet there are hardly any similarities to
these dialect classifications. The geographical
distribution of vowel space size may be interpreted as
having little correspondence to segmental phonetic
and morphological structures because the linguistic
system is independent of its phonetic realisation. This
difference would then have nothing to do with the
opposition of dialect and standard. One could also
argue that this distribution has more to do with the
regional distribution of speech rate or segment reduc-
tion than with dialect areas [2, 3]. But even these
relationships cannot be established unambiguously.
Thus, the design and change of the vowel space seems
to be a relatively independent parameter, probably
due to the fact that it is hardly perceived consciously.
Despite the apparent independence from traditio-
nal dialect classifications, reasonably stable geogra-
phical patterns emerge with regard to the design of
the vowel space. These show that theories of vowel
space [9, 12, 17, 18] must take into account not only
general influences such as speech rate, style,
and coarticulation, but also social and geographical
components. This
means that the influence of speech rate on the shaping
of the vowel space is not a universal phenomenon, but
can vary even within one language.
5. REFERENCES
[1] Brinckmann, C., et al. 2008. German Today: An
areally extensive corpus of spoken Standard German.
Proc. LREC Marrakech. Paris: ELRA, 3185–3191.
[2] Hahn, M., & Siebenhaar, B. 2016. Sprechtempo und
Reduktion im Deutschen (SpuRD). In: Jokisch, O.
(ed), Elektronische Sprachsignalverarbeitung 2016.
Dresden: TUDpress, 198–205.
[3] Hahn, M., & Siebenhaar, B. 2019. Spatial Variation of
Articulation Rate and Phonetic Reduction in Standard-
Intended German. In Calhoun, S., Escudero, P.,
Tabain, M., & Warren, P. (eds.), Proceedings of the
19th International Congress of Phonetic Sciences.
Melbourne, Australia.
[4] Iivonen, A. 1994. Zur gehobenen regionalen
phonetischen Realisierung des Deutschen. In Viereck, W.
(ed), Verhandlungen des Internationalen
Dialektologenkongresses Bamberg 29.7.–4.8.1990,
vol 3. Stuttgart, 311–330.
[5] IPDS. 1994–1997. The Kiel Corpus of Read Speech
(vol. 1–4). Kiel: Institut für Phonetik und digitale
Sprachverarbeitung.
[6] Kisler, T., Reichel, U. D., & Schiel, F. 2017.
Multilingual processing of speech via web services.
Computer Speech & Language 45, 326–347.
[7] Kleiner, S. 2011ff. Atlas zur Aussprache des
deutschen Gebrauchsstandards (AADG).
http://prowiki.ids-mannheim.de/bin/view/AADG/.
[8] König, W. 1989. Atlas zur Aussprache des
Schriftdeutschen in der Bundesrepublik Deutschland.
Ismaning: Hueber.
[9] Lindblom, B. 1963. Spectrographic study of vowel
reduction. J. Acoustical Soc. of America 35,
1773–1781.
[10] Lobanov, B. M. 1971. Classification of Russian
Vowels Spoken by Different Speakers. J. Acoustical
Soc. of America 49, 606–608.
[11] Moosmüller, S. 2007. Vowels in Standard Austrian
German. An Acoustic-Phonetic and Phonological
Analysis. Wien.
http://www.kfs.oeaw.ac.at/publications/habil_2007may28_tableofcontents_zus_final.pdf
[12] Schulz, E., Oh, M. O., Andreeva, B., & Möbius, B.
2016. Impact of prosodic structure and information
density on vowel space size. Proceedings of Speech
Prosody 2016. Boston: ISCA, 350–354.
[13] Sendlmeier, W. F. & Seebode, J. 2006.
Formantkarten des deutschen Vokalsystems.
https://www.kw.tu-berlin.de/fileadmin/a01311100/Formantkarten_des_deutschen_Vokalsystems_01.pdf
[14] Siebenhaar, B. 2014. Instrumentalphonetische
Analysen zur Ausgestaltung des Sprechlagenspektrums in
Leipzig. Zeitschrift für Dialektologie und Linguistik
81, 151–190.
[15] Simpson, A. P. 1998. Phonetische Datenbanken des
Deutschen in der empirischen Sprachforschung und
der phonologischen Theoriebildung. Kiel: AIPUK.
[16] Thomas, E. R. & Kendall, T. 2007–2015. NORM:
The vowel normalization and plotting suite. [Online
Resource:
http://lingtools.uoregon.edu/norm/norm1.php]
[17] van der Harst, S. 2011. The Vowel Space Paradox. A
Sociophonetic Study on Dutch. Utrecht: LOT.
[18] Weiss, B. 2008. Sprechtempoabhängige
Aussprachevariationen. Berlin.
http://edoc.hu-berlin.de/dissertationen/weiss-benjamin-2008-05-28/PDF/weiss.pdf