Content uploaded by Laura Becker
Author content
All content in this area was uploaded by Laura Becker on Feb 11, 2025
Content may be subject to copyright.
Available via license: CC BY 4.0
Content may be subject to copyright.
Zero marking in inection:
A token-based approach
Laura Becker
ABSTRACT
Keywords:
token-based
typology,
corpus typology,
zero marking,
zero exponence
1INTRODUCTION
1
1
Journal of Language Modelling
Laura Becker
et al.
et al.
et al.
Zero marking in inection
et al.
2ZERO MARKING
Zero marking and coding eciency
Laura Becker
The grammatical form-frequency correspondence hypothesis
2
2
et al.
et al.
Zero marking in inection
et al.
Laura Becker
A working denition of zero markers
3
4
3
4
Zero marking in inection
5
Stem
Marker
5
Laura Becker
Zero Marker
/deɪ/
day /deɪz/ day
/deɪ/
/z/ /z/
day
A A
6
6
Zero marking in inection
MarkerA
A
Zero MarkerA
A A
7
A A
A A
7
Laura Becker
Zero marking in inection
cantaríamos
cant-a-r-í-a-mos canta-
ríamos cant-a-ría-mos canta-r-í-a-mos cantar-í-amos
Laura Becker
3 DATASET AND SEGMENTATION
Dataset
et al.
8
8
affixation.csv lemmas.csv
https://osf.io/p4mkc/?view_only=
5238ace9cb1d4f4d998486ebb28f4fd8
Zero marking in inection
Data pre-processing
Laura Becker
9
10
9
10
preprocessing.txt
code-preprocessing.R
Zero marking in inection
11
et al.
12
11
12 epitran.py
Laura Becker
allumer
allume allumes allume allumons
allumais allumais allumait allumions
allumai allumas allumat allumâmes
allumerai allumeras allumera allumerons
allumerais allumerais allumerait allumerions
allume allumes allume allumions
allumasse allumasses allumât allumassions
allumer
13
13 affixation.csv
Zero marking in inection
Extracting stems and zero markers
14
allumer
15
alym
14
15
Laura Becker
allumer
et al.
Zero marking in inection
anu
chaski
ts’ers
ak’etebs 16
da- ga-
-eb ak’etebs
16 -a- ak’etebs
k’etebi
Laura Becker
ts’ers
ak’etebs
-eb/-ob
ts’ers
ʔarsala
iktašafa
ban ceoj
Zero marking in inection
ʔarsala
iktašafa
ban
ceoj
Stem alternations and suppletion
A
Laura Becker
A
A
17
A
Kloß
Kloß
17
affixation.csv
Zero marking in inection
A
Kloß
Kloß kls
/o/ /ø/
Kloß
/o/ /ø/ A
gyomor
-or
ɟomr
A
gyomor
Laura Becker
absúrden
A
-o-
A
gyomor
absúrden
/rdn/
/-e-/
A
køgɁ²
køgɁ²
Zero marking in inection
A
køgɁ²
A
A
aiddolaš /-š//-čč/
/-žž/
bahá
Laura Becker
aiddolaš
bahá
(A) (A)
think
go think
θ-
go
think go
(A) (A)
Zero marking in inection
overthink undergo
-ɡow -ɪŋk
A
A
A
A
Hapax legomenon markers
Laura Becker
čáppat
-pp -bb
čáppat
fái
maːkabeʀə
-b -p
-b
maːkapʁɐ -p
Morphomic paradigms
Zero marking in inection
4ESTIMATING THE PROBABILITY OF
ZERO MARKERS
Observed distributions
Laura Becker
adjective
noun
verb
0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00
0
20
40
60
80
proportion of zero markers
N languages
18
18
Zero marking in inection
19
19
Laura Becker
pfxpfx+sfxsfx
has_ifx
Modelling the probability of zero marking
et al.
brms
20
20 et al.
code-phylogeny.R
Zero marking in inection
21
22
21
et al.
code-prob.R
22 A
ce-probcheck-mu-<predictor>.pdf
ce-probcheck-zoi-<predictor>.pdf
Laura Becker
0.0
0.1
0.2
0.3
0.4
0.5
A N V
part of speech
probability of zero marking
0.0
0.1
0.2
0.3
0.4
0.5
has_ifx pre pre+sfx sfx
affix position
probability of zero marking
0.0
0.1
0.2
0.3
0.4
0.5
epitran not phon. phon.
representation
probability of zero marking
0.0
0.1
0.2
0.3
0.4
0.5
−2 −1 0 1 2
N values per cell (standardized)
probability of zero marking
0.0
0.1
0.2
0.3
0.4
0.5
−2 −1 0 1 2
N lemmas (standardized)
probability of zero marking
Zero marking in inection
0.00
0.25
0.50
0.75
1.00
A N V
part of speech
probability of no zero marking
0.00
0.25
0.50
0.75
1.00
has_ifx pre pre+sfx sfx
affix position
probability of no zero marking
0.00
0.25
0.50
0.75
1.00
epitran not phon. phon.
representation
probability of no zero marking
0.00
0.25
0.50
0.75
1.00
−2 −1 0 1 2
N values per cell (standardized)
probability of no zero marking
0.00
0.25
0.50
0.75
1.00
−2 −1 0 1 2
N lemmas (standardized)
probability of no zero marking
Laura Becker
23
0.00
0.25
0.50
0.75
1.00
A N V
part of speech
probability of no zero marking
0.00
0.25
0.50
0.75
1.00
has_ifx pre pre+sfx sfx
affix position
probability of no zero marking
23 code-morphomic.R
ce-probmorph-mu-<predictor>.pdf
ce-probmorph-zoi-<predictor>.pdf
Zero marking in inection
5FUNCTIONS ASSOCIATED WITH ZERO
MARKING
Cells with the highest probability of zero marking
≥
24
25
24
cells-merged.csv
25
code-cells.R
Laura Becker
26
26
ce-cells-check-<predictor>.pdf
Zero marking in inection
0.00
0.25
0.50
0.75
1.00
INDF.NEUT.SG_A
PL.VOC_A
NOM.SG_A
DAT.SG_N
GEN.PL_N
DEF.NOM.SG_N
INDF.PL_N
ACC.SG_N
NOM.SG_N
INDF.SG_N
2SG.PRS_V
2SG.PRS.SBJV_V
3SG.PRS.SBJV_V
1SG.PRS.SBJV_V
3SG.PST_V
1SG.PRS_V
3SG.PRS_V
2SG.IMP_V
cells
probability of zero marking
data observed predicted
0.00
0.25
0.50
0.75
1.00
−0.5 0.0 0.5 1.0 1.5 2.0
N values (standardized)
probability of zero marking
0.00
0.25
0.50
0.75
1.00
0 2 4 6
N lemmas (standardized)
probability of zero marking
Laura Becker
27
Values with the highest probability of zero marking
27
Zero marking in inection
≥
28
29
30
28
values-merged.csv
29
code-values.R
30
ce-values-check-<predictor>.pdf
Laura Becker
0.00
0.25
0.50
0.75
1.00
NOM_A
INDF_A
DEF_N
DAT_N
PL_N
GEN_N
VOC_N
ACC_N
NOM_N
SG_N
INDF_N
SBJV_V
PROG_V
PL_V
1SG_V
NFIN_V
3SG_V
2SG_V
PRS_V
SG_V
IMP_V
values
probability of zero marking
data observed predicted
0.00
0.25
0.50
0.75
1.00
has_ifx pre pre+sfx sfx
affix position
probability of zero marking
0.00
0.25
0.50
0.75
1.00
012
N cells (standardized)
probability of zero marking
0.00
0.25
0.50
0.75
1.00
0 5 10 15
N lemmas (standardized)
probability of zero marking
Zero marking in inection
6THE FREQUENCY OF ZERO MARKERS
IN LANGUAGE USE
et al.
A
Laura Becker
adjective
noun
verb
0 5 10 0 5 10 0 5 10
0
2
4
6
8
10
12
log token frequency of markers
marker length in N phonological segments
marker type overt zero
31
31
et al. et al.
Zero marking in inection
et al. 32
32 code-ud.R
Laura Becker
0.8
0.9
1.0
1.1
0 5 10
marker log frequency
N phonological segments
affix position
has_ifx
pre
pre+sfx
sfx
0.8
0.9
1.0
1.1
has_ifx pre pre+sfx sfx
affix position
N phonological segments
0.4
0.5
0.6
0 5 10
marker log frequency
probability of zero marker
affix position
has_ifx
pre
pre+sfx
sfx
0.4
0.5
0.6
has_ifx pre pre+sfx sfx
affix position
probability of zero marker
Zero marking in inection
7DISCUSSION
The probability of zero marking
Laura Becker
Cells and values associated with zero marking
Zero marking in inection
et al.
Laura Becker
Frequency eects and ax position
et al.
et al.
et al. et al.
et al.
et al.
Zero marking in inection
Support for the non-development scenario
of zero markers
et al.
Laura Becker
8 CONCLUSION
Zero marking in inection
ABBREVIATIONS
REFERENCES
Imperatives and commands
East and
West
A-morphous morphology
Laura Becker
Znaki czy nie znaki? – II. zbiór prac lingwistycznych
Morphological complexity
Word Structure
The Oxford guide
to Uralic languages
Understanding and measuring morphological complexity
Linguistics Vanguard
Positional faithfulness
Proceedings of the Society for Computation in Linguistics
Agreement from a
diachronic perspective
Language
Yearbook of Morphology 2005
Journal of Linguistics
Word and paradigm morphology
Language
Language
Zero marking in inection
Word
Structure
All things morphology: Its independence and its interfaces
Cognition
The Cambridge handbook of morphology
Models of inection
Perspectives on grammaticalization
Handbook of historical
linguistics
Frequency of use and the organization of language
The Oxford
handbook of typology
Language change
Studies in Language
Studies in typology and
diachrony. Papers presented to Joseph H. Greenberg on his 75th birthday
Journal of Statistical Software
Laura Becker
Journal of Statistical Software
Journal of Phonetics
Journal of Phonetics
Journal of Phonetics
Laboratory Phonology
The
Oxford handbook of inection
Case and Agreement in Panará (... and Beyond)
Advances in Functional Linguistics
Language
Explanation in typology: Diachronic sources, functional
motivations and the nature of the evidence
Folia Linguistica
Typology and universals
The paradigmatic structure of person marking
Oxford research encyclopedia of linguistics
Zero marking in inection
Cours de linguistique générale
Diacritics
https://www.jstor.org/stable/20616535
The grammar network: How linguistic structure is shaped
by language use
The morphology and phonology of exponence
Journal of Phonetics
Studies in Language
On understanding grammar
Universals of language
Language universals: With special reference to feature
hierarchies
Linguistics
Vanguard
Linguistic Typology
Studies in linguistic analysis
Explaining
language universals
Glottolog 4.4
http://glottolog.org
Linguistic universals and language
change
Laura Becker
Cognitive Linguistics
Linguistic Discovery
Aspects of linguistic variation
Journal of Linguistics
Journal of Linguistics
Linguistics
Georgian: A structural reference grammar
Anthropological Linguistics
Language
Russian and Slavic grammar: Studies 1931-1981
Frequency and the
emergence of linguistic structure
Phonetic interpretation. Papers in
laboratory phonology VI
The role of prosodic phrasing in Korean word segmentation
https://linguistics.ucla.edu/wp-content/uploads/2021/11/
SahyangKim_dissertation.pdf
Zero marking in inection
Yearbook of Morphology 1994
Linguistic Workshop II: Arbeiten Des
Kölner Universalienprojekts 1973/4
Thoughts on grammaticalization
Glossa
Communicative eciency: Language structure and use
Journal of Linguistics
Inectional morphology: A theoretical study
based on aspects of Latin verb conjugation
Proceedings of the 12th Language Resources and Evaluation
Conference
https://aclanthology.org/2020.lrec-1.483
Studia
Linguistica
Das Zéro-Problem in der Linguistik. Kritische
Untersuchungen zur strukturalistischen Analyse der Relevanz sprachlicher Form
Studies in Language
Morphology 2000: Selected
Laura Becker
papers from the 9th Morphology Meeting, Vienna, 24–28 February 2000
Annual Meeting of the Berkeley
Linguistics Society
Lingue e linguaggio
Proceedings of the 11th International Conference on Language Resources
and Evaluation (LREC 2018)
https://aclanthology.org/L18-1429
Finiteness: Theoretical and empirical foundations
The Cambridge handbook of morphology
Proceedings of the National Academy
of Sciences
Rivista di Linguistica
Proceedings of the West Coast Conference on Formal Linguistics
R: A language and environment for statistical computing
https://www.R-project.org/
The “broken” plural problem in Arabic and comparative
Semitic
Transactions of the Philological
Society
A short history of linguistics
Language typology and syntactic
description. Volume 1
Zero marking in inection
Theoretical Morphology
Language
Humanities and Social Sciences Communications
Essais de linguistique generale
et de typologie linguistique oerts au professeur Denis Creissels à l’occasion de ses 65
ans
Phonological augmentation in prominent positions
Linguistic typology
Word Structure
Linguistics Vanguard
Language Typology and Universals
Inectional morphology: A theory of paradigm structure
Morphological typology: From word to
paradigm
https://unimorph.github.io/doc/unimorph-schema.pdf
Word Structure
The
morphology and phonology of exponence
Laura Becker
Statistics and
Computing
Journal of Phonetics
Studies in Language
Natural Language & Linguistic Theory
Zero marking in inection
Laura Becker
Zero marking in inection
http://hdl.handle.net/11234/1-5287
The psychobiology of language: An introduction to
dynamic philology
Proceedings of
the Eleventh Annual Meeting of the Berkeley Linguistics Society
Laura Becker
0000-0002-1835-9404
Zero marking in inection: A token-based approach
https://dx.doi.org/10.15398/jlm.v12i2.361
Creative Commons Attribution 4.0 Public License
http://creativecommons.org/licenses/by/4.0/