Code Switching: Linguistic

Abstract

Though code-switching (CS) has been exceptionally well researched, controversy continues to reign over its identity, structure, and the rules governing its use. This article relates the state of the field to epistemological differences among researchers with respect to data, methods, the nature of evidence, and principles of scientific proof. Progress toward developing accountable theories of CS will be achieved once greater efforts are made to distinguish (1) apparently homologous but fundamentally distinct manifestations of language contact (currently subsumed under the label of CS) and (2) robust community-based usage patterns from isolated counterexamples.
Code switching: Linguistic
Code-switching (CS) refers to the mixing, by bilinguals (or multilinguals), of two or more languages in discourse, often with no change of interlocutor or topic. Such mixing may take place at any level of linguistic structure, but its occurrence within the confines of a single sentence, constituent, or even word, has attracted most linguistic attention. This article surveys the linguistic treatment of such intrasentential switching.
In combining languages intrasententially, various problems of incompatibility may arise. The most obvious derive from word order differences: under what conditions, if any, can the boundary between constituents ordered differently in two languages host a switch? Other potential combinatorial difficulties involve mismatches in grammatical categories, subcategorization patterns, morphology, and idiomatic expressions. Systematic examination of the spontaneous speech of bilinguals resident in a wide range of communities suggests, however, that speakers generally manage to circumvent these difficulties. CS tends not to produce utterances that contain monolingually ungrammatical sentence fragments. Discovery of the mechanisms enabling such ‘grammatical’ CS is the major goal of current research. Central questions include locating permissible switch sites and ascertaining the nature (hierarchical or linear, variable or categorical) of the constraints on switching.
1. Background
Though CS is apparently a hallmark of bilingual communities world-wide, it has only begun to attract serious scholarly attention in the last few decades. Researchers first dismissed intrasentential code-switching as random and deviant (e.g., Weinreich 1953/1968) but are now unanimous in the conviction that it is grammatically constrained. The basis for this conviction is the empirical observation that bilinguals tend to switch intrasententially at certain (morpho)syntactic boundaries and not at others. Early efforts to explain these preferences proceeded by proscribing certain switch sites, for example, between pronominal subjects and verbs or between conjunctions and their conjuncts. However, these particular sites were soon reported to figure among the regular CS patterns of some bilingual communities.
The first more general account of the distribution of CS stemmed from the observation that CS is favored at the kinds of syntactic boundaries which occur in both languages. The equivalence constraint (Poplack 1980) states that switched sentences are made up of concatenated fragments of alternating languages, each of which is grammatical in the language of its provenance (see also Muysken 2000). The boundary between adjacent fragments occurs between two constituents that are ordered in the same way in both languages, ensuring the linear coherence of sentence structure without omitting or duplicating lexical content.
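The linear side of the equivalence constraint can be illustrated with a deliberately simplified sketch. It assumes each language's version of a sentence is given as a flat list of constituent labels (the labels and the function name `equivalent_switch_points` are hypothetical), and it ignores constituent hierarchy entirely, so it is a toy reading of the constraint rather than any published formalization.

```python
from typing import List


def equivalent_switch_points(order_a: List[str], order_b: List[str]) -> List[int]:
    """Return boundary indices k at which a switch could occur.

    Toy criterion (an assumption for illustration only):
      1. the constituents on either side of boundary k are the same in
         both languages, so the boundary exists in both word orders;
      2. both languages have produced the same constituents before k,
         so switching there neither omits nor duplicates content.
    """
    if sorted(order_a) != sorted(order_b):
        raise ValueError("both orders must contain the same constituents")
    points = []
    for k in range(1, len(order_a)):
        same_neighbours = (order_a[k - 1] == order_b[k - 1]
                           and order_a[k] == order_b[k])
        same_prefix = sorted(order_a[:k]) == sorted(order_b[:k])
        if same_neighbours and same_prefix:
            points.append(k)
    return points


# Hypothetical labels: the two languages differ only in adjective-noun order.
lang_a = ["SUBJ", "V", "DET", "ADJ", "N"]
lang_b = ["SUBJ", "V", "DET", "N", "ADJ"]

print(equivalent_switch_points(lang_a, lang_b))  # [1, 2]
```

In this toy example only the boundaries after the subject and after the verb qualify. The boundaries around the adjective and noun are excluded because the two orders diverge there. A flat model of this kind is stricter than one that also takes structural hierarchy into account.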
That general principles, rather than atomistic constraints, govern CS is now widely accepted, though there is little consensus as to what they are or how they should be represented. Much current research assumes unquestioningly that the mechanisms for language switching follow directly from general principles of (monolingual) grammar. Theories based on this assumption tend to appeal to such abstract grammatical properties as inter-constituent relationships (e.g., government, case assignment) and/or language-specific features of lexical categories (i.e., subcategorization of grammatical arguments, inherent morphological features).
Since Klavans’s (1985) proposal that CS was constrained by structural relations, the formal linguistic theories successively in vogue have each been extended to encompass the data of CS. Di Sciullo et al. (1986), for example, identified the relevant relations as C-command and government: CS cannot occur where a government relation holds. Replacement of the function of government in standard theory by the notion of feature agreement led to a parallel focus on feature matching in CS studies. The Functional Head Constraint (Belazi et al. 1994) adds language choice to the features instantiated in functional and lexical categories, prohibiting CS where a mismatch occurs. A more recent Minimalist proposal (MacSwan 1999) restricts CS at structural sites showing cross-language differences in monolingual features.
Poplack, Shana. 2001. Code-switching (Linguistic). In Smelser, Neil & Baltes, Paul (eds.), International Encyclopedia of the Social and Behavioral Sciences. Elsevier Science Ltd. 2062-2065.
This distinction between lexical and functional categories is not new to CS research. It is a hallmark of theories invoking the complement structure of individual lexical items to characterize permissible CS sites (e.g., Joshi 1985) and its sequel, the Null Theory of CS (Santorini and Mahootian 1995; see also Bentahila and Davies’s Subcategorisation Constraint 1983). Perhaps the most detailed model involving the contrast between lexical properties and functional (or ‘system’) morphemes is the Matrix Language Frame model (Azuma 1993, Myers-Scotton 1993). Here, structural constraints on CS result from a complex interaction between a dominant matrix language and the prohibition against embedding ‘system’ morphemes from the ‘embedded’ language in matrix language structure.
The assumption that bilingual syntax can be explained by general principles inferred from the study of monolingual grammar has not yet been substantiated. While formal theories of grammar may account well for monolingual language structure, including that of the monolingual fragments in CS discourse, there is no evidence to suggest that the juxtaposition of two languages can be explained in the same way. Bilingual communities exhibit widely different patterns of adapting monolingual resources in their code-mixing strategies, and these are not predictable through purely linguistic considerations. The equivalence constraint, as formalized by Sankoff (1998), is a production-based explanation of the facts of CS, which incorporates the notions of structural hierarchy and linear order, and accounts for a number of empirical observations in addition to the equivalent word order characterizing most actual switch sites. These include the well-formedness of the monolingual fragments, the conservation of constituent structure, and the essential unpredictability of CS at any potential CS site. The mechanisms of monolingual and bilingual grammars are not assumed a priori to be identical.
2. Evaluating CS Theories
There has been remarkably little cross-fertilization among CS theories; indeed, each has been greeted with a host of counter-examples. Testing the fit of competing models against the data of CS should be a straightforward matter since they often make competing predictions. But their disparate assumptions, goals, and domains of application have hindered such efforts. Assessment of the descriptive adequacy of a theory of CS requires that at least two methodological issues be resolved. One involves classification of other-language phenomena, the other, confronting the predictions of the theory with the data of actual bilingual behavior.
It is uncontroversial that CS differs from the other major manifestation of language contact: lexical borrowing. Despite etymological identity with the donor language, established loanwords assume the morphological, syntactic, and often, phonological, identity of the recipient language. They tend to be recurrent in the speech of the individual and widespread across the community. The stock of established loanwords is available to monolingual speakers of the recipient language, who access them normally along with the remainder of the recipient-language lexicon. Loanwords further differ from CS in that there is no involvement of the morphology, syntax, or phonology of the lexifier language.
Recent research has shown that borrowing is actually much more productive than implied above (see the papers in Poplack and Meechan 1998). In particular, the social characteristics of recurrence and diffusion are not always satisfied. This results in what has been called, after Weinreich (1953/1968), nonce borrowing (Sankoff et al. 1990). Like its established counterpart, the nonce loan tends to involve lone lexical items, generally major-class content words, and to assume the morphological, syntactic, and often, phonological identity of the recipient language. Like CS, on the other hand, nonce borrowing is neither recurrent nor widespread, and necessarily requires a certain level of bilingual competence. Distinguishing nonce borrowings from single-word CS is conceptually easy but methodologically difficult, especially when they surface bare, giving no apparent indication of language membership.
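The conceptual three-way distinction drawn above (established loanword, nonce borrowing, single-word CS) can be written as a small decision rule. The sketch below is only a restatement of those criteria under simplifying assumptions: the class `LoneItem`, its boolean fields, and the function `classify` are hypothetical names, and reducing grammatical integration to a single yes/no flag hides exactly the step described above as methodologically difficult.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoneItem:
    """Hypothetical summary of one lone other-language item."""
    integrated: bool   # assumes recipient-language morphology and syntax
    recurrent: bool    # recurs in the speech of the individual
    widespread: bool   # diffused across the community


def classify(item: LoneItem) -> str:
    # No recipient-language integration: donor-language grammar is involved.
    if not item.integrated:
        return "code-switch"
    # Integrated and meeting both social criteria: established loanword.
    if item.recurrent and item.widespread:
        return "established loanword"
    # Integrated, but the social criteria are not (both) satisfied.
    return "nonce borrowing"


print(classify(LoneItem(integrated=True, recurrent=False, widespread=False)))
# nonce borrowing
```

An item that surfaces bare gives no direct evidence for the `integrated` flag, which is why a real analysis would need quantitative comparison rather than a per-item judgment.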
The classification of lone items is at the heart of a fundamental disagreement among CS researchers over (a) whether the distinction between CS and borrowing should be formally recognized in a theory of CS, (b) whether these and other manifestations of language contact can be unambiguously identified in bilingual discourse, and (c) criteria for determining whether a given item was switched or borrowed. Researchers who consider lone other-language items to be CS tend to posit an asymmetrical relationship, in which one language dominates and other-language items are inserted (e.g., Joshi 1985, Myers-Scotton 1993). Where the class of CS is (in the first instance) limited to unambiguous multiword fragments, both languages are postulated to play a role (Belazi et al. 1994, Sankoff 1998, Woolford 1983). Muysken (2000) admits the possibility of both strategies.
The appropriateness of data is also relevant to evaluating CS theories. The literature on CS largely is characterized by the ‘rule-and-exception’ paradigm. Despite the onslaught of counter-examples provoked by successive CS theories, very few have in fact been tested systematically against the data of spontaneous bilingual usage. Instead, both the theories and tests of their applicability tend to be based on isolated examples, drawn from judgments, informant elicitation, and linguist introspection. The relation between such data and actual usage is not known; nor do they permit us to distinguish between the recurrent and systematic patterns of everyday interaction and examples which may be judged ‘acceptable’ in some sense, but which rarely or never occur.
The equivalence constraint has been verified as a general tendency in Spanish-English (Poplack 1980), Finnish-English (Poplack et al. 1987), Arabic-French (Naït M’Barek and Sankoff 1988), Tamil-English (Sankoff et al. 1990), Fongbe-French and Wolof-French (Meechan and Poplack 1995), Igbo-English (Eze 1998), French-English (Turpin 1998) and Ukrainian-English (Budzhak-Jones 1998) bilingual communities. But most of the voluminous literature on CS, especially of the ‘insertional’ type, is based on data which represents, properly speaking, lexical borrowing. As only the grammar and word order of the recipient language are pertinent to borrowing, attempts to understand the structure of CS based on a mixture of borrowing and true CS (e.g., Myers-Scotton 1993 and many others) appear unwieldy or descriptively inadequate.
3. Identifying the Results of Language Contact
Insofar as CS and borrowing are based on some principled combination of elements of the monolingual (i.e., unmixed) vernaculars of the bilingual community, it is important to have as explicit an idea as possible of the nature of these vernaculars before concluding that a code-mixed element is behaving like one or the other. The analysis of code-mixing as a discourse mode requires access to the grammars of the contact languages as they are spoken, and spoken language is characterized by structural variability. In confronting, rather than evading this variability, Sankoff et al. (1990) and Poplack and Meechan (1998) developed a method to compare bilingual structures with the unmixed source languages of the same speakers. Making use of the framework of linguistic variation theory (Labov 1969), the inherent variability of such forms is used to determine their status. If the rate and distribution of, for example, case-marking of the contentious lone other-language items show quantitative parallels to those of their counterparts in the (unmixed) recipient language, while at the same time differing from relevant patterns in the donor language, the lone other-language items are inferred to be borrowed, since only the grammar of the recipient language is operative. If they pattern with their counterparts in the (unmixed) donor language, while at the same time differing from the patterning in the unmixed recipient language, the lone other-language items must result from CS.
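The logic of this comparison can be sketched in a few lines. The example below is a simplification that compares only overall marking rates, whereas the method described above also examines the distribution and conditioning of the variation. All names (`Tally`, `classify_lone_items`, the `min_gap` threshold) and all counts are hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tally:
    """Counts of a variable feature (e.g. overt case-marking) in one data set."""
    marked: int
    total: int

    @property
    def rate(self) -> float:
        if self.total == 0:
            raise ValueError("empty data set")
        return self.marked / self.total


def classify_lone_items(lone: Tally, recipient: Tally, donor: Tally,
                        min_gap: float = 0.2) -> str:
    """Infer the status of lone other-language items from marking rates.

    min_gap is an assumed threshold: if the two unmixed baselines do not
    differ by at least this much, the feature cannot discriminate.
    """
    if abs(recipient.rate - donor.rate) < min_gap:
        return "indeterminate"
    to_recipient = abs(lone.rate - recipient.rate)
    to_donor = abs(lone.rate - donor.rate)
    if to_recipient < to_donor:
        return "borrowing"       # patterns with the recipient language
    if to_donor < to_recipient:
        return "code-switching"  # patterns with the donor language
    return "indeterminate"


# Invented counts for illustration only.
recipient = Tally(marked=170, total=200)  # unmixed recipient-language nouns
donor = Tally(marked=10, total=200)       # unmixed donor-language nouns
lone = Tally(marked=82, total=100)        # lone other-language items

print(classify_lone_items(lone, recipient, donor))  # borrowing
```

The `indeterminate` outcome reflects the requirement that the lone items both parallel one source language and differ from the other. A feature on which the two unmixed languages behave alike cannot serve as a diagnostic.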
Quantitative analysis of language mixing phenomena in typologically distinct language pairs shows that lone other-language items, especially major-class content words, are by far the most important component of mixed discourse. These lone items show the same fine details of quantitative conditioning of phonological, morphological, and syntactic variability as dictionary-attested loanwords, both of which in turn parallel their unmixed counterparts in the recipient language (Poplack and Meechan 1998). This tendency is apparent regardless of the linguistic properties of the language pair. This is evidence that most lone items are borrowed, even if only for the nonce, despite the lack, in some cases, of dictionary attestation or diffusion within the community.
4. Future Directions
Lack of consensus characterizing the discipline is related to a number of methodological problems. Foremost among them is failure to distinguish code-switching from other types of language mixture, which, despite similarities in surface manifestation, are fundamentally different mechanisms for combining languages. The current state of knowledge suggests that borrowing, nonce or established, is the major manifestation of language contact in most bilingual communities. Its linguistic structure is well accounted for in the traditional language contact literature (Weinreich 1953/1968). Intrasentential CS involving multiword fragments of two or more languages is also attested in some communities. Achievement of consensus on an empirically verifiable characterization of the rules for juxtaposing these fragments within the sentence remains an important goal for CS research. Fit between theories and data could be improved by a broader empirical base. This would permit researchers to situate bilingual behavior with respect to the monolingual vernaculars implicated in language mixing, account for the disparate CS strategies that have evolved in different bilingual communities, and distinguish among incommensurable manifestations of bilingual language contact.
See also: Bilingual Education: International Perspectives; Bilingualism and Multilingualism; Bilingualism: Cognitive Aspects; First Language Acquisition: Cross-linguistic; Second Language Acquisition; Sociolinguistics
Bibliography
Azuma S 1993 The frame-content hypothesis in speech production: Evidence from intrasentential code switching. Linguistics 31: 1071–93
Belazi H M, Rubin E J, Toribio A J 1994 Code-switching and X-bar theory: The functional head constraint. Linguistic Inquiry 25(2): 221–37
Bentahila A, Davies E E 1983 The syntax of Arabic-French code-switching. Lingua 59(4): 301–30
Budzhak-Jones S 1998 Against word-internal code-switching: Evidence from Ukrainian-English bilingualism. International Journal of Bilingualism 2(2): 161–82
di Sciullo A-M, Muysken P, Singh R 1986 Government and code-mixing. Journal of Linguistics 22(1): 1–24
Eze E 1998 Lending credence to a borrowing analysis: Lone English-origin incorporations in Igbo discourse. International Journal of Bilingualism 2(2): 183–201
Joshi A K 1985 Processing of sentences with intrasentential code-switching. In: Dowty D R, Karttunen L, Zwicky A M (eds.) Natural Language Parsing: Psychological, Computational and Theoretical Perspectives. Cambridge University Press, Cambridge, UK, pp. 190–204
Klavans J L 1985 The syntax of code-switching: Spanish and English. In: King L D, Maley C A (eds.) Selected Papers from the XIIIth Linguistic Symposium on Romance Languages. Benjamins, Amsterdam, pp. 213–21
Labov W 1969 Contraction, deletion, and inherent variability of the English copula. Language 45(4): 715–62
MacSwan J 1999 A Minimalist Approach to Intrasentential Code Switching. Garland Press, New York
Meechan M, Poplack S 1995 Orphan categories in bilingual discourse: Adjectivization strategies in Wolof-French and Fongbe-French bilingual discourse. Language Variation and Change 7(2): 169–94
Muysken P C 2000 Bilingual Speech: A Typology of Code-mixing. Cambridge University Press, Cambridge, UK
Myers-Scotton C 1993 Duelling Languages. Clarendon Press, Oxford, UK
Naït M’Barek M, Sankoff D 1988 Le discours mixte arabe/français: des emprunts ou des alternances de langue? Revue Canadienne de Linguistique 33(2): 143–54
Poplack S 1980 Sometimes I’ll start a sentence in Spanish Y TERMINO EN ESPAÑOL: Toward a typology of code-switching. Linguistics 18(7/8): 581–618
Poplack S, Meechan M (eds.) 1998 Instant Loans, Easy Conditions: The Productivity of Bilingual Borrowing; Special Issue of the International Journal of Bilingualism. Kingston Press, London
Poplack S, Wheeler S, Westwood A 1987 Distinguishing language contact phenomena: Evidence from Finnish-English bilingualism. In: Lilius P, Saari M (eds.) The Nordic Languages and Modern Linguistics. University of Helsinki Press, Helsinki, pp. 33–56
Sankoff D 1998 A formal production-based explanation of the facts of code-switching. Bilingualism: Language and Cognition 1(1): 39–50
Sankoff D, Poplack S, Vanniarajan S 1990 The case of the nonce loan in Tamil. Language Variation and Change 2(1): 71–101
Santorini B, Mahootian S 1995 Code-switching and the syntactic status of adnominal adjectives. Lingua 96: 1–27
Turpin D 1998 “Le français c’est le last frontier”: The status of English-origin nouns in Acadian French. International Journal of Bilingualism 2(2): 221–33
Weinreich U 1953/1968 Languages in contact. Mouton, The Hague
Woolford E 1983 Bilingual code-switching and syntactic theory. Linguistic Inquiry 14(3): 519–36
S. Poplack
Copyright ©2001 Elsevier Science Ltd. All rights reserved.
International Encyclopedia of the Social & Behavioral Sciences ISBN: 0-08-043076-7
... In a children case, Yow et al. (2018) showed that Code Switching strongly referred to the linguistic competent. This issue was supported by Poplack (2001) that the code mixing was a part of code switching. In the classroom, it happens as well especially when they learn about English. ...
Article
Social Language is the language which is usually used by the students in their daily activities. They could gather any ideas to speak up without having barriers. This study investigated on the use of social language as the bridge to build up a dialogue by using English in the classroom in order to have natural communications. Researchers formulated the research questions; how did the students switch the language from their Social Languages to English? This research used a descriptive qualitative by purposive sampling to get the data analyzed by using checklist instrument to classify them based on the theoretical framework. The result showed that the students switched their social language to English by implementing the basic patterns of sentence structures. This way changed the students’ performance in delivering a dialogue by using English naturally. So, it could be concluded that students’ social language could become a bridge to use English for communication. It’s hopefully beneficial for English Lecturers in teaching English in the classroom referring to English for Specific Purpose.
... Code-switching is a phenomenon in which a multilingual speaker alternates between two or more languages within an utterance (Poplack, 2001). It is generally recognized as an advanced ability, as it demonstrates a person can master multiple languages simultaneously. ...
Preprint
Full-text available
With the ubiquity of Large Language Models (LLMs), guardrails have become crucial to detect and defend against toxic content. However, with the increasing pervasiveness of LLMs in multilingual scenarios, their effectiveness in handling multilingual toxic inputs remains unclear. In this work, we introduce a comprehensive multilingual test suite, spanning seven datasets and over ten languages, to benchmark the performance of state-of-the-art guardrails. We also investigates the resilience of guardrails against recent jailbreaking techniques, and assess the impact of in-context safety policies and language resource availability on guardrails' performance. Our findings show that existing guardrails are still ineffective at handling multilingual toxicity and lack robustness against jailbreaking prompts. This work aims to identify the limitations of guardrails and to build a more reliable and trustworthy LLMs in multilingual scenarios.
... However, in bilingual and multilingual communities worldwide, speakers often alternate between two or more languages in a single utterance. This phenomenon is known as code-switching or codemixing (Poplack, 2001). Several works have explored code-mixed text generation (Gupta et al., 2020a;Gautam et al., 2021;Mondal et al., 2022; * Equal contribution. ...
Preprint
Full-text available
Multilingual Large Language Models (LLMs) have demonstrated exceptional performance in Machine Translation (MT) tasks. However, their MT abilities in the context of code-switching (the practice of mixing two or more languages in an utterance) remain under-explored. In this paper, we introduce Rule-Based Prompting, a novel prompting technique to generate code-mixed sentences. We measure and compare the code-mixed MT abilities of 3 popular multilingual LLMs: GPT-3.5-turbo, GPT-4, and Gemini Pro across five language pairs: English-{Hindi, Bengali, Gujarati, French, Spanish} using k-shot prompting (k{0,1,10,20}k\in\{0, 1, 10, 20\}) and Rule-Based Prompting. Our findings suggest that though k-shot prompting often leads to the best results, Rule-Based prompting shows promise in generating unique code-mixed sentences that vary in their style of code-mixing. We also use k-shot prompting to gauge the code-mixed to English translation abilities of multilingual LLMs. For this purpose, we create a gold-standard code-mixed dataset spanning five language pairs: English-{Hindi, Bengali, Gujarati, French, Spanish}. As a real-world application of our work, we create a code-mixed chatbot.
... Alternations refer to the use of two languages in one clause, utterance, or between clauses of the same conversation. In this sense, code-switching is the alternation of two or more codes in a communicative act (conversation) without a change of speaker or subject matter; that is to say, two or more languages are used in the same conversation in different turns (inter-switch or alternation; Merritt, 2020;Poplack, 2001). On the other hand, code-mixing refers to the mixture of words from two or more languages without altering the grammar of the first language in the same conversation turn (intra-switch or insertions), denoting the insertion of a lexical item of one language into the structure of another (Mabule, 2015). ...
Chapter
The research presented in this volume covers first language acquisition, second language acquisition, language heritage and language impairment. Papers in this collection use a variety of experimental methods, such as eye-tracking, elicitation tasks, production tasks administered off-line and untimed, transcriptions of spontaneous speech, production elicitation, Truth Value Judgement tasks, standardized tests and multiple choice tasks. The studies included in this book try to cover most of the methods used in first and second language acquisition in typical and atypical populations. This book will be useful for linguists, speech therapists, and psycholinguists working on first, second and impaired language acquisition.
... The phenomenon of code-switching is a widely discussed topic among sociolinguists, psycholinguists, and linguists who study language contact. According to scholars many scholars, code-switching presents various challenges in different contexts (Auer, 1995;Gumperz, 1976Gumperz, , 1982Grosjean 1982Grosjean , 1992Grosjean , 1998Grosjean , 2008Myers-Scotton 1993, 2002, 2006Muysken, 1995Muysken, , 2000Poplack, 1980Poplack, , 2015and Gardner-Chloros, 2009). Myers-Scotton (1993) defines code-switching as "the selection by bilinguals or multilinguals of forms from an embedded variety in utterances of a matrix variety during the same conversation" (p. ...
Article
Full-text available
The focus of the current study is on investigating code switching and interference in the oral communication of Kazakh-Russian bilinguals. Particular emphasis is given to exploring the relationships between various types of code switching as interference and their impact on speech. The study collects data from interviews with popular young men and employs componential analysis as the primary research methodology. The paper examines how interference affects both pragmatic and structural changes in speech, and it is argued that social factors are linked to the negative effects of interference and the specifics of code switching in Kazakh-Russian bilinguals' communication. The study reveals that the native Kazakh language underwent morphological and structural alterations as a result of the deliberate selection of code-switching types, and the influence of Russian as a guest language has played a significant role in these changes. Overall, this study sheds light on the complexities of language interactions and the potential impact of bilingualism on linguistic diversity.
... Models' Challenges in Code-Mixed Text Processing: Code-mixing -the blending of two or more languages within a single utterance of a conversation (Poplack, 2001), is common in multilin-gual communities, including Kenya (Orao, 2012). The phenomenon of code-mixing presents unique challenges in the field of NLP. ...
Preprint
Full-text available
The deployment of Large Language Models (LLMs) in real-world applications presents both opportunities and challenges, particularly in multilingual and code-mixed communication settings. This research evaluates the performance of seven leading LLMs in sentiment analysis on a dataset derived from multilingual and code-mixed WhatsApp chats, including Swahili, English and Sheng. Our evaluation includes both quantitative analysis using metrics like F1 score and qualitative assessment of LLMs' explanations for their predictions. We find that, while Mistral-7b and Mixtral-8x7b achieved high F1 scores, they and other LLMs such as GPT-3.5-Turbo, Llama-2-70b, and Gemma-7b struggled with understanding linguistic and contextual nuances, as well as lack of transparency in their decision-making process as observed from their explanations. In contrast, GPT-4 and GPT-4-Turbo excelled in grasping diverse linguistic inputs and managing various contextual information, demonstrating high consistency with human alignment and transparency in their decision-making process. The LLMs however, encountered difficulties in incorporating cultural nuance especially in non-English settings with GPT-4s doing so inconsistently. The findings emphasize the necessity of continuous improvement of LLMs to effectively tackle the challenges of culturally nuanced, low-resource real-world settings.
Article
In recent years, the study of codeswitching has made great strides by leveraging a multidisciplinary approach that integrates insights from experimental psycholinguistics, corpus linguistics, cognitive psychology, neurolinguistics, and other allied fields. We provide an overview of 2 main strands of this research, focusing on the control process model of codeswitching (e.g., Green, 2018), the variable equivalence hypothesis (e.g., Torres Cacoullos, 2020), and the ways in which these accounts of codeswitching behavior intersect and complement one another. We argue that the common insight of these 2 distinct approaches is their focus on conventionalized linguistic norms at the level of the speech community. The second portion of the article sheds light on how various aspects of codeswitching experience give rise to adaptive change through bilingual phenotyping, highlighting the centrality of our mentor Judy Kroll's contributions to this research. Finally, we present a social network analysis of Judy's research publications and argue that the high degree of interconnectivity in Judy's research network, combined with the many positive community norms that she has been instrumental in establishing, have greatly benefited both the individual members of the network and the research enterprise itself.
Article
The article deals with cases of switching codes in the speech of the inhabitants of Lithuania, as well as the border Lithuanian-Belarusian territories, who speak Polish, Lithuanian and Belarusian (Russian) languages. The analyzed texts were written down by Rita Balkute during ethnographic expeditions, which were not aimed at revealing the actual linguistic features of the speech of the informants. Analysis of the material shows that ethnolinguistic factors play an important role in code switching; for example, informants often switch to Polish when describing magical practices or when quoting spells. The article discusses the reasons for such transitions, as well as cases of mixing codes when hybrid texts are formed in Polish and Belarusian.
Article
One characteristic of multilingual speakers is that in everyday life, they may integrate elements from their languages in the same sentence or discourse, a practice known as code-switching. This paper examines code-switching at the interfaces, in particular as related to information structure. Despite the fact that a core question of modern linguistic theory is how syntactic and information-structural theories interact in accounting for licensing of different grammatical phenomena, there has been relatively little literature on code-switching and information structure. In this paper, we provide an overview of the available literature on code-switching across different language combinations, focusing in particular on subject pronoun–verb switches, ellipsis, light verbs, topic/focus particles, and code-switching between sign languages. We argue that the study of the interplay between information structure and code-switching sheds light on our understanding of multilingual grammars and language competence more generally. In this regard, we discuss theoretical and methodological considerations to guide future studies.
Article
Nonce borrowings in the speech of bilinguals differ from established loanwords in that they are not necessarily recurrent, widespread, or recognized by host language monolinguals. With established loanwords, however, they share the characteristics of morphological and syntactic integration into the host language and consist of single content words or compounds. Furthermore, both types of loanwords differ from intrasentential code-switching, that is, alternate sentence fragments in the two languages, each of which is grammatical by monolingual standards from the standpoints of appropriate function words, morphology, and syntax. In a large corpus of Tamil-English bilingual speech, many words of English origin are found in objects governed by Tamil verbs and vice versa. The equivalence constraint on intrasentential code-switching predicts that no code-switch should occur between verb and object in an SOV/SVO bilingual situation, and hence that objects whose language differs from that of the verb must be borrowed, if only for the nonce. To verify this prediction, we compare quantitatively the distribution across various syntactic contexts of both native Tamil and English-origin complements of Tamil verbs, and find them to be parallel. But the strongest evidence in favor of the nonce borrowing hypothesis comes from an analysis of variable accusative and dative case marking in these complements, in which the English-origin material is shown, morphologically and syntactically, to be virtually indistinguishable from Tamil (nonpronominal) nouns. In addition, we present supporting evidence from the genitive, locative, and other cases and from nonce borrowings from Tamil into these speakers' English.
Article
We present the results of an empirical study of mixed Moroccan Arabic/French discourse, the aim of which is to situate this discourse with respect to theories of borrowing and code-switching. We rely largely on quantitative analysis of the distribution of the syntactic contexts in which elements of the two languages are juxtaposed.
Article
The categorization of lone lexical items from one language embedded in another is often difficult due to their ambiguous status as either loanwords or codeswitches. Following variationist principles, we use a comparative method to disambiguate lone English-origin nouns in otherwise Acadian French discourse. Their exact status is important because it has been argued that the constraints on different manifestations of language contact differ. Incorporating the notion of structural conflict sites as diagnostics, we examine these items at the morphological (e.g., number marking), syntactic (e.g., adjective placement) and discourse (e.g., flagging) levels for evidence of system membership. Our method reveals that although most of these items can be unambiguously classified as borrowings, a small subset must be considered to be codeswitches.
Article
This paper offers an empirical analysis of data from natural Igbo-English bilingual discourse which demonstrates how the two most important manifestations of language contact, namely, codeswitching and borrowing, can be unambiguously and consistently distinguished. The paper focuses on showing how inherently ambiguous lone English-origin nouns and verbs, incorporated into otherwise Igbo discourse, can be assigned language membership, not solely on the basis of their surface appearance as has been the case in the literature, but only by situating the ambiguous forms in the context of the entire system. Using diagnostics such as vowel harmony, inflection, and word order of verbs as well as modification structures of nouns, the paper demonstrates beyond a shadow of doubt that lone English-origin items are distributed among syntactic slots in the same way as native items of Igbo and differ from codeswitches to English which show distinct patterns. The inescapable conclusion is that the lone English-origin nouns and verbs in otherwise Igbo discourse cannot be classified as codeswitches but rather, must be considered to be borrowings.
Article
In this paper I develop diagnostics for distinguishing word-internal codeswitching from borrowing, based on Ukrainian-English bilingual discourse, a typologically different language pair. Assuming that a codeswitched noun will retain its original characteristics, and a borrowed noun will acquire language-specific properties of the recipient language, I focus on conflict sites in the morphosyntactic structure of Ukrainian (a fusional language) and English (an analytical one). Using quantitative variationist methodologies, I show that English-origin nouns with overt Ukrainian morphology occurring in otherwise Ukrainian discourse do not behave as if they were English. Instead, they replicate the patterns of behavior of monolingual Ukrainian nouns with respect to gender assignment, modifier-noun agreement, inflectional variability and flagging. I find no evidence for lone item codeswitching within a word boundary, and conclude that all English-origin utterances occurring with overt morphology of the recipient language are borrowed.
Article
Discourse production involves the construction of a message, grammatical encoding, and articulation. The author focuses more specifically on the process of grammatical encoding and argues that there are two processing phases in this encoding