Aivars Glaznieks

Aivars Glaznieks
Eurac Research · Institute for Applied Linguistics

Doctor of Philosophy

About

44
Publications
5,259
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
104
Citations
Citations since 2017
18 Research Items
78 Citations
2017201820192020202120222023051015
2017201820192020202120222023051015
2017201820192020202120222023051015
2017201820192020202120222023051015
Introduction
Aivars Glaznieks currently works at the Institute for Applied Linguistics at Eurac Research. Aivars does research in Sociolinguistics. His current project is 'Wie misst man Textqualität im digitalen Zeitalter? MIT.QUALITÄT'

Publications

Publications (44)
Article
Full-text available
The current study presents an analysis of a linguistic device supporting interactional coherence in computer-mediated communication (CMC). The analysis focused on the use of the German causative conjunction weil and, more specifically, on the syntactic variation of verb-final versus verb-second word order in the weil -clause. Verb-final word order...
Article
Full-text available
This article presents the longitudinal trilingual corpus of young learners of Italian, German and English called LEONIDE. The corpus consists of L1, L2 and L3 learner texts. L1 texts were written in two languages of schooling (i.e. Italian and German), L2 texts in two languages learned as second languages (i.e. German and Italian), and L3 texts in...
Chapter
Wie wirkt sich das Schreiben kürzerer Texte in interaktionsorienterter Online-Kommunikation langfristig auf das Schreiben und die Qualität monologischer Texte aus? Auf diese Frage geht der Beitrag ein und präsentiert dazu empirische Daten aus einer Korpus-Vergleichsstudie, in der die Verwendung ausgewählter Konnektoren in einem Facebook-Korpus quan...
Article
Full-text available
This case study deals with the reflexes of unaccentuated, word-final Standard German-er in multisyllabic words, attested in German Facebook texts written by South Tyrolean Facebook users with German as (one of) their first language(s). After preliminary considerations for the analysis of written language and a presentation of the corpus, linguistic...
Book
Since its beginnings in the late 1980s, the field of Learner Corpus Research (LCR) has been continuously evolving and thereby widening its scope. LCR is rapidly closing gaps within its original scope by expanding on the languages, language mediums (spoken vs written) and learner types covered, and it is turning its attention to ever-emerging new re...
Chapter
Full-text available
Widening the scope was the theme of the 4th Learner Corpus Research (LCR) Conference in Bolzano/Bozen, Italy in October 2017. The aim in setting this theme was to highlight and foster work on growing research topics in the LCR community that deserve more intensive study and attention. These include spoken learner data, less commonly taught language...
Chapter
Full-text available
This paper focuses on non-standard spelling patterns in Facebook messages. We analyse German Facebook messages from around 100 South Tyrolean Facebook users collected for the DiDi Corpus (www.eurac.edu/didi) with respect to the variants used for unstressed word-final <-er> (pronounced as [ɐ]) in polysyllabic words in Standard German. The DiDi Corpu...
Chapter
The contribution presents an analysis of around 600 essays written by secondary school students from three German-speaking areas: South Tyrol (Italy), Tyrol (Austria) and Thuringia (Germany). The texts were analysed holistically for textual features. The results indicate some differences between the three areas that are discussed from a sociolingui...
Conference Paper
Full-text available
Digital Natives, i.e. people who grew up in a digital world, are said to be different to their counterparts, digital immigrants, regarding their communication habits and use of digital services. In this paper, we investigate the linguistic behavior of digital natives compared to digital immigrants in a sociolinguistically annotated corpus of person...
Chapter
For the DiDi project, we studied linguistic behaviour among Facebook users in the multilingual province of South Tyrol by looking at the Facebook texts written by 136 participants from the region. The corpus of texts included status updates, comments and chat messages by Facebook users who were between 14 and 76 years of age and whose L1 was German...
Conference Paper
Full-text available
The DiDi corpus of South Tyrolean data of computer-mediated communication (CMC) is a multilingual sociolinguistic language corpus. It consists of around 600,000 tokens collected from 136 profiles of Facebook users residing in South Tyrol, Italy. In conformity with the multilingual situation of the territory, the main languages of the corpus are Ger...
Poster
Full-text available
Poster presented at the 3rd Italian Conference on Computational Linguistics, 5-6 December 2016, Università degli Studi di Napoli, Federico II
Conference Paper
Full-text available
English. This paper describes an extended version of the KoKo corpus (version KoKo4, Dec 2015), a corpus of written German L1 learner texts from three different German-speaking regions in three different countries. The KoKo corpus is richly annotated with learner language features on different linguistic levels such as errors or other linguistic ch...
Article
Full-text available
At the beginning of the corpus building process is the selection of appropriate software tools and data formats for the acquisition and annotation of original linguistic data. This initial phase is characterised by challenging decisions, for the software needs to be flexible (to facilitate intuitive and speedy transcription), powerful (to meet anno...
Presentation
Full-text available
Talk at ”Multilingualism in the Digital Age”, 19. Juni 2015, University of Reading
Chapter
Full-text available
The paper introduces a corpus-linguistic method to search for linguistic particularities in a systematic way. Linguistic analyses focus on variation in the use of a certain idiom in an L1-learner corpus. Results reveal that aspects of learner language, diatopic variation, and language change must be considered to explain the variation adequately.
Article
Full-text available
Decisions at the outset of preparing a learner corpus are of crucial importance for how the corpus can be built and how it can be analysed later on. This paper presents a generic workflow to build learner corpora while taking into account the needs of the users. The workflow results from an extensive collaboration between linguists that annotate an...
Conference Paper
Full-text available
In this paper, we propose an integrated web strategy for mixed sociolinguistic research methodologies in the context of social media corpora. After stating the particular challenges for building corpora of private, non-public computer-mediated communication, we will present our solution to these problems: a Facebook web application for the acquisit...
Presentation
Full-text available
Talk at the pre-conference workshop “NLP 4 CMC: Natural Language Processing for Computer-Mediated Communication / Social Media” at the 12th edition of KONVENS, Hildesheim, Germany
Conference Paper
Full-text available
We introduce the KoKo corpus, a collection of German L1 learner texts annotated with learner errors, along with the methods and tools used in its construction and evaluation. The corpus contains both texts and corresponding survey information from 1,319 pupils and amounts to around 716,000 tokens. The evaluation of the quality of the performed tran...
Presentation
Full-text available
Talk at the 7th workshop of the DFG scientific network Empirikom ”Social Media Corpora for the eHumanities: Standards, Challenges, and Perspectives”, TU Dortmund University
Article
Full-text available
Special Issue: Building and annotating corpora of computer-mediated discourse. Issues and Challenges at the Inteface of Corpus and Computational Linguistics
Data
Talk at the Workshop from the "Arbeitsgruppe: Korpusbasierte Linguistik" at the 40. Österreichische Linguistiktagung, Nov 22-24, 2013, Universität Salzburg, Salzburg, Austria. Der Vortrag stellt den iterativen Workflow zur Erstellung eines lemmatisierten, POS-getaggten und nach ausgewählten sprachlichen Merkmalen annotierten Lernerkorpus vor und g...
Data
Talk at the Workshop on "Verabeitung und Annotation von Sprachdaten aus Genres internetbasierter Kommunikation" at the International Conference of the German Society for Computational Linguistics and Language Technology (GSCL 2013), Sep 23, 2013, TU Darmstadt. Die automatische Verarbeitung von IBK-Daten stellt herkömmliche Verfahren im Bereich der...
Article
Talk at the DGfS 2013 Workshop on Modellierung nicht-standardisierter Schriftlichkeit, the 35th Annual Conference of the German Linguistic Society (DGfS 2013)
Article
Talk at the international workshop "Building Corpora of Computer-Mediated Communication: Issues, Challenges, and Perspectives", Department of German Language and Literature, Faculty of Culture Studies, TU Dortmund University

Network

Cited By

Projects

Projects (5)
Project
Es ist unübersehbar, dass die Schreibprodukte in sozialen Netzwerken nicht immer den normativen Erwartungen entsprechen, die man an redigierte Texte in der Presse, der Wissenschaft oder der Belletristik heranträgt. Es stellt sich deshalb die Frage, wie sich dies langfristig auf unsere Schriftsprache im Allgemeinen und auf die Qualität von Texten der „digital natives“ im Besonderen auswirken wird. Unser Projekt möchte die Möglichkeiten verbessern, diese Frage empirisch zu untersuchen, und damit die bislang auf schmaler Datengrundlage geführte Debatte über die Auswirkungen digitaler Medien auf Schreibkompetenzen versachlichen. Hierfür wird ein Modell entwickelt, das neue Qualitätsaspekte, die für das vernetzte, multimodale und dialogische Kommunizieren in sozialen Medien relevant sind, in Traditionen des Nachdenkens über Sprachqualität in Rhetorik und Textlinguistik einordnet. Weiterhin werden Messmethoden für die Qualitätsdimension „Kohärenz“ entwickelt und an kleinen Fallstudien evaluiert, um erste Erkentnisse zu der bislang empirisch noch nicht untersuchten Frage zu gewinnen, ob und wie sich das häufige Lesen und Schreiben kurzer, in Dialoge eingebundener Botschaften auf die Kompetenz zum Planen und Verfassen monologischer, längerer Schrifttexte auswirkt. Das Projekt nutzt digitale Sammlungen (Korpora) von Lernertexten und Webgenres, die mit korpuslinguistischen Methoden aufbereitet und ausgewertet werden, und experimentiert auch mit neuen Methoden (z. B. Crowd Sourcing und Data Mining). https://mitqualitaet.com/
Project
"European Literacy Network is funded by COST Action IS1401 - Strengthening Europeans’ Capabilities by Establishing the European Literacy Network (ELN). The ELN is coordinating a global European response to the challenges of the on-going digital revolution, generate research collaborations and serve as a forum through which interdisciplinary knowledge on literacy in a digital world can be integrated, augmented, and acted upon in ways that will raise European literacy research leadership, European educational standards, and will increase global public awareness on the pivotal role played by literacy in fostering human capabilities." https://www.is1401eln.eu/en/gca/index.php?id=5#actionabstract
Project
More at: http://www.eurac.edu/en/research/projects/Pages/projectdetail4195.aspx