Jennifer Frey

Eurac Research · Institute for Applied Linguistics

Doctor of Philosophy

About

29
Publications
3,759
Reads
57
Citations
Introduction
Jennifer Frey is a Post-Doc researcher at the Institute for Applied Linguistics at Eurac Research Bolzano. Her interdisciplinary work deals with computer-mediated communication, online multilingualism, and language learning and teaching, using a broad range of empirical methods for linguistic analysis. While her recent work is located in the field of corpus linguistics, her skillset includes natural language processing, data science, and machine learning methods.
Additional affiliations
October 2019 - April 2020
Eurac Research
Position
  • Researcher
Description
  • Post-Doc researcher for various projects in the area of learner corpus research and computer-mediated communication.
October 2016 - September 2019
University of Bologna
Position
  • PhD Student
Description
  • PhD Scholarship, Dissertation: "Using data mining to repurpose German language corpora. An evaluation of data-driven analysis methods for corpus linguistics"
August 2015 - September 2016
University of Bologna
Position
  • Fellow
Description
  • Research fellowship (assegno di ricerca), project "Musy: Multilingualism in South Tyrolean Social Media Profiles"
Education
October 2007 - June 2013
Graz University of Technology
Field of study
  • Computer Science and Computer Science Management (teaching degree)
October 2007 - June 2013
Karl-Franzens-Universität Graz
Field of study
  • German (teaching degree)

Publications (29)
Article
This article presents the longitudinal trilingual corpus of young learners of Italian, German and English called LEONIDE. The corpus consists of L1, L2 and L3 learner texts. L1 texts were written in two languages of schooling (i.e. Italian and German), L2 texts in two languages learned as second languages (i.e. German and Italian), and L3 texts in...
Article
Full-text available
Up until today research in various educational and linguistic domains such as learner corpus research, writing research, or second language acquisition has produced a substantial amount of research data in the form of L1 and L2 learner corpora. However, the multitude of individual solutions combined with domain-inherent obstacles in data sharing ha...
Conference Paper
Full-text available
Linguistic complexity research being a very actively developing field, an increasing number of text analysis tools are created that use natural language processing techniques for the automatic extraction of quantifiable measures of linguistic complexity. While most tools are designed to analyse only one language, the CTAP open source linguistic com...
Chapter
In this article, we examine the current situation of data dissemination and provision for CMC corpora. By that we aim to give a guiding grid for future projects that will improve the transparency and replicability of research results as well as the reusability of the created resources. Based on the FAIR guiding principles for research data manageme...
Article
Full-text available
With language resources being collected in many - also small - projects in learner corpus research with considerate amounts of time and effort spent in this activity, making these types of data available in a FAIR way, with standardized and reasoned methods, would contribute substantially to the advancement of the field. Additionally, it would...
Conference Paper
In recent years, research data management has also become an important topic in the less data-intensive areas of the Social Sciences and Humanities (SSH). Funding agencies as well as research communities demand that empirical data collected and used for scientific research is managed and preserved in a way that research results are reproducible. In...
Chapter
Multilingual speakers communicate in more than one language in daily life and on social media. In order to process or investigate multilingual communication, there is a need for language identification. This study compares the performance of human annotators with automatic ways of language identification on a multilingual (mainly German-Italian-Eng...
Conference Paper
Full-text available
Digital Natives, i.e. people who grew up in a digital world, are said to be different to their counterparts, digital immigrants, regarding their communication habits and use of digital services. In this paper, we investigate the linguistic behavior of digital natives compared to digital immigrants in a sociolinguistically annotated corpus of person...
Chapter
For the DiDi project, we studied linguistic behaviour among Facebook users in the multilingual province of South Tyrol by looking at the Facebook texts written by 136 participants from the region. The corpus of texts included status updates, comments and chat messages by Facebook users who were between 14 and 76 years of age and whose L1 was German...
Conference Paper
Full-text available
The DiDi corpus of South Tyrolean data of computer-mediated communication (CMC) is a multilingual sociolinguistic language corpus. It consists of around 600,000 tokens collected from 136 profiles of Facebook users residing in South Tyrol, Italy. In conformity with the multilingual situation of the territory, the main languages of the corpus are Ger...
Poster
Full-text available
Poster presented at the 3rd Italian Conference on Computational Linguistics, 5-6 December 2016, Università degli Studi di Napoli, Federico II
Poster
Poster presented at the UCREL Summer School in corpus-based NLP
Presentation
Full-text available
Talk at “Multilingualism in the Digital Age”, 19 June 2015, University of Reading
Presentation
Full-text available
Talk at “41. Österreichische Linguistiktagung” (ÖLT), 6-8 December 2014, University of Vienna
Conference Paper
Full-text available
In this paper, we propose an integrated web strategy for mixed sociolinguistic research methodologies in the context of social media corpora. After stating the particular challenges for building corpora of private, non-public computer-mediated communication, we will present our solution to these problems: a Facebook web application for the acquisit...
Presentation
Full-text available
Talk at the pre-conference workshop “NLP 4 CMC: Natural Language Processing for Computer-Mediated Communication / Social Media” at the 12th edition of KONVENS, Hildesheim, Germany
Presentation
Full-text available
Talk at the 7th workshop of the DFG scientific network Empirikom “Social Media Corpora for the eHumanities: Standards, Challenges, and Perspectives”, TU Dortmund University
Book
Full-text available
L3T is the short name for the award-winning and freely accessible “Lehrbuch für Lernen und Lehren mit Technologien” (textbook for learning and teaching with technologies, http://l3t.eu). In summer 2013, the book was completely revised and expanded as part of the “L3T 2.0” project. A novel approach was taken: within seven days, from 20 to 28 August 2013, the book was...
Chapter
Full-text available
Social media have long since reached the threshold of ubiquity. Today they are not only an everyday companion in our private lives but are also used by companies, organisations, and other non-natural persons for a wide variety of purposes. Increasingly, educational and training institutions are also trying...
Chapter
Full-text available
L3T has been revised and is now available in its second edition. How this came about and how it was carried out within the “L3T 2.0” project is recorded here. Because the creation of this textbook differs markedly from other book projects, we also want to give some insight into how it came into being. Likewise, as a kind of foreword...
Chapter
Full-text available
This chapter gives an exemplary overview of the use of technologies in foreign language learning and teaching. Starting from the “Common European Framework of Reference”, it explains how technology is used to develop and practise the skills described there. The remaining sections cover language portfolios, ...
Book
Social media is increasingly seen as a way to include principles like participation, global interaction and transparency in everyday life. These omnipresent changes in our social reality are reaching every sector of life, encouraging even public institutions to participate and to show presence in the social web. This study seeks to reveal which fa...

Projects (5)
Project
It is plain to see that writing produced in social networks does not always meet the normative expectations applied to edited texts in the press, academia, or literary fiction. This raises the question of how this will affect our written language in general, and the quality of texts by “digital natives” in particular, in the long term. Our project aims to improve the means of investigating this question empirically, and thereby to bring objectivity to the debate on the effects of digital media on writing skills, which has so far been conducted on a narrow data basis. To this end, a model is being developed that places new quality aspects relevant to networked, multimodal, and dialogic communication in social media within the traditions of thinking about language quality in rhetoric and text linguistics. In addition, methods for measuring the quality dimension “coherence” are being developed and evaluated in small case studies, in order to gain initial insights into a question that has not yet been studied empirically: whether and how the frequent reading and writing of short messages embedded in dialogues affects the ability to plan and compose longer, monological written texts. The project uses digital collections (corpora) of learner texts and web genres, which are prepared and analysed with corpus-linguistic methods, and also experiments with new methods (e.g. crowdsourcing and data mining). https://mitqualitaet.com/