Linda Wiechetek

UiT The Arctic University of Norway · Department of Language and Linguistics

PhD


Publications: 15
Reads: 456
Citations: 52

Publications (15)
Article
Machine learning is the dominating paradigm in natural language processing nowadays. It requires vast amounts of manually annotated or synthetically generated text data. In the GiellaLT infrastructure, on the other hand, we have worked with rule-based methods, where the linguists have full control over the development of the tools. In this article we...
Conference Paper
Full-text available
We investigate both rule-based and machine learning methods for the task of compound error correction and evaluate their efficiency for North Sámi, a low-resource language. The lack of error-free data needed for a neural approach is a challenge to the development of these tools, which is not shared by bigger languages. In order to compensate for th...
Conference Paper
Full-text available
We present a method for conducting morphological disambiguation for South Sámi, which is an endangered language. Our method uses an FST-based morphological analyzer to produce an ambiguous set of morphological readings for each word in a sentence. These readings are disambiguated with a Bi-RNN model trained on the related North Sámi UD Treebank and...
Preprint
Full-text available
We present a method for conducting morphological disambiguation for South Sámi, which is an endangered language. Our method uses an FST-based morphological analyzer to produce an ambiguous set of morphological readings for each word in a sentence. These readings are disambiguated with a Bi-RNN model trained on the related North Sámi UD Treebank and...
Article
Communities of lesser resourced languages like North Sámi benefit from language tools such as spell checkers and grammar checkers to improve literacy. Accurate error feedback is dependent on well-tokenised input, but traditional tokenisation as shallow preprocessing is inadequate to solve the challenges of real-world language usage. We present an a...
Conference Paper
Full-text available
This paper presents a set of rules which form the prototype lexical selection component of a rule-based machine translation system between two closely related minority languages, North Sámi and Lule Sámi. While the languages have comprehensive monolingual computational linguistic resources, they lack bilingual resources. One-to-one relations in the l...
Conference Paper
Full-text available
Grammatical approaches to language technology are often considered less optimal than statistical approaches in multilingual settings, where large-scale portability becomes an important issue. The present paper argues that there is a notable gain in reusing grammatical resources when porting technology to new languages. The pivot language is North S...
Article
Full-text available
This paper describes the development of two prototype systems for machine translation between North Sámi and Lule Sámi. Experiments were conducted in rule-based machine translation (RBMT), using the Apertium platform, and statistical machine translation (SMT) using the Moses decoder. The experiments show that both approaches have their advanta...
Article
Rule-based MT systems are used when corpora for lesser-used languages are not large enough to create statistical systems, but also because linguistic structures may be captured better by linguistic rules than by frequencies. The paper will discuss two different rule-based systems that make use of statistics as well – Apertium and GramT...
Article
Rule-based MT systems are used when corpora for lesser-used languages are not large enough to create statistical systems, but also because linguistic structures may be captured better by linguistic rules than by frequencies. The paper will discuss two different rule-based systems that make use of statistics as well – Apertium and GramT...


Projects

Project (1)
Project
Develop a grammar checker for North Sámi using only open-source tools, and build a rule-based, language-independent framework to enable easy grammar checker development for other languages.