Sjur N. Moshagen

Sjur N. Moshagen
UiT The Arctic University of Norway · Department of Language and Linguistics

About

12
Publications
1,177
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
48
Citations
Introduction
Sjur N. Moshagen currently works at the Department of Language and Linguistics, UiT The Arctic University of Norway. Sjur does research in Proofing Tools, Morphology and Computational Linguistics. Their most recent publication is 'A Morphological Parser for Odawa'.

Publications

Publications (12)
Chapter
This is the Festschrift of Dr. Jack Rueter. The book presents peer-reviewed scientific work from Dr. Rueter’s colleagues related to the latest advances in natural language processing, digital resources and endangered languages in a variety of languages such as historical English, Chukchi, Mansi, Erzya, Komi, Finnish, Apurina, Sign Languages, Sami l...
Article
Communities of lesser resourced languages like North Sámi benefit from language tools such as spell checkers and grammar checkers to improve literacy. Accurate error feedback is dependent on well-tokenised input, but traditional tokenisation as shallow preprocessing is inadequate to solve the challenges of real-world language usage. We present an a...
Conference Paper
Full-text available
This paper presents aspects of a computational model of the morphology of Plains Cree based on the technology of finite state transducers (FST). The paper focuses in particular on the modeling of nominal morphology. Plains Cree is a polysynthetic language whose nominal morphology relies on prefixes, suffixes and circumfixes. The model of Plains Cre...
Article
This article presents a novel way of combining finite-state transducers (FSTs) with electronic dictionaries, thereby creating efficient reading comprehension dictionaries. We compare a North Saami - Norwegian and a South Saami - Norwegian dictionary, both enriched with an FST, with existing, available dictionaries containing pre-generated paradigms...
Article
Full-text available
The article presents Vuosttáš Digisánit (VD), an electronic dictionary from North Sámi to Norwegian. Its novelty lies in the way we have utilized existing resources (a basic dictionary and a morphological analyser/generator) in order to create a reception dictionary for language learners for a morphologically rich language. With only 7,9 % of the w...
Article
Full-text available
Proceedings of the Workshop on NLP for Reading and Writing – Resources, Algorithms and Tools (SLTC 2008). Editors: Rickard Domeij, Sofie Johansson Kokkinakis, Ola Knutsson and Sylvana Sofkova Hashemi. NEALT Proceedings Series, Vol. 3 (2009), 19-21. © 2009 The editors and contributors. Published by Northern European Association for Language Technolo...
Article
Full-text available
This paper describes an annotation system for Sámi language corpora, which consists of structured, running texts. The annotation of the texts is fully automatic, starting from the original documents in different formats. The texts are first extracted from the original documents preserving the original structural markup. The markup is enhanced by a...
Conference Paper
Smi, transducers, language technology, spelling, proofing, minority languages.
Conference Paper
Full-text available
Two problematic issues in most lexicon systems today are their size and restricted domain of use. In this paper, we introduce a new approach to lexical organization that leads to more compact and flexible lexicons. The lexical entries are conceptual/phonological frames rather then word entries, and a number of expansion rules are used to generate e...

Network

Cited By

Projects

Project (1)
Project
Develop a grammar checker for North Sámi using only open-source tools, and building a rule-based and language-independent framework to enable easy grammar checker development for other languages.