Sjur Nørstebø Moshagen

Sjur Nørstebø Moshagen
  • Master of Arts
  • Chief engineer at UiT The Arctic University of Norway

About

19
Publications
2,765
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
91
Citations
Introduction
Sjur N. Moshagen currently works at the Department of Language and Culture, UiT The Arctic University of Norway. Sjur does research in Proofing Tools, Morphology and Computational Linguistics.
Current institution
UiT The Arctic University of Norway
Current position
  • Chief engineer

Publications

Publications (19)
Book
Full-text available
The book describes various approaches in rule-based language technology The authors are leading experts in this research field. The book is the first of its kind and it gives a comprehensive picture of the state-of-the-art in rule-based language technology. The book shows the suitability of the technology to all language types, including languages...
Article
Full-text available
Currently, machine learning is presented as the ultimate solution for language technology regardless of use case and application, however, it requires as a starting point a massive amount of curated linguistic data in electronic form that is expected to be high quality and representative of the kind of language usage that the tools will follow. For...
Article
Full-text available
Denne utgåva av Nordlyd er eit festskrift til ære for vår kollega professor Trond Trosterud, i samband med at han fyller 60 år 30. august 2022. Utgåva inneheld 22 artiklar skrivne av i alt 43 forfattarar – for det meste folk som har samarbeidd med Trond i tidlegare år. Vi har òg skrive ei innleiing om Trond, og til sist i boka er det ei liste over...
Article
Full-text available
In this article, we study correction of spelling errors, specifically on how the spelling errors are made and how can we model them computationally in order to fix them.The article describes two different approaches to generating spelling correction suggestions for three Uralic languages: Estonian, North Sámi and South Sámi.The first approach of mo...
Chapter
This is the Festschrift of Dr. Jack Rueter. The book presents peer-reviewed scientific work from Dr. Rueter’s colleagues related to the latest advances in natural language processing, digital resources and endangered languages in a variety of languages such as historical English, Chukchi, Mansi, Erzya, Komi, Finnish, Apurina, Sign Languages, Sami l...
Article
Communities of lesser resourced languages like North Sámi benefit from language tools such as spell checkers and grammar checkers to improve literacy. Accurate error feedback is dependent on well-tokenised input, but traditional tokenisation as shallow preprocessing is inadequate to solve the challenges of real-world language usage. We present an a...
Conference Paper
Full-text available
This paper presents aspects of a computational model of the morphology of Plains Cree based on the technology of finite state transducers (FST). The paper focuses in particular on the modeling of nominal morphology. Plains Cree is a polysynthetic language whose nominal morphology relies on prefixes, suffixes and circumfixes. The model of Plains Cre...
Article
This article presents a novel way of combining finite-state transducers (FSTs) with electronic dictionaries, thereby creating efficient reading comprehension dictionaries. We compare a North Saami - Norwegian and a South Saami - Norwegian dictionary, both enriched with an FST, with existing, available dictionaries containing pre-generated paradigms...
Article
Full-text available
The article presents Vuosttáš Digisánit (VD), an electronic dictionary from North Sámi to Norwegian. Its novelty lies in the way we have utilized existing resources (a basic dictionary and a morphological analyser/generator) in order to create a reception dictionary for language learners for a morphologically rich language. With only 7,9 % of the w...
Article
Full-text available
Proceedings of the Workshop on NLP for Reading and Writing – Resources, Algorithms and Tools (SLTC 2008). Editors: Rickard Domeij, Sofie Johansson Kokkinakis, Ola Knutsson and Sylvana Sofkova Hashemi. NEALT Proceedings Series, Vol. 3 (2009), 19-21. © 2009 The editors and contributors. Published by Northern European Association for Language Technolo...
Article
Full-text available
This paper describes an annotation system for Sámi language corpora, which consists of structured, running texts. The annotation of the texts is fully automatic, starting from the original documents in different formats. The texts are first extracted from the original documents preserving the original structural markup. The markup is enhanced by a...
Conference Paper
Smi, transducers, language technology, spelling, proofing, minority languages.
Conference Paper
Full-text available
Two problematic issues in most lexicon systems today are their size and restricted domain of use. In this paper, we introduce a new approach to lexical organization that leads to more compact and flexible lexicons. The lexical entries are conceptual/phonological frames rather then word entries, and a number of expansion rules are used to generate e...

Network

Cited By