Sigita Laurinciukaite’s scientific contributions

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (6)


LITHUANIAN CONTINUOUS SPEECH CORPUS LRN 1: AN IMPROVEMENT
  • Article

January 2009

·

11 Reads

·

3 Citations

Information Technology and Control

Sigita Laurinciukaite

·

Mark Filipovic

·

Laimutis Telksnys

This paper presents the development of Lithuanian continuous speech corpus LRN 1 (Lithuanian Radio News, version 1). The corpus was developed from speech corpus LRN 0.1 by increasing the duration of speech corpus (it lasts 20 hours 50 minutes). The major improvement of speech corpus LRN 1 was a development of time-aligned word level annotations of speech signals. Time-aligned word level annotations of speech signals were obtained after a two-Stage process: automatic realignment of acoustic models of phonemes and subsequent manual correction of annotations. The improvement of the corpus is useful for constructing and evaluating speaker-independent continuous speech recognition systems and for linguistic research.


Framework for Choosing a Set of Syllables and Phonemes for Lithuanian Speech Recognition

January 2007

·

57 Reads

·

5 Citations

Informatica

This paper describes a framework for making up a set of syllables and phonemes that subsequently is used in the creation of acoustic models for continuous speech recognition of Lithua- nian. The target is to discover a set of syllables and phonemes that is of utmost importance in speech recognition. This framework includes operations with lexicon, and transcriptions of records. To fa- cilitate this work, additional programs have been developed that perform word syllabification, lexi- con adjustment, etc. Series of experiments were done in order to establish the framework and model syllable- and phoneme-based speech recognition. Dominance of a syllable in lexicon has improved speech recognition results and encouraged us to move away from a strict definition of syllable, i.e., a syllable becomes a simple sub-word unit derived from a syllable. Two sets of syllables and phonemes and two types of lexicons have been developed and tested. The best recognition accu- racy achieved 56.67% ±0.33. The speech recognition system is based on Hidden Markov Models (HMM). The continuous speech corpus LRN0 was used for the speech recognition experiments.



Table 2 -Major LRN0 characteristics Criterion LRN0 Characteristics
A Technique for Choosing Efficient Acoustic Modeling Units for Lithuanian Continuous Speech Recognition
  • Conference Paper
  • Full-text available

January 2006

·

93 Reads

·

1 Citation

This paper presents a technique and experiments on choosing a set of mixed-duration modeling units representing language-specific phonetic details based on the analysis of available training data. A case study for Lithuanian speech recognition is presented. Lithuanian language defines the following major phonetic features: linguistic stress, consonant softness, vowel duration, and compound phones. Also, syllables or words could be chosen as modeling units. Incorporating all linguistic features into base phone set or using syllable models explodes the number of triphones used for accurate acoustic modeling. Therefore only those phones with phonetic features that have enough training samples are defined as separate phonetic units. The next problem is the choice of appropriate model complexity, which also depends on the available training data. Simple algorithms are proposed for forming dynamic base phone set and choosing complexity for separate models. Experiments on 10 hours of prepared speech from Lithuanian Radio news are conducted and indicate that the proposed modeling methodology allows significant improvement of recognition accuracy.

Download

A Technique for Choosing Efficient Acoustic Modeling Units for Lithuanian Continuous Speech Recognition

January 2006

·

21 Reads

·

1 Citation

This paper presents a technique and experiments on choosing a set of mixed-duration modeling units representing language-specific phonetic details based on the analysis of available training data. A case study for Lithuanian speech recognition is presented. Lithuanian language defines the following major phonetic features: linguistic stress, consonant softness, vowel duration, and compound phones. Also, syllables or words could be chosen as modeling units. Incorporating all linguistic features into base phone set or using syllable models explodes the number of triphones used for accurate acoustic modeling. Therefore only those phones with phonetic features that have enough training samples are defined as separate phonetic units. The next problem is the choice of appropriate model complexity, which also depends on the available training data. Simple algorithms are proposed for forming dynamic base phone set and choosing complexity for separate models. Experiments on 10 hours of prepared speech from Lithuanian Radio news are conducted and indicate that the proposed modeling methodology allows significant improvement of recognition accuracy.


Citations (6)


... The segmentation of recorded material into sentence level was done. Laurinciukaite et al. [57] developed continuous Lithuanian speech corpus, Lithuanian Radio News Version 1 (LRN1) of 20 h and 50 min and contained various level of annotation carried by two-stage process, methods as well as tools. Maskeliunas et al. [80] checked the accuracy of the SR of 10 Lithuanian digits with the help of speech recognizers of English, German, French and Spanish. ...

Reference:

Automatic Speech Recognition System forTonal Languages: State‑of‑the‑Art Survey
LITHUANIAN CONTINUOUS SPEECH CORPUS LRN 1: AN IMPROVEMENT
  • Citing Article
  • January 2009

Information Technology and Control

... Validation of the speech corpus was performed when running speech recognition experiments using data from LRN0 corpus -the prototype of LRN 0.1 [16,6,17,8]. When running experiments, multiple errors (various typos, missing elements) were found and fixed. ...

A Technique for Choosing Efficient Acoustic Modeling Units for Lithuanian Continuous Speech Recognition

... WER was 20%. Silingas et al. [114] presented the investigation on acoustic modeling for Lithuanian continuous SR on the basis of simple as well as contextual phone set. Laurinciukaite et al. [58] discussed the design and development of the speech corpus Lithuanian Radio News Prototype Version 0.1 (LRN 0.1). ...

Towards Acoustic Modeling of Lithuanian Speech

... Lipeika and Laurinčiukaitė [21] presented an ASR system based on phonemes and syllables capable of achieving 56.67% recognition accuracy. Inspired by their success they proposed an updated model, that used phonemes, syllables, and words that could achieve 58.06% accuracy [22]. Lileikytė et al. [23] found that an effective speech transcription system for Lithuanian broadcast data capable of achieving 18.3% WER can be built by using a combination of unsupervised and semi-supervised training methods such as a deep neural network with a bottleneck layer. ...

Syllable-Phoneme based Continuous Speech Recognition
  • Citing Article
  • January 2006

... The problem of finding the best word to sub-word unit mapping for the applications of Lithuanian ASR was first addressed by Raškinis and Raškinienė (2003), followed by Šilingas (2005), Laurinčiukaitė and Lipeika (2007), Gales et al. (2015), Greibus et al. (2017), Lileikytė et al. (2018), and Ratkevicius et al. (2018). ...

Framework for Choosing a Set of Syllables and Phonemes for Lithuanian Speech Recognition
  • Citing Article
  • January 2007

Informatica