Magdalena Igras-Cybulska

AGH University of Science and Technology in Kraków | AGH · Department of Electronics

Doctor of Engineering

About

55
Publications
14,726
Reads
151
Citations
Introduction
Biomedical Engineer (specializing in Computer Science and Electronics in Medicine). My research focuses on emotions and other paralinguistic aspects of voice and speech. I am also interested in virtual reality and user experience research.
Additional affiliations
March 2017 - present
AGH University of Science and Technology in Kraków
Position
  • Scientific assistant
March 2017 - December 2019
Techmo Sp. z o.o., Kraków, Poland
Position
  • Speech processing expert
October 2016 - present
AGH University of Science and Technology in Kraków
Position
  • Research Assistant
Description
  • Introduction to Interface Design
  • Multimedia and Presentation of Data
  • Introduction to Virtual Reality
Education
October 2011 - September 2016
AGH University of Science and Technology in Kraków
Field of study
  • Biocybernetics and Biomedical Engineering
March 2010 - July 2011
AGH University of Science and Technology in Kraków
Field of study
  • Computer Science and Electronics in Medicine
October 2006 - January 2010
AGH University of Science and Technology in Kraków
Field of study
  • Biomedical Engineering

Publications (55)
Article
Full-text available
The article presents a database of emotional speech recordings developed at AGH, collected for research on the affective content of the speech signal. The recording procedure, parameters, structure, metadata, and license of the database are described. Example applications are presented: developing methods for detecting emotional states in the voice and normalizing recordings...
Conference Paper
This paper proposes an algorithm for automatic detection of breath events in a speech signal. The occurrence of breath events in recordings and their statistical parameters are discussed. Also the role of breath pauses in signaling punctuation and the emotional or physical state of the speaker, in both spontaneous and read spee...
Conference Paper
Full-text available
A supporting system of voice analysis for emergency call centers is being developed at AGH University of Science and Technology in Krakow. The aim of our work is to provide an innovative supporting tool for rapid and accurate assessment of caller profile. The project covers not only speaker identification (when speaker's speech sample is known), bu...
Article
Full-text available
Statistics of pauses occurring in Polish are described as a potential source of biometric information for automatic speaker recognition. The usage of three main types of acoustic pauses (silent, filled and breath pauses) and syntactic pauses (punctuation marks in speech transcripts) was investigated quantitatively in three types of spontaneous spee...
Conference Paper
Full-text available
A new VR application for voice and speech training has emerged from a problem observable in everyday life: anxiety about public speaking. In the design process, we incorporated both the domain knowledge of experts and research with end-users in order to explore the needs and the context of the problem. Functionalities of the prototype are the ef...
Conference Paper
We introduce a new IVE designed to compare user interaction between a mode with a traditional graphical user interface (GUI) and a mode in which every interface element is replaced by a voice user interface (VUI). In each version, 4 scenarios of interaction with a virtual assistant in a sci-fi location are implemented, each of them lasting seve...
Conference Paper
In 2019 we were asked to prepare a VR stand providing an opportunity to experience applications with social impact during an open event. Every visitor of our stand could choose among several applications (therapeutic, relaxing vs educational vs perspective-taking & empathy-building) and after testing was asked (optionally) to fill in the shortened...
Conference Paper
In the mining industry, accessing locations and observing processes for educational purposes can be problematic for safety reasons or because of the high cost of experiencing them in person. Immersive technologies, especially applied in an accessible and responsive design such as WebXR, have great potential to make such educational and training applications...
Conference Paper
Full-text available
Generating multiple bots which look and move authentically and at the same time differently from each other is a difficult task. While working on it, researchers need to deal with problems of low time-efficiency and complexity of the process as well as trade-off between realism of a character and their generation time, while hardware performance sh...
Conference Paper
First-person perspective taking in head-mounted displays makes them a perfect interface for experiencing empathy toward other people. Since some intercultural misunderstandings stem from ethnocentrism, it is worth considering the possibilities given by the VR experiences to explain behaviors toward out-groups and induce empathetic actions. In t...
Article
People use their voices to communicate not only verbally, but also emotionally. This article presents both theories and methodologies that concern emotional vocalizations at the intersection of psychology and digital signal processing. Specifically, it demonstrates the process of encoding and decoding, including a review and comparison of strategies...
Article
Full-text available
The human voice is one of the basic means of communication, through which one can also easily convey an emotional state. This paper presents experiments on emotion recognition in human speech based on the fundamental frequency. The AGH Emotional Speech Corpus was used. This database consists of audio samples of seven emotions acted by 12 different s...
Preprint
Problems of acceptance and perceptual assessment of robots dedicated to interact with humans significantly influence user experience, as well as quality and efficiency of the interaction. Analysis of attitude towards a robot can take place during the direct interaction, on the basis of the presentation of its activities, e.g. in the movie, or using...
Conference Paper
The aim of this paper is to introduce a novel speech corpus, POLAC (Politeness Across Cultures), and to present some preliminary results on acoustic correlates of intended politeness. It was created for cross-cultural comparative studies of Poles and Canadians. It consists of 180 three-minute-long conversations in an arranged situation where the...
Article
Full-text available
The study discussed in this article was carried out as a pilot study to assess the process, resources and data management scheme (Thabane et al., 2010) to be used in a large-scale experiment on filmic audio description (AD) reading rate. As part of this study we defined the reading rate in filmic AD context. We described the characteristic features...
Conference Paper
Full-text available
The purpose of the study is to determine the acoustic aspects of the impact of a voice training intervention. The research group participated in a voice training program of Postgraduate Studies of Voice and Speech Training at the University of Social Sciences and Humanities in Warsaw. The subjects of the research are adults, aged 25-61, male and fe...
Conference Paper
Full-text available
Sound in audio description (AD) is a rarely researched topic (Orero & Szarkowska, 2015; Remael, 2012; Szarkowska & Jankowska, 2016) and its reading speed attracted interest of even fewer researchers (Vercauteren, 2016). In our presentation, we would like to present the results of our research into the actually applied and preferred reading speed of...
Thesis
Full-text available
Voice carries a lot of information: about semantic content we want to communicate, about our identity and also about affective, psycho-social or physical attributes of the speaker. Speaker’s states and traits significantly affect the voice itself as well as speaking manner, syntax and semantic content. From the technical point of view, all this info...
Conference Paper
Full-text available
The purpose of the study is to examine whether the acoustic characteristics of the voice can determine the quality of sermons, and to identify the acoustic aspects of the voice during preaching against the background of theoretical, rhetorical descriptions.
Conference Paper
A transmitted nonverbal message may be consistent or inconsistent with the verbal message, depending on cultural rules of emotional expression, which define the range of socially accepted expressions. Given the high value placed on being a sincere person in Poland and on being a nice person who shows friendliness in Canada, one can expect different...
Conference Paper
Full-text available
The aim of the described system is to provide an online solution that profiles customers of a call centre. As an auxiliary module it might enhance functionality of modern call centre systems by active voice analysis. Integrated with existing databases, our system allows for analysis of constant and temporal caller characteristics during a call — re...
Article
Full-text available
In this article the authors present experiments on annotating sentence boundaries in Polish speech using acoustic cues as a source of information. The main result of the investigation is an algorithm for detecting the syntactic boundaries that occur in the places of punctuation marks. In the first stage, the algorithm...
Chapter
In this study we measure acoustic cues that accompany the expression of irony in the voice. It is assumed that speakers tend to modulate their voices in a specific manner when communicating irony or sarcasm. We tried to find a pattern to describe it using a set of parameters connected to speech prosody, voice quality, voice timbre and harmonic structure of v...
Conference Paper
Full-text available
In this article [1] we presented statistical models of acoustic phenomena observed within realizations of phonemes and the correlations of the acoustic properties with functional features, such as accents and sentence boundaries. Selected features of Polish phonemes' realizations (the duration, energy and power of the phonemes, the fundamental fre...
Conference Paper
Association rule learning is an approach for discovering interesting relationships in large databases. The analysis of relations, invisible at first glance, is a source of new knowledge which can be subsequently used for prediction. We used this data mining technique (which is an automatic and objective method) to learn about interesting affects as...
Article
Full-text available
The paper exemplifies an interdisciplinary study involving both philology and acoustics. It presents the results of a pilot study aimed at capturing the parallels between the metaphorical expressions describing voice (e.g. jasny ‘bright’, głuchy ‘dull’) and the parameters hidden behind these terms. It is an attempt at objectifying the description.
Article
Full-text available
A comparative analysis of multi-language speech samples is conducted using acoustic characteristics of phoneme realisations in spoken languages. Different approaches to investigation of phonemic diversity in the context of language evolution are compared and discussed. We introduced our approach (materials and methods) and presented preliminary res...
Article
Full-text available
In this article the authors investigated and presented statistical models of acoustic phenomena observed within realizations of phonemes and the correlations of the acoustic properties with functional features, such as accents and sentence boundaries. The authors used two databases: the first one contained separately produced sentences and the seco...
Conference Paper
Full-text available
With the advancement of cognitive science, the psychology of emotions has become the subject of many studies. Affect-related physiological changes in the human body, which influence respiration, phonation and articulation, determine the character of vocalization and produce emotion-specific patterns of acoustic parameters of a s...
Conference Paper
The paper presents an approach to automatic recognition of emotions in speech signals. The applied method is based on the composition of two discrete frequency transformations. The wavelet transform is calculated first, and then the Fourier transform is applied. The Fourier-wavelet transform representation is used to find the differences between emot...
Conference Paper
The purpose of the study is to determine the acoustic aspects of the impact of a voice training intervention. The research group participated in a voice training program of Postgraduate Studies of Voice and Speech Training at the University of Social Sciences and Humanities in Warsaw. The subjects of the research are adults, aged 25-61, male and fe...
Conference Paper
The aim of our work is to develop software that identifies a caller or builds the caller's profile by analyzing their voice. Based on collected speech samples, our system aims to identify emergency callers both on-line and off-line. This homeland security project covers speaker recognition (when speaker's speech sample is known), speaker's ge...
Conference Paper
The paper presents an analysis of prosodic parameters of speech (energy, phoneme duration) as speaker-characteristic features. The most significant parameters of the features were investigated using the CORPORA speech database and described statistically. We observed that phoneme duration depends on a speaker, as well as the preboundary lengthening of...
Conference Paper
Full-text available
The results of an investigation of the differences among the phonemes of 574 languages from all over the world are presented. We attempt to verify the hypothesis of an African origin for all languages and the gradual diversification of languages in other parts of the globe. The obtained results justify language classification by applying the methods used in evolu...
Conference Paper
A new approach to speech normalisation is presented. A method that finds the optimal coefficient for the linear slope of the warping function is described. Affine normalisation functions are proposed. Their coefficients depend on expected values of frequency when speech spectra are used as a density of probabilities. The method was developed for c...
Conference Paper
Full-text available
The paper presents statistical phonetic data of Polish collected from a corpus. Lengths of phonemes vary from 5 ms to 670 ms. Average durations of Polish phonemes are presented, as well as an important anomaly of longer phonemes at the end of sentences, which is the main topic of the paper. This observation can be used in speech recognition for auto...
Article
Full-text available
During public presentations or interviews, speakers commonly and unconsciously overuse interjections or filled pauses that interfere with speech fluency and negatively affect listeners' impression and speech perception. Types of disfluencies and methods of detection are reviewed. The authors carried out a survey whose results indicated the most adverse el...
Article
Full-text available
The authors present the largest audiovisual database of Polish speech, and the only one recorded in HD quality. The article gives a brief description of similar databases for other languages and a technical description of the database. The challenges encountered while building the database and its planned applications are also discussed.
Article
Full-text available
This work attempts to measure features of the speech signal that correlate with its emotional content (using basic emotions as an example). A speech corpus is presented that was designed to enable differential analysis independent of speaker and content, and tests were carried out to assess its usefulness for automating the detection of emotions in...

Questions (2)
Question
I am looking for open speech databases that include information about the speaker's age and in which the age groups are (more or less) equally represented.
Question
Hello,
there are a lot of papers on differences between read and spontaneous speech, even detailed comparisons. However, is anyone aware of attempts to automatically detect whether a given recording contains read or spontaneous speech?
Thank you for any helpful suggestions.

Projects (6)
Project
We are building a virtual trainer for public speaking and voice emission. Training your voice and speaking skills is supported by feedback from an automatic speech processing engine. You can practice public speaking in front of a virtual audience or train your voice in relaxing natural locations.
Archived project
Modeling of acoustic correlates of punctuation in spoken Polish
Archived project
The project aims at measuring the acoustic diversity in the phoneme inventories of the world’s languages.