Anushiya Rachel Gladston

VIT University | VIT

PhD


Publications: 18
Reads: 1,851
Citations: 152
Additional affiliations
June 2019 - present
Karunya University
Position
  • Assistant Professor
April 2018 - March 2019
Sri Sivasubramaniya Nadar College of Engineering
Position
  • Senior Researcher
July 2012 - March 2018
Sri Sivasubramaniya Nadar College of Engineering
Position
  • Project Officer
Education
January 2013 - April 2019
Sri Sivasubramaniya Nadar College of Engineering
Field of study
  • Speech Signal Processing
September 2010 - June 2012
Sri Sivasubramaniya Nadar College of Engineering
Field of study
  • Applied Electronics
August 2006 - May 2010
Jerusalem College of Engineering
Field of study
  • Electronics and Communication Engineering


Publications (18)
Article
Full-text available
Human-computer interactions can be enhanced by making machines recognize the emotional state of a user and respond accordingly. This necessitates text-to-speech systems that can produce natural emotional speech. While several existing methods are data driven, the current work attempts to incorporate happiness into neutral speech using signal proces...
Article
Full-text available
Estimation of glottal closure instants (GCIs) plays a vital role in pitch-synchronous speech processing. The current work performs a qualitative and quantitative review of six existing GCI estimation algorithms, namely, group delay (GD)-based algorithm, DYPSA, YAGA, ZFF, SEDREAMS and DPI algorithm. This paper differs from existing review papers in...
Article
An augmentative and alternative speech communication (AASC) aid comprises a speech recognition system and a speech synthesis system. The main challenge in developing such an aid for dysarthric speakers lies in handling errors in the text derived from the recognition system. These errors (substitution, deletion, and insertion) may be due to inabilit...
Article
Full-text available
Estimation of glottal closure instants (GCIs) from an electroglottograph (EGG) signal can aid in clinical applications involving the diagnosis and treatment of speech pathologies and can also serve as a ground truth to assess algorithms that estimate GCIs from speech signals. In this regard, the current work proposes a phase-difference-based approa...
Article
The estimation of glottal closure instants (GCIs) plays a vital role in several glottal synchronous applications, and may not be restricted to clean speech. This necessitates the development of a GCI estimation algorithm that performs as well on degraded speech as on clean speech. Degradations in speech may be in the form of spectral or temporal pe...
Conference Paper
Glottal closure instants (GCIs) are characterized by a strong negative valley in the speech signal and an abrupt change in the amplitude. In this paper, an algorithm that exploits these two properties of a GCI is proposed to estimate the location of GCIs, specifically from telephone speech. The algorithm considers a symmetrized voiced segment as th...
Article
A text-to-speech synthesis system produces intelligible and natural speech corresponding to any given text. Two main attributes of a synthesizer are the quality of speech produced and the footprint size. In the current work, HMM-based speech synthesizers have been built and assessed using various kinds of phone-sized units, namely, monophone, triph...
Conference Paper
Speech has long been used as an effective medium in human-machine interactions. A conventional text-to-speech (TTS) system produces monotonous speech without any appropriate emotion. The inclusion of emotions in such synthesis systems will not only result in expressive speech but also reduce the monotony of the synthetic speech. The ti...
Conference Paper
Emotions express a person's internal state of being and are reflected in speech utterances. Emotions affect the time-domain characteristics of the speech signal, namely intonation patterns, speech rate, and short-term energy function. Conventional text-to-speech (TTS) systems are built to produce speech utterances for a given text, without an...
Conference Paper
Text-to-speech synthesis systems are expected to produce speech that is intelligible and natural. While conventional systems are capable of producing highly intelligible speech, naturalness needs to be improved, in the sense that regardless of the context, any given text is synthesized in a neutral tone. A number of existing techniques to synthesiz...
Conference Paper
Full-text available
A polyglot synthesizer is a text-to-speech synthesis system that converts mixed-language text into speech in a single speaker's voice. The straightforward way to develop such a system is to build multiple language-specific synthesizers or to build a single synthesizer after merging common phonemes. In these cases, either language-switching betwe...
Conference Paper
A speech synthesizer synthesizes speech in accordance with the text in any given language. Two important attributes of any synthesizer are the quality of the synthesized speech and the footprint size of the voice. Quality of the synthesized speech primarily refers to naturalness and intelligibility. The synthetic speech sounds natural when there...
Conference Paper
A mixed-language (polyglot) synthesizer is one that synthesizes intelligible multilingual speech with a single speaker's voice with appropriate pronunciations. Two main requirements of a mixed-language synthesizer are that (i) the transition from one language to another (language switching) and (ii) the influence of one language on another should n...
Conference Paper
Full-text available
State-of-the-art approaches to speech synthesis are unit-selection-based concatenative speech synthesis (USS) and hidden Markov model-based text-to-speech synthesis (HTS). The former is based on waveform concatenation of subword units, while the latter is based on generation of an optimal parameter sequence from subword HMMs. The quality of an HMM...
Conference Paper
A cochlear implant is a prosthetic device used to mimic the function of a cochlea in a person with profound and bilateral hearing loss caused by a damaged inner ear. The current work revolves around the design of a real-time, channel-vocoder-based model of a cochlear implant in LabVIEW and on the TMS320C6713 DSK. First, a uniform band 16-channel vocoder...
Conference Paper
An unrestricted text-to-speech system is expected to produce a speech signal, corresponding to the given text in a language, that is highly intelligible to a human listener. Presently, unit selection-based synthesis (USS) and statistical parametric synthesis are the state-of-the-art techniques for this task. Earlier, in [3], a concatenative...
Conference Paper
Full-text available
The cochlear implant is a prosthetic device used to replace a damaged inner ear. It consists of an externally worn speech processor and an internal receiver stimulator. The cochlear implant is patient specific and system specific, so in the current work a lab model for the speech processor, based on various vocoder models, is designed to analys...
