
Deepa P Gopinath- PhD
- Professor (Assistant) at College of Engineering Trivandrum
Deepa P Gopinath
- PhD
- Professor (Assistant) at College of Engineering Trivandrum
About
22
Publications
7,576
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
81
Citations
Introduction
Skills and Expertise
Current institution
Publications
Publications (22)
Modern text-to-speech (TTS) systems use deep learning to synthesize speech increasingly approaching human quality, but they require a database of high quality audio-text sentence pairs for training. Malayalam, the official language of the Indian state of Kerala and spoken by 35+ million people, is a low resource language in terms of available corpo...
Laryngeal pathologies resulting in voice disorders are normally diagnosed using invasive methods such as rigid laryngoscopy, flexible nasopharyngo-laryngoscopy and stroboscopy, which are expensive, time-consuming and often inconvenient to patients. Automatic Voice Disorder Detection (AVDD) systems are used for non-invasive screening to give an indi...
The concept of complex numbers (CNs) is used in many disciplines. In many cases, students find it difficult to understand the logic behind CNs. Rotations, vibrations, and oscillations result in sine or cosine waves. Mathematical representation of rotation/vibration/oscillation is done in two ways—trigonometry and complex numbers. But the algebraic...
Text to speech synthesis system intended for any language, converts the given text in that language to corresponding speech. The major challenge in TTS system is to generate artificial speech which appears to be natural and intelligible. This is essential for visually impaired people to properly understand and comprehend the generated speech. Natur...
In this paper we propose a new method for modeling intonation for Malayalam language Text To Speech (TTS) synthesis system. It is obtained by combining curve fitting and CART. Intonation provides naturalness and intelligibility to synthesized speech. Intonation of a sentence is predicted from raw text using positional information of syllables and w...
This paper describes duration modeling in Text To Speech Synthesis (TTS) for Malayalam language using open source Festival TTS engine. Classification and Regression Tree (CART) based data-driven phoneme duration modeling is presented. A number of features are extracted for predicting the duration of phonemes. Objective evaluation test was conducted...
Entropy is a statistical parameter which measures how much information is produced on the average for each letter of a text in the language. Every language normally has certain hidden statistically significant features and certain redundancy. These features can be utilized to form a suitable text compression tool for the optimum use of resources. B...
Synthesized speech output from a Text To Speech (TTS) synthesis system requires the inclusion of prosodic features like pause durations, syllable prolongations etc. to improve intelligibility. These factors also make the synthesized speech more understandable to the physically disabled. This paper focuses on analyzing the factors that affect pause...
Speech and music prosody deals with the presence of rhythm. This work is the analysis of the presence of the chaotic nature in the duration patterns of speech segments. A chaotic system is one whose characteristics are apparently random with some amount of predictability. This study is carried out both quantitatively and qualitatively. The qualitat...
Synthesis of natural sounding speech is the greatest challenge in a Text-to-Speech Synthesis (TTS) system. In natural speech, duration, intensity and pitch are dynamically varied which is manifested as rhythm or prosody of speech. If these variations are not recreated, the synthesized speech will sound robotic. Synthesis of good quality speech depe...
Classification of organisms into different categories using their genomic sequences has found importance in study of evolutionary characteristics, specific identification of previously unknown organisms, study of mutual relationships between organisms and many other aspects in the study of living things. Chaos game representation (CGR) uniquely rep...
Classification of organisms into different categories using their genomic sequences has found its importance in the study of evolutionary characteristics of organisms and specific identification of previously unknown organisms in biodiversity studies and related areas. Chaos game representation (CGR) uniquely represents DNA sequence in a visual for...
The duration of phoneme vary dynamically during continuous speech giving rhythm or prosody to speech. To make synthesized speech appear natural, the durational variation is recreated using duration models. This paper proposes a hybrid duration model combining CART and HMM based on duration analysis of phonemes in Malayalam language. The first part...
Naturalness can be achieved in a text-to-speech (TTMP) by incorporating prosodic features which include duration of basic units, intonation patterns and stress. This paper presents the preliminary duration analysis required for a speech synthesis system for Malayalam, one among the 17 languages spoken in India. The statistical analysis of the durat...
The inclusion of emotional aspects into speech can improve the naturalness of speech synthesis system. The different emotions -sadness, angry, happiness are manifested in speech as prosodic elements like time duration, pitch and intensity. The prosodic values corresponding to different emotions are analyzed at word as well as phonemic level, using...
Prosody or rhythm in speech is manifested as variation in features like duration, intensity and pitch. The rhythm in poetry is dictated by its metre which is followed by the person reciting it. Similarly each speaking style has a basic rhythm. The entire conversation is obtained by the repetition of this rhythm.The purpose of the present study is t...
Natural speech has a particular rhythm or prosody. This prosody depends on different factors like dialect, style of speech and emotional state of the speaker. While acquiring a specific speaking style in a particular language, the rhythm of the words, phrases and sentences get stored in the human brain. At the time of speaking, these patterns are r...
Classification of organisms into different categories using their genomic sequences has found importance in study of evolutionary characteristics, identification of previously unknown organisms, study of mutual relationships between organisms and many other aspects in the study of living things. Chaos game representation (CGR) uniquely represents D...
A very challenging task encountered in the field of bioinformatics is differentiating coding regions from noncoding regions in newly sequenced genomes. Identification of protein coding regions in a DNA sequence is a significant task because, it is the basic step in gene location identification. The coding region shows a periodic organization of thr...