Sangramsing N. Kayte

  • Doctor of Engineering
  • Data Scientist at European Commission

The Luxembourg-based Computational Linguist for Translation delivers written text translations into and out of the EU

About

99
Publications
99,166
Reads
894
Citations
Introduction
I am a Machine Learning scientist with over 9 years of industrial, research and development experience. I specialize in Natural Language Processing modules such as sentence segmentation, word segmentation, text lemmatization, stop words, named entity recognition, and speech tagging, and in speech modules for prosody prediction and waveform generation on real-time, structured, or unstructured datasets. I'm fascinated by Machine Learning and Deep Learning Frameworks, which I frequently appl
Current institution
European Commission
Current position
  • Data Scientist
Additional affiliations
January 2019 - June 2020
University of Southern Denmark
Position
  • PostDoc Position
Description
  • Problem statement: identifying Common Procurement Vocabulary (CPV) codes and retrieving related information from the European Union tender database. I proposed a new method for automated text generation and subsequent classification of European Union Tenders Electronic Daily text documents into the dataset's predefined technological categories. The tender dataset provides information about the respective tenders, including features such as project name, title, description, type of contract, Co
January 2018 - December 2018
Elevare Systems.AI
Position
  • Group Leader
Description
  • Developed a voice-based virtual assistant that acts as an obesity prevention coach, guiding the user each day through the dietitian's diet plan and sending reminders according to the breakfast, lunch, and dinner schedule. The assistant learns the user's personality and improves gradually with each interaction. It also counts steps, visualizes calories burned and consumed, and stores patient history for future guidance.
January 2014 - November 2017
Dr. Babasaheb Ambedkar Marathwada University
Position
  • Researcher
Description
  • I worked on a large research project funded by DST, Govt. of India: "Development of the Indian language Audio Speech Synthesis (ASS) program". The main goal of this research is to incorporate the architecture of machine learning models. The study started with conventional approaches such as Unit Selection Synthesis (USS) and Hidden Markov Models (HMM), but accuracy later increased using Deep Neural Networks (DNN), Confrontational Network
Education
January 2019 - June 2020
University of Southern Denmark
Field of study
  • Data Scientist in Natural Language Processing

Publications

Publications (99)
Conference Paper
Full-text available
Stuttering is a complex speech disorder that makes speech unintelligible to other people and to ASR systems. In this paper, fundamental frequency (f0), vocal jitter, and shimmer were measured on numerical words spoken by 17 people who stutter (PWS) (13 male and 4 female) and 25 non-stutterers (NS) (15 male and 10 female), all speakers of the Marathi language. Jitter...
Book
Full-text available
Advances in digital imaging and computing power have made it possible to use data from medical images in new and revolutionary ways. This has also led to considerable interest in the development of automatic medical diagnosis systems to improve the services provided by the medical community. These systems aid physicians to diagnose, measure imp...
Preprint
Full-text available
The conversion of text into synthetic speech is known as text-to-speech synthesis (TTS). This can be achieved by concatenative speech synthesis (CSS) and hidden Markov model techniques. Quality is an important criterion for the artificial speech produced. The study involves the comparative analysis for quality of speech s...
Article
Full-text available
Exploration of new mines is vitally important for human life. Geospatial Information Systems (GIS) can be effectively used in the gathering, weighting, analyzing and presenting spatial and attribute information to facilitate the mine exploration process. The success of mine exploration largely depends on: the identification of governing factors, th...
Article
Full-text available
Fog computing extends the Cloud Computing paradigm to the edge of the network, thus enabling a new breed of applications and services. Defining characteristics of the Fog are: a) Low latency and location awareness; b) Widespread geographical distribution; c) Mobility; d) Very large number of nodes; e) Predominant role of wireless access; f) Strong pr...
Article
Full-text available
The speech synthesis system is an artificial production of speech with the help of speech synthesizers. It can be achieved using various techniques. During synthesis the smoothing of concatenating points is an important aspect to be studied. This paper attempts to find the effect of pitch-marking process using Time Domain-Pitch Synchronous Overlap...
Article
Full-text available
RS & GIS are vital tools in the present monitoring of frequent measurements of the Earth over decades with significantly high spatial resolution. Protectorate measurements of sea surface high temperature are a key component in the analyses of global warming and its effects. Altimeters and gravity missions such as GRACE are used to measure sea level...
Article
Full-text available
The oldest way of communication and information exchange between human beings is speech. It is the most prominent and natural form of communication between humans. Speech has the potential to be an important mode of interaction with computers. Communication between humans and computers is called human-computer interaction. Man machine interface have alw...
Article
Full-text available
With the coastal population increasing, storms have been inflicting unprecedented losses on coastal communities. Coastal agencies require advance information on the predicted path, intensity and progress of a storm and associated waves and storm surges; near-real-time information during the peak of the storm to monitor flooding and control rescue o...
Article
Full-text available
A new Maharashtra Marathi voice was created using four Festival modules (Clunits, Clustergen, Multisyn, and HTS), and an additional database was created with straight processing. All voices were created using the same database to allow for consistency and for easier comparison of the output. Once these voices have been created they can be used...
Chapter
Full-text available
The conversion of text into synthetic speech is known as text-to-speech synthesis (TTS). This can be achieved by concatenative speech synthesis (CSS) and hidden Markov model techniques. Quality is an important criterion for the artificial speech produced. The study involves the comparative analysis for quality of speech sy...
Article
Full-text available
This paper presents a study and experimental work showing how the SIOX technique works automatically to obtain good results. For the last 20 years, Magnetic Resonance Imaging (MRI) has become a reference examination for cardiac morphology, function and perfusion in humans. Yet, due to the characteristics of cardiac MRI and to the great variability of the...
Article
Full-text available
An automatic system is presented to find the location of the optic disc, which is a major landmark for the detection of other anatomic features and the macula. These localizations are found by applying principal component analysis to the image that contains each structure. The detection of the optic disc in colour retinal photographs is a significant task in an automated...
Article
Full-text available
Diabetic retinopathy is one vascular disorder where the retina is damaged because fluid leaks from blood vessels into the retina. Early diagnosis of diabetic retinopathy enables timely treatment and in order to achieve it a major effort will have to be invested into screening programs and especially into automated screening programs. For automated...
Article
Full-text available
With the rapid advancement in information technology and communications, computer systems increasingly offer users the opportunity to interact with information through speech. The interest in speech synthesis and in building voices is increasing. Worldwide, speech synthesizers have been developed for many popular languages English, Sp...
Article
Full-text available
A text-to-speech (TTS) synthesis system artificially produces human speech. This paper reviews recent research advances in the field of speech synthesis related to the statistical parametric approach based on HMM. In this approach, Hidden Markov Model based Text to speech synthesis (HTS) is reviewed in brief. The HTS is b...
Article
Full-text available
We describe in detail a Grapheme-to-Phoneme (G2P) converter required for the development of a good quality Marathi Text-to-Speech (TTS) system. The Festival and Festvox framework is chosen for developing the Marathi TTS system. Since Festival does not provide complete language processing support specific to various languages, it needs to be augmented...
Article
Full-text available
Marathi is one of the oldest languages in India. This research paper describes the development of a Marathi Text-to-Speech (TTS) system. In Marathi TTS the input is Marathi text in Unicode. The voices are sampled from real recorded speech. The objective of a text to speech system is to convert an arbitrary text into its corresponding spoken waveform....
Article
Full-text available
This research paper reports preliminary results of data-driven modeling of segmental phoneme duration for Marathi. Classification and Regression Tree based data driven duration modeling for segmental duration prediction is presented. A number of features are considered and their usefulness and relative contribution for segmental duration prediction...
Article
Full-text available
Much research has been done on the transformation of emotion. However, few studies have addressed Marathi. In this paper we construct a Marathi speech database to study the effects of change of emotion. Emotion is an important element in expressive speech synthesis and is investigated by many researchers. In this research paper we b...
Article
Full-text available
This research paper presents an approach to converting text to speech using a new methodology. The text to speech conversion system lets the user enter text in Marathi and produces sound as output. The paper presents the steps followed for converting text to speech for the Marathi language and the algorithm used for it. The focus of this paper is...
Article
Full-text available
This research paper presents two empirical studies that examine the influence of different linguistic aspects on prosody in Marathi. First, we analyzed a Marathi corpus with respect to the effect of syntax and information status on prosody. Second, we conducted a listening test which investigated the prosodic realisation of constituents in the Mara...
Article
Full-text available
This research paper describes the implementation of the first usable Marathi Text to Speech system for Maharashtra Marathi using the open source Festival TTS engine. Besides that, this research paper also discusses a few practical applications that use this system. This system is developed using di-phone concatenation approach in its waveform generat...
Article
Full-text available
Diabetic retinopathy is a disease of the retina, occurring in about a quarter of people with diabetes. The retina contains cells that convert light into electric signals, and these signals are then sent on to the brain. The symptoms can blur or distort the patient's vision and are a main cause of blindness. Microaneurysms are one of the prim...
Article
Full-text available
This research paper addresses the problem of improving the intelligibility of synthesized speech in a Marathi TTS synthesis system. Speech synthesis artificially generates human speech, and a text-to-speech system automatically converts normal language text into speech. This research paper deals with a corpus-driven Marathi...
Article
Full-text available
This research paper addresses the problem of improving the intelligibility of synthesized speech in a Marathi TTS synthesis system. Speech synthesis artificially generates human speech, and a text-to-speech system automatically converts normal language text into speech. This research paper deals with a corpus-driven Marathi...
Article
Full-text available
A text-to-speech synthesis system is one that is capable of producing intelligible and natural speech corresponding to any given text. A popular approach to speech synthesis is unit selection synthesis (USS). The current work focuses on developing a USS system for Marathi. Literature suggests that syllable is a suitable unit for Indian languages. C...
Article
Full-text available
This research paper reports preliminary results of data-driven modeling of segmental phoneme duration for Marathi. Classification and Regression Tree based data driven duration modeling for segmental duration prediction is presented. A number of features are considered and their usefulness and relative contribution for segmental duration prediction...
Article
Full-text available
We describe in detail a Grapheme-to-Phoneme (G2P) converter required for the development of a good quality Marathi Text-to-Speech (TTS) system. The Festival and Festvox framework is chosen for developing the Marathi TTS system. Since Festival does not provide complete language processing support specific to various languages, it needs to be augmented...
Article
Much research has been done on the transformation of emotion. However, few studies have addressed Marathi. In this paper we construct a Marathi speech database to study the effects of change of emotion. Emotion is an important element in expressive speech synthesis and is investigated by many researchers. In this research paper we b...
Article
Full-text available
This research paper addresses the problem of Marathi compound word splitting and its relevance to developing a good quality phonetizer for Marathi Speech Synthesis. The constituents of a Marathi compound word are not separated by space or hyphen. Hence, most of the existing compound splitting algorithms cannot be applied to Marathi. We propose a ne...
Article
Cloud computing is the latest trend in the IT world. The relative novelty and rapidly increasing growth of cloud computing make it an exciting area for research. The present paper aims to assess the state of cloud computing research. We portray a current landscape of this research stream, where it is today, and most importantly, given the current relevanc...
Article
Full-text available
Research on text-to-speech technology has attracted the interest of professional researchers in many languages, a consequence of the wide range of applications where text-to-speech is implemented. However, Maharashtra is a state in the western region of India and is the nation's third largest state and also the wor...
Article
Full-text available
This paper presents an efficient ear recognition technique which derives benefits from the local features of the ear and attempts to handle the problems due to pose, poor contrast, change in illumination and lack of registration. Recognizing humans by their ears has recently received significant attention in the field of research. Ear is the rich in...
Article
The main objective of this paper is to provide a comparison between two di-phone-based concatenative speech synthesis systems for Marathi language. In concatenative speech synthesis systems, speech is generated by joining small prerecorded speech units which are stored in the speech unit register. A di-phone is a speech unit that begins at the midd...
Article
Full-text available
Diabetic retinopathy (DR) is the prime cause of blindness in the working-age population of the world. A detection method is proposed to detect dark or red lesions such as microaneurysms and hemorrhages in fundus images. Developed during this work, this first is for collection of lesion data information and was used by the ophthalmol...
Article
Diabetic retinopathy is a disease of the retina, occurring in about a quarter of people with diabetes. The retina contains cells that convert light into electric signals, and these signals are then sent on to the brain. The symptoms can blur or distort the patient's vision and are a main cause of blindness. Microaneurysms are one of the prim...
Article
This research paper describes the implementation of screen readers for Marathi on Windows and Linux platforms using an unrestricted-domain Marathi Text To Speech (MTTS) system with Indian English support. The application integrates MTTS with the open source screen readers NVDA and ORCA. MTTS is a syllable based unit selection concatenative system, built arou...
Article
The automatic identification of abnormalities in retinal images using image processing techniques is very important in diabetic retinopathy screening. Manual annotations of retinal images are rare and expensive to obtain. The ophthalmoscope used for direct analysis is a small, portable apparatus consisting of a light source and a set of lenses v...
Article
Full-text available
This review is presented in three parts. The first part explains such terms as climate, climate change, climate change adaptation, remote sensing (RS) and geographical information systems (GIS). The second part highlights some areas where RS and GIS are applicable in climate change analysis and adaptation. Issues considered are snow/glacier monitor...
Article
A biometric system is essentially a pattern recognition system that operates by acquiring biometric data from an individual, extracting a feature set from the acquired data, and comparing this feature set against the template set in the database. Multimodal biometric systems are becoming more popular; fingerprint recognition is the most popular ph...
Article
Full-text available
Text to speech synthesis (TTS) is the production of artificial speech by a machine for the given text as input. The speech synthesis can be achieved by concatenation and Hidden Markov Model techniques. The voice synthesized by these techniques should be evaluated for quality. The study extends towards the comparative analysis for quality of speech...
Article
Full-text available
The research presents the capability of a Hidden Markov Model-based TTS system to produce Marathi speech. In this synthesis method, trajectories of speech parameters are generated from the trained Hidden Markov Models. A final speech waveform is synthesized from those speech parameters. In our experiments, spectral properties were represented by Mel Ceps...
Article
Full-text available
In this research paper, we discuss our efforts in the development of Marathi speech databases for building large vocabulary. We have collected speech data from about 5 speakers in this language. We discuss the design and methodology of collection of speech databases. We also present preliminary speech recognition results u...
Article
Full-text available
Biometrics refers to technologies that measure and analyze human body characteristics, such as DNA, fingerprints, eye retinas and irises, voice patterns, facial patterns and hand measurements, for authentication purposes. Biometrics has been in development for many years, and recent advancements in technology have made some biometric...
Article
Full-text available
The automatic identification of abnormalities in retinal images using image processing techniques is very important in diabetic retinopathy screening. Manual annotations of retinal images are rare and expensive to obtain. The ophthalmoscope used for direct analysis is a small, portable apparatus consisting of a light source and a set of lenses v...
Article
Full-text available
Cloud computing is the latest trend in the IT world. The relative novelty and rapidly increasing growth of cloud computing make it an exciting area for research. The present paper aims to assess the state of cloud computing research. We portray a current landscape of this research stream, where it is today, and most importantly, given the current relevanc...
Article
Full-text available
This review is presented in three parts. The first part explains such terms as climate, climate change, climate change adaptation, remote sensing (RS) and geographical information systems (GIS). The second part highlights some areas where RS and GIS are applicable in climate change analysis and adaptation. Issues considered are snow/glacier monitor...
Article
Full-text available
This research paper describes the implementation of screen readers for Marathi on Windows and Linux platforms using an unrestricted-domain Marathi Text To Speech (MTTS) system with Indian English support. The application integrates MTTS with the open source screen readers NVDA and ORCA. MTTS is a syllable based unit selection concatenative system, built arou...
Article
Multi-modal biometrics has attracted strong interest in recent years. This paper discusses approaches to multi-modal biometrics based on the biometric source, the type of sensing used, and the depth of collaborative interaction in the processing. This paper also attempts to identify some of the challenges and issues that confront research in mult...
Article
Full-text available
Text to speech synthesis (TTS) is the production of artificial speech by a machine for the given text as input. The speech synthesis can be achieved by concatenation and Hidden Markov Model techniques. The voice synthesized by these techniques should be evaluated for quality. The study extends towards the comparative analysis for quality of speech...
Article
Full-text available
Research on text-to-speech technology has attracted the interest of professional researchers in many languages, a consequence of the wide range of applications where text-to-speech is implemented. However, Maharashtra is a state in the western region of India and is the nation's third largest state and also the wor...
Article
Full-text available
The research presents the capability of a Hidden Markov Model-based TTS system to produce Marathi speech. In this synthesis method, trajectories of speech parameters are generated from the trained Hidden Markov Models. A final speech waveform is synthesized from those speech parameters. In our experiments, spectral properties were represented by Mel Ceps...
Article
Full-text available
Text to speech synthesis (TTS) is the production of artificial speech by a machine for the given text as input. The speech synthesis can be achieved by concatenation and Hidden Markov Model techniques. The voice synthesized by these techniques should be evaluated for quality. The study extends towards the comparative analysis for quality of speech...
Article
Full-text available
In this research paper, we discuss our efforts in the development of Marathi speech databases for building large vocabulary. We have collected speech data from about 5 speakers in this language. We discuss the design and methodology of collection of speech databases. We also present preliminary speech recognition results u...
Article
Full-text available
Speech synthesis is the process of converting written text into machine-generated synthetic speech. Concatenative speech synthesis systems form utterances by concatenating pre-recorded speech units. Corpus-based methods use a large inventory to select the units to be concatenated. In this paper, we design and develop an intelligible and natural sou...
Article
Full-text available
With the rapid advancement in information technology and communications, computer systems increasingly offer users the opportunity to interact with information through speech. The interest in speech synthesis and in building voices is increasing. Worldwide, speech synthesizers have been developed for many popular languages English, Sp...
Article
Full-text available
The main objective of this paper is to provide a comparison between two di-phone-based concatenative speech synthesis systems for Marathi language. In concatenative speech synthesis systems, speech is generated by joining small prerecorded speech units which are stored in the speech unit register. A di-phone is a speech unit that begins at the midd...
Article
Full-text available
Biometrics refers to technologies that measure and analyze human body characteristics, such as DNA, fingerprints, eye retinas and irises, voice patterns, facial patterns and hand measurements, for authentication purposes. Biometrics has been in development for many years, and recent advancements in technology have made some biometric...
Article
A biometric system is essentially a pattern recognition system that operates by acquiring biometric data from an individual, extracting a feature set from the acquired data, and comparing this feature set against the template set in the database. Multimodal biometric systems are becoming more popular; fingerprint recognition is the most popular ph...
Article
Full-text available
Speech is used to express information, emotions, and feelings. Speech synthesis is the technique of converting given input text to synthetic speech. Speech synthesis can be used to read text such as SMS, newspapers, site information etc. and can be used by blind people. Speech synthesis has been widely researched in the last four decades. The quality and...
Article
Full-text available
The main objective of this paper is to provide a comparison between two di-phone-based concatenative speech synthesis systems for Marathi language. In concatenative speech synthesis systems, speech is generated by joining small prerecorded speech units which are stored in the speech unit register. A di-phone is a speech unit that begins at the midd...
Article
Full-text available
Diabetic retinopathy (DR) is caused by damage to the retina when fluid leaks from blood vessels into the retina, damaging the posterior part of the eye of the diabetic patient. Diabetes is a disease that occurs when the pancreas does not secrete enough insulin or the body is unable to process it properly. There are two main types of diabetic retinopathy; the first is non-prolif...
Research
Speech is an integral part of communication. A speech disorder is a problem with fluency, voice, and/or how a person produces speech sounds. The main focus of this study is to identify the difference between normal and disordered speech. The proposed work classifies normal and abnormal speech. The experimental investigation elucidated MFCC and D...
Research
Full-text available
Speech is a main source of human communication. Dysfluent speech is a problem with fluency, voice, and/or how a person produces speech sounds. The main objective of the study is to identify the distinguishing properties of fluent and dysfluent speech. The proposed work classifies fluent and dysfluent speech. The KNN classifier is used to cla...
Research
Full-text available
This paper seeks to reveal the various aspects of Marathi speech synthesis. It reviews research developments in international languages as well as Indian languages and then focuses on developments in Marathi with regard to other Indian languages. It is anticipated that this work will serve to explore more in Marathi l...
Article
Full-text available
This paper seeks to reveal the various aspects of Marathi speech synthesis. It reviews research developments in international languages as well as Indian languages and then focuses on developments in Marathi with regard to other Indian languages. It is anticipated that this work will serve to explore more in Marathi l...
Article
Full-text available
Diabetic retinopathy is a medical condition where the retina is damaged because fluid leaks from blood vessels into the retina. Ophthalmologists recognize diabetic retinopathy based on features such as blood vessel area, exudates, hemorrhages, microaneurysms and texture. In this paper we review algorithms used for the extraction of these features fr...
Article
Full-text available
This paper proposes a framework for the analysis of cardiac MRI to detect cardiovascular disease easily and increase patient life. Segmentation of volumetric medical data is extremely time-consuming when using semi-automatic segmentation techniques. The first contribution involves the introduction of a new algorithm for fi...
Article
Full-text available
Driver safety is of utmost importance in India, a country with approximately 308 million licensed drivers (Our Nation's Highways, 3010). Driving while distracted or drowsy decreases performance and endangers lives. Yet in today's bustling society, driving when distracted and/or sleepy is unfortunately more often the norm than the exception. In...
Article
Full-text available
In this paper, land use/land cover (LU/LC) changes in an urban area, Aurangabad, from 1990 to 2014 were determined using Geographical Information Systems (GIS) and remote sensing technology. The study used the Survey of India topographic map 57 O/6 and the remote sensing data of LISS III and PAN of IRS ID of...
Article
Full-text available
This paper proposes a framework for the analysis of cardiac MRI to detect cardiovascular disease easily and increase patient life. Segmentation of volumetric medical data is extremely time-consuming when using semi-automatic segmentation techniques. The first contribution involves the introduction of a new algorithm for fit...
Chapter
Full-text available
Diabetic retinopathy is a cause of blindness, and its early detection prevents blindness. It is a leading cause of blindness in developed countries. Diabetes mellitus is the inability of the body to use and store sugar properly, resulting in high blood sugar levels. This results in changes in veins, arteries and c...
Article
Full-text available
In this paper, we present an algorithm for the classification and calculation of retinal blood vessel parameters, including the tortuosity of the extracted retinal blood vessels. The algorithm proceeds through three main steps: 1. preprocessing operations on high resolution fundus images 2. For retinal vessel extraction, simple vessel segmentation tech...
Article
The number of institutions and enrollment in higher education continue their rapid growth, but the quality of this education remains uncertain. A small number of state-subsidized institutions attract a thin top layer of talent from each year's cohort. High selectivity of admission to these elite institutions provides a screen valued by potential emp...
Article
In a system of speech recognition containing words, the recognition requires the comparison between the entry signal of the word and the various words of the dictionary. The problem can be solved efficiently by a dynamic comparison algorithm whose goal is to put in optimal correspondence the temporal scales of the two words. An algorithm of this ty...
Article
Full-text available
Speech synthesis is the process of converting written text into machine-generated synthetic speech. Concatenative speech synthesis systems form utterances by concatenating pre-recorded speech units. Corpus-based methods use a large inventory to select the units to be concatenated. In this paper, we design and develop an intelligible and natural sou...
