Conference Paper

Fuzzy voice segment classifier for voice pathology classification

Sch. of Mechatron. Eng., Univ. Malaysia Perlis, Arau, Malaysia
DOI: 10.1109/CSPA.2010.5545316 Conference: Signal Processing and Its Applications (CSPA), 2010 6th International Colloquium on
Source: IEEE Xplore

ABSTRACT Speech is one of the common modes of communication and it is a process of transferring information from one entity to another. In recent years there has been much research on unvoiced/voiced classification and voice pathology classification. In this research work a simple fuzzy classifier has been designed to segment the voiced and unvoiced portions of a speech signal. A simple feature extraction algorithm is proposed to extract the Tri Mean relative average perturbation (Tri Mean-RAP) features from the segmented voice portion of the signal. Further, using PCA transformation the significant Tri Mean-RAP features are extracted and a simple neural network model is developed. In the proposed fuzzy classifier, the energy per frame and change in energy level between the adjacent frames are fuzzified and rules are formulated to segment the voiced portion. The Tri Mean-RAP features are then extracted from the segmented voice portion. The proposed methods are validated through simulation.

  • [Show abstract] [Hide abstract]
    ABSTRACT: Speech recognition is one of the important areas in digital speech processing. The study of speech recognition is a part of a quest for artificially intelligent machines that can hear and understand spoken information. The conventional methods for speech recognition are very complicated and time consuming. To apply fuzzy logic to speech recognition is a new attempt in digital speech processing. The approach proposed in the paper simplifies the algorithm in speech recognition and makes the real-time processing time shorter. The situation considered in this paper is the simplest, i.e., the situation of speaker dependence, small vocabulary and isolated words
    Neural Networks, 1999. IJCNN '99. International Joint Conference on; 02/1999
  • [Show abstract] [Hide abstract]
    ABSTRACT: We present a neural network application to the diagnosis of vocal and voice disorders, these disorders should be diagnosed in the early stage and normally cause changes in the voice signal. So we use acoustic parameters extracted from the voice as inputs for the neural network. In this paper, we focus our application on the distinction between pathologic and nonpathologic voices. The performance of the neural network is very good, 100% percent correct in the test. Furthermore, we have used neural network techniques to reduce the initial number of inputs (35), we conclude that only two acoustic parameters are needed for the classification between normal and pathological voices. The application can be a very useful diagnostic tool because it is noninvasive, makes possible to develop an automatic computer-based diagnosis system, is objective and can also be useful for evaluation of surgical, pharmacological and rehabilitation processes. Finally, we discuss the limitation of our work and possible future research
    Neural Networks, 2000. IJCNN 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on; 02/2000

Full-text (2 Sources)

Available from
May 16, 2014