Hocine Teffahi’s research while affiliated with University of Sciences and Technology Houari Boumediene and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (37)


Mixed ASR System for Amazigh and Arabic UnderResourced Dialects in Maghreb Region
  • Preprint

July 2024

·

21 Reads

·

·

·

[...]

·

Hocine Teffahi

Automatic Speech Recognition (ASR) technology plays an essential role in human-machine interaction. In this paper, we describe our conducted speech experiments that are realized to develop and adapt a mixed automaticspeech recognition system based on spoken digits for Amazigh and Arabic dialects that are considered as under-resourced dialects in the Maghreb region.Our used database includes speech samples collected from 24 Moroccans and Algerian speakers including both males and females. The designed system is implemented based on the combination of hidden Markov models and Gaussian mixture models, as well as the Mel frequency spectral coefficients (MFCCs) feature extraction method.


A Transfer Learning Approach For Identifying Spoken Maghrebi Dialects

July 2024

·

32 Reads

This paper investigates a transfer learning approach to solve the spoken dialects identification problem for some under-resourced dialects of the Maghrebi region, including Algerian Arabic Dialect (AAD), Algerian Berber Dialect (ABD), Moroccan Arabic Dialect (MAD), andMoroccan Berber Dialect (MBD). In our experiments, we used different Transfer learning models, namely: Residual Neural Network (Resnet50, Resnet101), and Visual Geometric Group (VGG16, VGG19) using anin-house corpus that we built for each dialect. The corpus is composed of ten digits recorded for each of the aforementioned dialects, repeated ten times by six native speakers. The results vary according to different reasons: the number of epochs, neurons, batch size, and also the datasets combinations used in training and test phases. The best score found is 90.4% by the VGG19 model. Overall, the results show the robustness of our system based on the VGG16 model with an average identification rate of 62.7%.


Contribution à l'amélioration du signal de synthèse dans un système TTS pour la langue arabe

December 2023

·

16 Reads

Par ses propriétés morphologiques et syntaxiques la langue arabe est considérée comme une langue très difficile à maîtriser dans le domaine du traitement automatique et les systèmes de synthèse à partir du texte arabe sont donc très peu nombreux. Le but de notre travail est une réalisation d'un système de synthèse de la parole par concaténation (sélection) dynamique d'unités dans un corpus pour la langue arabe baptisé GArabic TTS sous l'environnement Matlab. Le texte à introduire est un texte non voyellé qui facilite l'utilisation du système, la sortie est disponible uniquement pour une voix masculine. L'évaluation de GArabic TTS est basée sur un test subjectif et objectif ; en ce qui concerne l'intelligibilité, aspects naturels (l'écoute) et la qualité (PESQ). L'évaluation finale de la qualité globale du système est jugée satisfaite.


System architecture
Acoustic-spectral based DI component
Spectrogram based DI component
ASR system
HMM structure with 3 states

+3

Enhancement of spoken digits recognition for under-resourced languages: case of Algerian and Moroccan dialects
  • Article
  • Publisher preview available

June 2022

·

179 Reads

·

13 Citations

International Journal of Speech Technology

In this paper, we present a set of experiments aiming to improve the recognition of spoken digits for under-resourced dialects of the Maghrebi region, using a hybrid system. Indeed, integrating a Dialect Identification module into an Automatic Speech Recognition (ASR) system has shown its efficiency in previous works. In order to make the ASR system able to recognize digits spoken in different dialects, we trained our hybrid system on Moroccan Berber Dialect “MBD,” Moroccan Arabic Dialect “MAD,” and Algerian Arabic dialect “AAD,” in addition to Modern Standard Arabic. We have investigated five machine learning based classifiers and two deep learning models: the first one is based on Convolutional Neural Network (CNN), and the second one uses two pre-trained models: Residual Deep Neural Network (Resnet50 and Resnet101). The findings show that the CNN model outperforms the other proposed methods and consequently enhances the performance of spoken digit recognition system by 20% for both Algerian and Moroccan dialects.

View access options

Emotion recognition in Arabic speech

November 2019

·

215 Reads

·

11 Citations

The general objective of this paper is to build a system in order to automatically recognize emotion in speech. The linguistic material used is a corpus of Arabic expressive sentences phonetically balanced. The dependence of the system on speaker is an encountered problem in this field; in this work we will study the influence of this phenomenon on our result. The targeted emotions are joy, sadness, anger and neutral. After an analytical study of a large number of speech acoustic parameters, we chose the cepstral parameters, their first and second derivatives, the Shimmer, the Jitter and the duration of the sentence. A classifier based on a multilayer perceptron neural network to recognize emotion on the basis of the chosen feature vector that has been developed. The recognition rate could reach more than 98% in the case of an intra-speaker classification and 54.75% in inter-speaker classification. We can see the system’s dependence on speaker clearly.



Design and Implementation of a Diacritic Arabic Text-To-Speech System

July 2017

·

1,827 Reads

·

16 Citations

The International Arab Journal of Information Technology

The absence of the diacritical marks from the modern Arabic text generates a significant increase of the ambiguity in the Arabic text, which can cause confusion in the pronunciation of a written word. Despite the fact that the reader with a certain level of Arabic knowledge can easily recover the missing diacritics by: using the words context, the morphology and the syntax knowledge of the Arabic language. This paper describes a design and implementation of a Text-To-Speech system for a diacritic Arabic text. The goal of this project is to obtain a set of high quality speech synthesizer based on unit selection using a bi-grams model taking into account the particularities of the language. It takes a diacritic Arabic text as input and produces corresponding speech; the output is available as male voice. The evaluation of our TTS system is based on subjective and objective tests. The final evaluation of GArabic TTS system, regarding the intelligibility, naturalness aspects (listening) and the quality (PESQ) is jugged successful.


Figure 1. Recording conditions for the first 4 speakers. 
Figure 2. Anechoic chamber described in [27]. 
Contribution to the Creation of an Arabic Expressive Speech Corpus

September 2015

·

494 Reads

·

7 Citations

Acta Acustica united with Acustica

In this article we present the methodology employed for the design and evaluation of a Basic Arabic Expressive Speech corpus (BAES-DB). The corpus, which has a total length of approximately 150 minutes, is constituted of 13 speakers uttering a set of 10 sentences while simulating 3 emotions (joy, anger and sadness) in addition to a neutral utterance. The 10 sentences have been selected to meet phonetic equilibrium and absence of emotional content criteria, from a corpus of sentences proposed in Boudraa [1]. The corpus was evaluated through various tests of guided categorization, performed through a website. The overall good recognition rate is currently at 83.03%, with sadness being the most well recognized type of expressive speech (90.93%) and joy being the least well recognized (73.87%).


Contribution to the Design of an Expressive Speech Synthesis System for the Arabic Language

September 2015

·

157 Reads

·

1 Citation

Lecture Notes in Computer Science

In this paper we will present a contribution to the design of an expressive speech synthesis system for the Arabic language. The system uses diphone concatenation as the synthesis method for the generation of 10 phonetically balanced sentences in Arabic. Rules for the orthographic-to-phonetic transcription are detailed, as well as the methodology employed for recording the diphone database. The sentences were synthesized with both “neutral” and “sadness” expressions and rated by 10 listeners, and the results of the test are provided.


Usage of a speech constraint for highlighting compensatory strategies developed in production of a second language

September 2014

·

32 Reads

International Journal of Speech Technology

This paper deals with the underlining and the prominent display of the compensatory strategies developed by six speakers which are from different geographic regions when they express themselves in a second language very different from their mother native language. The paradigm that had been used in this study is the elocution speed as a speaking constraint in order to investigate the effects of the flow rate on the acoustical parameters of the speech signal. The analysis is performed in both time and frequency domains. We have obtained very interesting results as we have found out that the used constraint allows a good discrimination between different speakers on the basis of their native language.


Citations (11)


... Notably, performance degradation is less pronounced under car noise compared to grinder noise environments. Lounnas et al. (2022) utilized a hybrid methodology to enhance spoken digit recognition for under-resourced languages in the Maghreb region. They incorporated a dialect identification module into the Automatic Speech Recognition (ASR) system, trained on Moroccan Arabic Dialect (MAD), Algerian Arabic Dialect (AAD), Moroccan Amazigh Dialect (MAD), and Modern Standard Arabic. ...

Reference:

Exploring data augmentation for Amazigh speech recognition with convolutional neural networks
Enhancement of spoken digits recognition for under-resourced languages: case of Algerian and Moroccan dialects

International Journal of Speech Technology

... While these tasks are interrelated-emotion can influence speech patterns differently for males and females, and gender-specific vocal characteristics can impact emotion perception-most existing approaches fail to exploit these relationships [6]. Furthermore, the scarcity of high-quality Arabic-language datasets limits progress in this field, leaving critical gaps in developing culturally and linguistically diverse solutions [7]. ...

Emotion recognition in Arabic speech
  • Citing Conference Paper
  • November 2019

... The recognition and analysis of fricatives in speech are important subjects with numerous applications, particularly in automatic speech recognition [5] and synthesis [6]. Many research works have been inspired by this complexity, including speaker recognition [7,8], telephonic speech [8][9][10], hearing aids for perception enhancement [11][12][13], speech therapy [14][15][16], Speech disorders [17][18][19], as well as the difficulty of non-native speakers' pronunciation and perception [1,20,21]. ...

Design and Implementation of a Diacritic Arabic Text-To-Speech System
  • Citing Article
  • July 2017

The International Arab Journal of Information Technology

... There has been some work done for Berber automatic language identification, for instance Chelali et al. (2015) created a Berber speaker identification system using some speech signal information as features. Also Halimouche et al. (2014) have used prosodic information to discriminate between affirmative and interrogative sentences in Berber. Both sets of work were done at the speaker level. ...

Detection of questions in Berber language using prosodic features
  • Citing Conference Paper
  • April 2014

... The corpus consists of 100 phonetically balanced sentences with emotionally neutral content. In 2015, L. Demri [40] selected a subset of 10 sentences from this database to create an expressive speech database. These sentences were recorded from 13 naive speakers. ...

Contribution to the Creation of an Arabic Expressive Speech Corpus

Acta Acustica united with Acustica

... Categorial estimation (CE) tests, preference test [102], [108], [117], DMOS test [108], and DRT tests [87], [96], [116] have also been used as subjective evaluation tests in some studies. Most studies measured intelligibility [91], [94], [95], [96], [99], [103], [104], [105], [61], [106], [109], [110], [112], [115], [116], [118], [120] and naturalness [94], [95], [99], [101], [103], [104], [105], [108], [109], [112], [115], [116], [118] whereas others measured pronunciation [95], [109], sound quality [95], [109], [111], prosody [91], nasality [87], graveness [87], compactness [87], clearness [109], emphasis [87], sibilation [87], features average/system [87], sustention [87], global acceptability [89], global quality [106], [108], [110], [117], [119], and system quality evaluation. While the MOS tests were based on listening tests, the number of listeners varied in each study. ...

Contribution to the Design of an Expressive Speech Synthesis System for the Arabic Language

Lecture Notes in Computer Science

... In other words, in a sufficiently large corpus, it is to include as many phonetic combinations as possible, including intra-and inter-syllabic structures. The unit types to be considered are the rarest allophone categories in priority [13], "sandwich" units [14], a balance of tri-phone, syllable, and morpheme elements [15], and di-phones [6,16]. Each possible unit should have at least one instance in words corpus. ...

Formantic Analysis of Speech Signal by Wavelet Transform
  • Citing Article
  • July 2011

... There are three categories of the Arabic: classical, Modern Standard Arabic (MSA), and Colloquial. Modern Standard Arabic (MSA) has unique attributes and characteristics which are include; Arabic sound system, namely, geminate, emphatic, uvular and pharyngeal constants and vowel duration [1]. Classical Arabic is an elder language comparing with MSA. ...

Analysis and Recognition of Arabic Consonants Using Locus Equations

Lecture Notes on Information Theory

... Dans un article récent, nous avons présenté un modèle de simulation des cordes vocales, connu sous la dénomination de modèle à deux masses [2]. Le schéma du modèle est donné sur la figure 1. Dans l'analogie habituelle mécanique électrique, la glotte peut être représentée par un circuit électrique composé de résistances et d'inductances. ...

Simulation dun modle de la source vocale et dtermination des paramtres de commande