Boris Doval

Boris Doval
Sorbonne Université | UPMC · Institut Jean Le Rond d'Alembert - équipe LAM Lutheries, acoustique, musique

About

58
Publications
12,104
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
1,274
Citations
Citations since 2016
6 Research Items
369 Citations
2016201720182019202020212022010203040506070
2016201720182019202020212022010203040506070
2016201720182019202020212022010203040506070
2016201720182019202020212022010203040506070

Publications

Publications (58)
Article
This paper presents Voks, a new family of digital instruments that allow for real-time control and modification of pre-recorded voice signal samples. An instrument based on Voks is made of Voks itself, the synthesis software and a given set of chironomic (hand-driven) interfaces. Rhythm can be accurately controlled thanks to a new methodology, base...
Conference Paper
Full-text available
T-Voks is an augmented theremin that controls Voks, a performative singing synthesizer. Originally developed for control with a graphic tablet interface, Voks allows for real-time pitch and time scaling, vocal effort modification and syllable sequencing for pre-recorded voice utterances. For T-Voks the theremin’s frequency antenna modifies the outp...
Article
Full-text available
Cantor Digitalis is a performative singing synthesizer that is composed of two main parts: a chironomic control interface and a parametric voice synthesizer. The control interface is based on a pen/touch graphic tablet equipped with a template representing vocalic and melodic spaces. Hand and pen positions, pen pressure, and a graphical user interf...
Article
The Yezidis are a Kurmanji-speaking (northern Kurdish) religious minority living scattered across northern Iraq, Syria, the Caucasus (Armenia and Georgia), and western Europe. Among the Yezidis of Armenia, “speech” ( axavtin ), “song” ( stran ; pl. stranen ), and “words about” ( kilamě ser , pl. kilamen ser ) constitute the three main categories of...
Conference Paper
Full-text available
This paper presents some pitch features for the characterization of intermediate vocal productions from the CNRS - Museée de l’Homme sound archives, in the context of the DIADEMS inter- disciplinary project gathering researchers from ethnomusicology and speech signal processing. Different categories – chanting, singing, recitation, storytelling, ta...
Article
Full-text available
This paper addresses the source-filter separation problem in the context of causal/anticausal linear filter model of voice production. An algorithm based on standard signal processing tools is proposed for the class of quasi-periodic signals (voiced sounds with quasi-stationary pitch). At first, a one-period frame of an equivalent stationary infini...
Conference Paper
Full-text available
A real-time and gesture controlled voice synthesis software is applied to edutainment in the field of voice pedagogy. The main goals are teaching how voice works and what makes the differences between voices in an interactive, real-time and audio-visual perspective. The project is based on ”Cantor Digitalis”, a singing vowel digital instrument, fea...
Article
Full-text available
Many methods have been considered for human larynx imaging and especially vocal-folds monitoring. A widely-used non-invasive technology for vocal-folds monitoring is elec-troglottography (EGG), a technology based on impedance measurement of the larynx. Electri-cal impedance tomography (EIT) is another non-invasive electrical measurement method whic...
Article
The impact that vowels have on the variation across voice range profiles (VRPs) is studied along with three factors: fundamental frequency, singer's gender, and laryngeal vibratory mechanism used to sing. VRPs of 21 advanced singers were established by recording, in separate takes, vocal productions using laryngeal vibratory mechanisms M1 and M2 fo...
Conference Paper
Full-text available
Cet article décrit le projet de recherche OrJo 2009-2012 (Orchestre de Joysticks) qui associe quatre structures, PUCE MUSE, le LAM (UPMC), le LIMSI (CNRS, associé à l’UPMC et à l'Université Paris-Sud), et 3Dlized, autour de quatre grands objectifs : 1. réaliser quatre versions du logiciel plateforme pour s'adapter aux différents usages, 2. proposer...
Conference Paper
Full-text available
Le découplage entre le geste de l’instrumentiste et le son produit par son instrument dans les musiques électroniques a entrainé une reconsidération totale de la notion d’instrument et de nouvelles pratiques pour la facture d'instruments numériques. Pour concevoir des instruments numériques riches, le facteur est souvent amené à réaliser des connex...
Article
Full-text available
Mapping is one of the most important aspects of software instruments design. We call "mapping" the relation defined between the parameters from hardware interaction devices, and those of the process to be controlled. For software instruments, this relation between the user's gestures and synthesis engine parameters has a decisive role in resulting...
Conference Paper
This paper presents a method for the joint estimation of the open quotient and the asymmetry quotient of the open phase of the glottal flow on speech. An algorithm based on a source/filter decomposition (the Zeros of the Z Transform - ZZT) is presented. This algorithm is first tested on a database of sustained vowels spoken at different voice quali...
Article
Full-text available
This paper explores how the laryngeal source is adjusted by the singer when the sung vowel changes. As the source parameter adjustments may depend on intensity and pitch, comparisons are done through phonetograms computed for each vowel. Distinction is made in the phonetograms between the two main laryngeal mechanisms (M1 and M2). Male and female s...
Conference Paper
Full-text available
Voice source analysis is an important but difficult issue for speech processing. In this talk, three aspects of voice source analysis recently developed at LIMSI (Orsay, France) and FPMs (Mons, Belgium) are discussed. In a first part, time domain and spectral domain modelling of glottal flow signals are presented. It is shown that the glottal flow...
Article
This paper deals with the vocal quality differences observed between two sounds produced at the same pitch and SPL in the two different laryngeal mechanisms M1 and M2. Glottal parameters such as spectral tilt would be relevant to describe these differences but inverse filtering is problematic for high pitch sounds. Instead, the energy distribution...
Article
Full-text available
In this paper, we describe a full computer-based musical instru-ment allowing realtime synthesis of expressive singing voice. The expression results from the continuous action of an inter-preter through a gestural control interface. In this context, ex-pressive features of voice are discussed. New real-time imple-mentations of a spectral model of g...
Article
Full-text available
In this paper, we describe a full computer-based musical instrument allowing realtime synthesis of expressive singing voice. In this paper, we describe a full computer-based musical instrument allowing realtime synthesis of expressive singing voice. The expression results from the continuous action of an interpreter through a gestural control inter...
Article
Full-text available
A unified description of the most-common glottal-flow models (KLGLOTT88, Rosenberg C, R++, LF) is proposed in the time domain, using a set of five generic glottal-flow parameters: fundamental period, maximum excitation, open quotient, asymmetry coefficient, and return-phase quotient. A unified set of time-domain equations is derived, and their anal...
Conference Paper
Full-text available
In this paper, a new voice source model for real-time gesture–controlled voice synthesis is described. The syn-thesizer is based on a causal-anticausal model of the voice source, a new approach giving accurate control of voice source dimensions like tenseness and effort. Aperiodic com-ponents are also considered, resulting in an elaborate mo-del su...
Article
Full-text available
We propose a new spectral representation called the zeros of z-transform (ZZT), which is an all-zero representation of the z-transform of the signal. We show that separate patterns exist in ZZT representations of speech signals for the glottal flow and the vocal tract contributions. A decomposition method for source-tract separation is presented ba...
Article
Full-text available
This article presents the results of glottal open-quotient measurements in the case of singing voice production. It explores the relationship between open quotient and laryngeal mechanisms, vocal intensity, and fundamental frequency. The audio and electroglottographic signals of 18 classically trained male and female singers were recorded and analy...
Article
Full-text available
Electroglottography is a common method for providing noninvasive measurements of glottal activity. The derivative of the electroglottographic signal, however, has not attracted much attention, although it yields reliable indicators of glottal closing instants. The purpose of this paper is to provide a guide to the usefulness of this signal. The mai...
Conference Paper
Full-text available
This study proposes a new spectral decomposition method for source-tract separation. It is based on a new spectral representation called the Zeros of Z-Transform (ZZT), which is an all-zero representation of the z-transform of the signal. We show that separate patterns exist in ZZT representations of speech signals for the glottal flow and the voca...
Conference Paper
Full-text available
This study presents an improved version of our previously introduced formant tracking algorithm. The algorithm is based on processing the negative derivative of the argument of the chirp-z transform (termed as the differential phase spectrum) of a given speech signal. No modeling is included in the procedure but only peak picking on differential ph...
Article
Full-text available
Les chanteurs de yodel utilisent les voyelles pour favoriser le passage de mécanisme laryngé. Les chanteurs lyriques ne gèrent pas la zone de passage de la même façon sur différentes voyelles. Quel lien entre voyelle et vibration laryngée ? Quelles conséquences pour le chanteur ? Les mécanismes laryngés Fig 2 -Exemple de glissando ascendant et desc...
Article
This study aims to explore the perceptual relevance of the variations of glottal flow parameters and to what extent a small variation can be detected. Just Noticeable Differences (JNDs) have been measured for three values of open quotient (0.4, 0.6, and 0.8) and two values of asymmetry coefficient (2/3 and 0.8), and the effect of changes of vowel,...
Article
Full-text available
A new type of glottal flow model, namely a causal- anticausal linear filter model, is proposed. It is shown that the glottal flow signal can be considered as the impulse re- sponse of a linear filter. Then the source/filter speech model can be interpreted as an excitation/filter speech model, the "fil- ter" comprising the glottal flow, vocal tract...
Article
Full-text available
Most of the main time-domain glottal flow models proposed in the literature can ultimately be described in a unified manner by a set of five parameters: the fundamental frequency f 0 , the amplitude of voicing A v , the open quotient O q , the asymmetry coefficient α m and the return phase quotient Q a . Time and frequency-domain analytical formula...
Article
A new method for the estimation of the voice open quotient is presented. Assuming abrupt glottal closures, the glottal ow waveform is considered as the impulse response of an anticausal two-poles lter. It is dened by four parameters : T 0 , A v , O q and m . The last three ones are estimated by a second-order linear prediction of the inverse ltered...
Article
) Nous pr'esentons `a cet atelier de recherche nos travaux r'ecents sur la mod'elisation du signal de d'ebit glottique. Un mod`ele du signal comprenant une composante p'eriodique (li'ee la vibration quasip'eriodique des cordes vocales) et une composante ap'eriodique (li'ee aux bruit d'aspiration, de frication, aux irr 'egularit'es de la vibration d...
Article
Several glottal flow models have been proposed for speech analysis and synthesis (e.g., LF, Rosenberg, R++, and Klatt). All these models do not use the same number of parameters, or the same name for similar parameters, and it appears difficult to compare their merits. Then, a unified framework for studying the time and frequency domain properties...
Article
A spectral approach is proposed for voice source parameters representation and estimation. Parameter estimation is based on decomposition of the periodic and the aperiodic components of the speech signal, and on spectral modelling of the periodic component. The paper focusses on parameters estimation for the periodic component of the glottal flow....
Conference Paper
Full-text available
This paper deals with the spectral representation of the glottal flow. The LF and the KLGLOTT88 models of the glottal flow are studied. We compute analytically the spectrum of the LF-model. Then, formulas are given for computing spectral tilt and amplitudes of the first harmonics as functions of the LF-model parameters. We consider the spectrum of...
Article
A new approach is presented for the estimation and tracking of the fundamental frequency (f0) of pseudoperiodic signals. It is based on a probabilistic model of pseudoperiodic signals that makes it possible to take prior knowledge into account and to include constraints on the evolution of the signal. The resulting method can operate on a large int...
Article
In many physical models of musical instruments, the instrument itself is represented by a passive linear system coupled to a nonlinear dynamical system (NDS) [M. E. McIntyre et al., ??On the Oscillations of Musical Instruments,?? J. Acoust. Soc. Am. 74, 1325?1345 (1983)]. Modeling the excitation process is one of the key points for music synthesis...
Article
This presentation deals with the estimation of fundamental frequency (f 0) of pseudoperiodic sound signals with important results for polyphonic frequency tracking, and voice separation. Given a set of candidate partials in the signal, the estimation of f 0 is taken in the sense of finding the optimal period duration(s) according to a criterion of...
Conference Paper
In order to estimate the fundamental frequency (fO) of pseudoperiodical sounds with a wide band of possible fO, a theoretical model based on a maximum likelihood for fO is proposed. The model is simplified to make it fast enough for extensive tests. The resulting algorithm is tested on musical speech sounds. As a musical application, an instrument...
Article
Full-text available
This study discusses the difficulties of phase spectrum analysis of speech signals and shows that appropriate windowing is very crucial for obtaining reliable phase spectra. The main difficulties of phase based analysis stem from the domination of spiky effects of roots (zeros) of the signal z-transform close to the unit circle. We show how this pr...
Article
Full-text available
Three real-time gesture controlled vocal instruments are presented. They are based on a time domain (LF) and a spectral domain (CALM) model of the glottal pulse signal. Gestural control is able to add expression to the synthetic voices, enabling simulation of various vocal behaviors. Expressive vocal instruments are demonstrated for musical and res...
Article
Full-text available
This paper presents new achievements done with our Cal-liphony software [1, 2] which allows for real-time modification of the intonation and rhythm of speech, driven by manual con-trol of both fundamental frequency and time scaling either inde-pendantly or conjointly. We will present here the main features of the software and discuss possible use f...
Article
Full-text available
A new method for estimation of the voice speed quotient (S q) from acoustic signals is presented. The method is based on source filter decomposition us-ing a new signal representation, the Zeros of Z Trans-form representation. A source dominated spectrum is obtained using the ZZT decomposition, and then the glottal for-mant frequency is estimated....

Network

Cited By

Projects

Projects (3)