Conference PaperPDF Available

Audibility of Initial Pitch Glides in String Instrument Sounds

Authors:

Abstract

Listening experiments were made to measure the detection thresholds for initial pitch glides in string instrument sounds, where a rapid decline of pitch is caused by tension modulation during the attack. Realistic sounding synthetic tones were generated by additive synthesis. The frequency decay of the glide was defined through the overall decay rate of am-plitude, simulating the behavior of real instruments. It was found that on the ERB frequency scale, the thresholds re-mained roughly constant at approximately 0.1 ERB with vary-ing fundamental frequency. Thus, any pitch glide weaker than the given threshold remains inaudible for most listeners and could be left unimplemented in digital sound synthesis.
Audibility of Initial Pitch Glides in String Instrument Sounds
Hanna J¨arvel¨ainen, Vesa V¨alim¨aki
Laboratory of Acoustics and Audio Signal Processing, Helsinki University of Technology
email: hanna.jarvelainen@hut.fi
Abstract
Listening experiments were made to measure the detection
thresholds for initial pitch glides in string instrument sounds,
where a rapid decline of pitch is caused by tension modu-
lation during the attack. Realistic sounding synthetic tones
were generated by additive synthesis. The frequency decay of
the glide was defined through the overall decay rate of am-
plitude, simulating the behavior of real instruments. It was
found that on the ERB frequency scale, the thresholds re-
mained roughly constant at approximately 0.1 ERB with vary-
ing fundamental frequency. Thus, any pitch glide weaker than
the given threshold remains inaudible for most listeners and
could be left unimplemented in digital sound synthesis.
1 Introduction
High-quality sound synthesis is possible with the modern
synthesis methods, such as physical modeling (Smith 1998)
and sinusoidal modeling (Serra and Smith 1990). However,
implementing all details of the sound is computationally costly.
It would be desirable to leave such features, whose effects are
not perceived by the listener, unimplemented.
A rapid descent of pitch during the attack is character-
istic to many plucked and struck string instruments in forte
playing. It can be detected for instance in the clavichord
(V¨alim¨aki et al. 2000), the guitar (Tolonen et al. 2000), and
the kantele – a traditional Finnish string instrument (V ¨alim¨aki
et al. 1999). The primary cause of the pitch descent is the
varying string tension as a consequence of finite string dis-
placement after plucking or striking the string (Legge and
Fletcher 1984). In the clavichord, the effect is boosted by
the mechanical aftertouch. The string tension can be directly
controlled by the player through key pressure.
Fig. 1 shows a fundamental frequency estimate obtained
from a recorded electric guitar tone by the autocorrelation
method (Tolonen et al. 2000). The estimate decreases
exponentially with time from 499 to 496 Hz, giving a glide
extent of approximately 3 Hz.
The tension modulation can be implemented in physical
modeling, for instance, by a special filter structure with signal-
dependent fractional delay elements (Tolonen et al. 2000),
(V¨alim¨aki et al. 1998), but ignoring it would bring remark-
able computational savings.
The detection and discrimination of frequency glides has
been previously studied from a more theoretical viewpoint.
Still the underlying mechanism remains unclear. It was sug-
gested by Madden and Fire (1997) that the detection is based
on changes in the low-frequency side of the excitation pattern.
Moore and Sek (1998) argued that at least both sides of the
excitation pattern should be compared, and that for low cen-
ter frequencies the time-related cues, such as phase locking
could have an effect as well. The studies agree that the detec-
tion and discrimination of glides is little affected by duration,
center frequency, or direction (up or down). However, the
previous results are of little help for the synthesis of instru-
ment tones, since the range of center frequencies was from
0.5 to 6.0 kHz and the shape of the glide as a function of time
was unnatural to real instruments.
The objective of this study is to set perceptually motivated
guidelines for the need to implement the initial pitch glide in
string instrument synthesis. The test tones were synthesized
and the pitch glides were defined in a way typical of string
instruments. The results of two listening experiments are re-
ported.
0 0.2 0.4 0.6 0.8 1
−1
0
1
Level
0 0.2 0.4 0.6 0.8 1
496
497
498
499
F0 (Hz)
Time (s)
Figure 1: Waveform of a single tone played on the electric
guitar (top) and its short-time fundamental frequency esti-
mate, which shows a typical descent (bottom).
Reprinted from Proceedings of the International Computer Music Conference. Havana, Cuba, 17-23 September 2001, pages 282-285.
2 Listening tests
The detection thresholds for initial pitch glides were mea-
sured in two separate listening experiments. In both experi-
ments, the independent variable was the transition span ,
i.e., the extent of the pitch glide; in addition, fundamental fre-
quency was used as a parameter in experiment I and decay
time constant in experiment II.
2.1 Test sounds
The test sounds were generated using additive synthesis
for easy control of the pitch glides. Each tone consisted of
its 20 lowest harmonics and had a duration of 2.0 seconds.
An exception was made in the highest tone (659 Hz), for
which only 16 harmonics could be generated before meeting
the Nyquist limit of 11 kHz at the sampling rate of 22.05 kHz.
Realistic sounding test tones were created by copying the
initial amplitudes and decay characteristics of real tones. The
acoustic guitar was taken as a reference, for its behavior is
well-known and plenty of analysis data are available. The ini-
tial phases of the harmonics were randomized. The amplitude
decay was controlled by two parameters, the time constant
of the overall decay and a frequency-dependent damping co-
efficient , which corresponds to the feedback coefficient of
the one-pole loop filter in a digital waveguide string model.
The initial amplitudes as well as the decay parameters were
chosen according to (V¨alim¨aki and Tolonen 1998).
The independent variable was the extent of the pitch glide
in Hz. The pitch contour decreased exponentially with time
from the highest value towards the steady state fun-
damental frequency . The time constant of the frequency
descent was 50% of the overalltime constant of amplitude
decay (Legge and Fletcher 1984).
In experiment I, the thresholds were measured at four dif-
ferent fundamental frequencies to cover the whole pitch range
of the acoustic guitar: (659.26 Hz), (349.23 Hz),
(196 Hz), and (116.5 Hz). The overall decay time con-
stant and frequency-dependent damping coefficient were
kept constant.
The motivation forthe second experiment is the known re-
lation of the time constant of amplitude decay and time con-
stant of pitch decay in string instrument tones (Legge and
Fletcher 1984). The perceptual tolerances for amplitude de-
cay have been published previously(Tolonen and J¨arvel¨ainen
2000), giving the allowable deviation of the time constant
from the reference value. However, varying the time con-
stant even withinthis tolerance affects the time constant of the
pitch decay and hence the duration of the pitch glide, which
can cause a shift in the detection threshold. This was studied
by fixing the fundamental frequency to 196 Hz and varying
the time constant . Three values were used for the time
constant according to the previous results on the perceptual
tolerances: 100%, 80%, and 60% of the reference value 0.39
0 0.5 1 1.5 2
−1
−0.5
0
0.5
1
Level
0 0.5 1 1.5 2
348
350
352
354
F0 (Hz)
Time (s)
Figure 2: Waveform of a synthetic guitar tone (top) and
its fundamental frequency with a 5-Hz pitch glide (bottom).
s, which is 50% of the measured time constant of the overall
amplitude decay (0.77 s).
2.2 Subjects and test method
Five subjects participated in both experiments. They were
20-30 years old, and all had previous experience in psychoa-
coustic listening tests. None of them reported any hearing de-
fects, and they were allowed to practise before the test. The
sound samples were played through headphones from a com-
puter.
A standard tone without a pitch glide and a stimulus tone
with a pitch glide were presented to the subject sequentially
in random order, and the task was to judge whether the sounds
were same or different. Five values of glide extent were
used for each fundamental frequency in experiment I and four
for each time constant in experiment II. Each trial was judged
four times together with as many corresponding fake trials
(two standard tones, no stimulus) in random order.
A detected difference was either a hit or a false alarm,
depending on whether the trial actually included the stimulus
tone or not. A measure of correct answers was derived
for each condition from the proportion of hits and false alarms
as follows (Yost 1994):
(1)
The function has values between 0.50, which corresponds to
chance level with equal proportions of hits and false alarms,
and 1.0, which requires 100% hit proportion and no false
alarms. The detection threshold was estimated by finding the
midpoint (i.e., the 75% point) of this function. If the thresh-
old was not directly evident in the data, it was interpolated
between the nearest higher and lower scores.
3 Results
3.1 Effect of fundamental frequency
The effect of fundamental frequency on the detection of
pitch glides was studied in experiment I. The detection thresh-
olds increasedmonotonically withfundamental frequency. The
mean thresholds were 3.1 Hz, 4.4 Hz, 5.4 Hz, and 11.7 Hz for
, , , and , respectively. The situation turns upside
down when the thresholds are expressed on the logarithmic
scale, see Fig. 3. For , the median of individual thresh-
olds is 52 cents (1/100 of a semitone) – more than half of a
semitone, while for the highest tone it is 30 cents.
116.5 196 349 659
20
40
60
80
100
f at threshold (cents)
Fundamental frequency (Hz)
Figure 3: Listening test results with fundamental frequency
as a parameter. Boxplot of (cents) at threshold.
The results were roughly normally distributed,but the er-
ror variance between different tones was typically unequal.
However, these differences were reasonably equalized by a
transformation from the linear frequency scale to the audi-
torily motivated ERB (Equivalent Rectangular Bandwidth)
scale as follows (Glasberg and Moore 1990):
(2)
where is frequency in kHz. The analysis of variance (ANO-
VA) was now performed on the data presented on the ERB
scale (Lehman 1991). The result was nonsignificant (
), indicating that the threshold could well be the same for
each . The results are summarized in Fig. 4. The figure
presents a boxplot of the results with themedian and the 75%
and 25% quartiles. The mean thresholds are between 0.083
and 0.122 ERB.
Since a true difference in the mean thresholds for each
note is relatively unlikely, the data were collapsed across fun-
damental frequency. The sample mean across all subjects and
fundamental frequencies is 0.10 ERB, which could be consid-
ered an estimate of the constant detection threshold of pitch
glides within the pitch range of the acoustic guitar.
3.2 Effect of decay rate
The listening test data from experiment II were processed
in the same way as the data from the first experiment. Again
116.5 196 349 659
0.05
0.1
0.15
f at threshold (ERB)
Fundamental frequency (Hz)
Figure 4: Listening test results with fundamental frequency
as a parameter. Boxplot of (ERB) at threshold.
the ANOVA revealed no significant differences among the
means of the different conditions ( ). This suggests
that relatively great variations in the time constant of the pitch
glide have no effect on detecting the glide. The results are
shown in Fig. 5. The boxplot is shown in the top figure and
the mean thresholds on the ERB scale in the bottom figure.
Of course, it is likely that the glide duration would show
some effect if the glides were short enough. Such behavior
would be unnatural to string instruments, but it is an interest-
ing task to extend this study to synthetic sounds with greater
variation in the decay characteristics. A third, rather infor-
mal test was conducted to study extremely fast and slowly
decaying sounds. The time constant was now varied with
five linear steps between 20% and 180% of the original value
of 0.39 s, i.e., between 0.08 s and 0.7 s. The procedure was
the same as before, but only three subjects participated. The
shortest caused a considerable increase in the thresholds
while the other thresholds were similar to the previousresults.
The mean threshold for = 0.08 s was 10.8 Hz. For the four
greater values of the mean thresholds were between 5.6 Hz
and 7.7 Hz.
4 Conclusions and future work
The thresholds for detecting initial pitch glides in string
instrument tones were measured. It was found that when
expressed on the ERB scale, the thresholds remain roughly
constant with fundamental frequency within the pitch range
of the acoustic guitar. Furthermore, no significant effect was
detected for glide duration, which was controlled through the
time constant of the pitch decay. This suggests that a constant
threshold of 0.1 ERB could be proposed for string instrument
sounds at least within the studied range. Thus any pitch glide
weaker than this could be left unimplemented.
For instance, the electric guitar tone in Fig. 1 exhibits an
initial pitch glide of 3 Hz at a fundamental frequency of 466
Hz, i.e., about 0.06 ERB. Thus it would probably remain in-
audible to most listeners. On the other hand, a kantele tone
presented in (V¨alim¨aki et al. 1999) shows a pitch glide of
Figure 5: Listening test results with decay time constant as a
parameter. Top: Boxplot with the median and 75% and 25%
quartiles, bottom: mean thresholds on the ERB scale.
almost 0.3 ERB, which should be clearly audible.
Although the effect of glide duration was insignificant
in the range typical of string instruments, the thresholds in-
creased for very fast decaying sounds. For these short sounds,
there was less time to listen to the steady pitch . This sug-
gests that the subjects were using end point detection, i.e.,
comparing the absolute frequencies at both ends of the glide,
to detect the glides. Both Madden and Fire (1997) and Moore
and Sek (1998) prevented their subjects from using such cues,
which might partly explain why the thresholds from this study
are lower than what they measured. On the other hand, in mu-
sical context the expectance of a certain pitch may well con-
nect the detection of a glide to absolute frequency,and this is
why our study enabled the end point cues.
The contradiction between glide detection for short and
long tones may also concern real instrument sounds, not only
synthetic sounds with unrealistic decay characteristics. Cut-
ing off the steady part of the tone might have a similar ef-
fect on the detection threshold as shortening the decay time
showed in this study. This calls for some more experiments.
The results from this study can be applied in digital sound
synthesis, where computational savings can be achieved by
ignoring the pitch glides whenever they are inaudible. Coding
is another field of application. The new structured methods of
sound representation (Vercoe et al. 1998) make it desirable to
control the perceptual features of sounds separately.
Acknowledgments
This work was supported by the Pythagoras graduate school,
Nokia Research Center, and the Academy of Finland.
References
Glasberg, B. and B. Moore (1990). Derivation of auditory filter
shapes from notched-noise data. Hearing Res. 47, 103–138.
Legge, K. and N. Fletcher (1984). Nonlinear generation of miss-
ing modes on a vibrating string. J. Acoust. Soc. Am. 76(1),
5–12.
Lehman, R. S. (1991). Statistics and Research Design in the Be-
havioral Sciences. Belmont, California: Wadsworth Publish-
ing Company.
Madden, J. and K. Fire (1997). Detection and discrimina-
tion of frequency glides as a function of direction, dura-
tion, frequency span, and center frequency. J. Acoust. Soc.
Am. 102(5), 2920–2924.
Moore, B. C. and A. Sek (1998). Discrimination of frequency
glides with superimposed random glides in level. J. Acoust.
Soc. Am. 104(1), 411–421.
Serra, X. and J. Smith (1990). Spectral modeling synthesis: a
sound analysis/synthesis system based on a deterministic
plus stochastic decomposition. Computer Music J. 14(4), 12–
24.
Smith, J. O. (1998). Principles of digital waveguide models
of musical instruments. In M. Kahrs and K. Brandenburg
(Eds.), Applications of Digital Signal Processing to Audio
and Acoustics, Chapter 10, pp. 417–466. Kluwer.
Tolonen, T. and H. J¨arvel¨ainen (2000). Perceptual study of decay
parameters in plucked string synthesis. Preprint 5205, 109th
Conv. Audio Eng. Soc., Los Angeles, California.
Tolonen, T., V. V¨alim¨aki, and M. Karjalainen (2000). Modeling
of tension modulation nonlinearity in plucked strings. IEEE
Trans. Speech and Audio Processing 8(3), 300–310.
alim¨aki, V., M. Karjalainen, T. Tolonen, and C. Erkut
(1999). Nonlinear modeling and synthesis of the kan-
tele – a traditional Finnish string instrument. In Proc.
Int. Computer Music Conf., Beijing, China, pp. 220–
223. Full paper and sound examples are available at
http://www.acoustics.hut.fi/publications/.
alim¨aki, V., M. Laurson, C. Erkut, and T. Tolonen
(2000). Model-based synthesis of the clavichord. In Proc.
Int. Computer Music Conf., Berlin, Germany, pp. 50–
53. Full paper and sound examples are available at
http://www.acoustics.hut.fi/publications/.
alim¨aki, V. and T. Tolonen (1998). Development and calibration
of a guitar synthesizer. J. Audio Eng. Soc. 46(9), 766–778.
alim¨aki, V., T. Tolonen, and M. Karjalainen (1998).
Signal-dependent nonlinearities for physical models us-
ing time-varying fractional delay filters. In Proc. Int.
Computer Music Conf., Ann Arbor, Michigan, pp. 264–
267. Full paper and sound examples are available at
http://www.acoustics.hut.fi/publications/.
Vercoe, B., W. G. Gardner, and E. D. Scheirer (1998). Structured
audio: Creation, transmission, and rendering of parametric
sound representations. Proc. IEEE 86(5), 922–940.
Yost, W. A. (1994). Fundamentals of Hearing – An Introduction
(3rd ed.). New York: Academic Press.
... glide'' can occur. This effect is most familiar in instruments with metal strings, such as steel-string guitars, and it has been shown to be perceptible under some circumstances [14]. ...
... This is a generalisation of Eq. (3.58) of [25], which gives the corresponding expression without the effects of damping and bending stiffness. This expression gives the shape of the string at each time step, from which the slope 1 ∕ can be calculated numerically, and hence the right-hand side of Eq. (14). The induced longitudinal displacement 3 is then obtained by projecting this excitation onto each longitudinal mode to obtain modal excitation waveforms, then using convolution with the appropriate modal impulse response functions, as described previously [20]. ...
... The shift in frequency is a little over 2%. According to the results of Järveläinen et al. [14], this should be above the threshold of perception. They estimate the threshold at around 0.1 ERB, in terms of the ''equivalent rectangular bandwidth'' measure of the critical bandwidth of the cochlea (see for example Moore [29]). ...
Article
The vibration response of a small harp is studied, by a combination of measurement, theoretical modelling and model-based sound synthesis. The harp’s distinctive angle between strings and soundboard is shown to have little impact on the results of linear modelling, but to have a significant effect on sound through a nonlinear phenomenon. Generation of axial force in the string through geometric nonlinearity is shown to produce very prominent effects, helped by the efficient coupling of longitudinal force to soundboard motion. The detailed pattern of the resulting “phantom partials”, already familiar from work on piano strings, is shown to be sensitive to the exact boundary condition between the string and the soundboard: better agreement between experiment and modelling is obtained by assuming a clamped boundary than a pinned boundary. It is suggested that the relative prominence of nonlinear effects in the harp, compared to other plucked-string instruments using non-metallic strings, may be one reason why harpists still prefer the sound of gut strings over nylon strings.
... The formulas of reference resistance for capacitance and inductance Ä in electrical circuits are: Ê ½ ¾ × and ÊÄ ¾ Ä × (19) where Ê and ÊÄ are the related WDF port resistances and × is the sample rate. However, due to the bilinear mapping between analog angle frequency ¾ and discrete-time angle frequency ª by á µ 3 Ì×µ Ø Ò ª Ì× ¾µ (20) the frequency scale is warped [3] so that while ª × ¾, the corresponding analog angle frequency ½ . Thus the inductances and capacitances in a WDF realization have to be prewarped to get correct modal frequencies and decay times. ...
... ´Òµ represents the excitation force signal applied on the string. [20], illustrated as a dotted line, suggests that the fundamental frequency drifts in both cases are audible. ...
Conference Paper
Full-text available
Physical (or physics-based) modeling of musical instruments is one of the main research fields in computer music. A basic question, with increasing research interest recently, is to understand how different discrete-time modeling paradigms are interrelated and can be combined, whereby wave modeling with wave quan- tities (W-methods) and Kirchhoff quantities (K-methods) can be understood in the same theoretical framework. This paper presents recent results from the HUT Sound Source Modeling group, both in the form of theoretical discussions and by examples of K- vs. W-modeling in sound synthesis of musical instruments.
... In addition to the attenuation of higher partials, Lindroos et al. (2011) also reported that the decay rate of partials changed rapidly for fortissimo plucks, whereas the effect is less dramatic for mezzo-forte and pianissimo plucks. Plucking dynamics also cause a pitch glide at the beginning of the tone, where stronger plucks create a larger pitch glide than softer plucks due to increased tension in the string (Lindroos et al., 2011;Järveläinen and Välimäki, 2001;Lee et al., 2009). ...
Thesis
Full-text available
The main motivation of this thesis is to explore several techniques for estimating electric guitar synthesis parameters to replicate the sound of popular guitarists. Many famous guitar players are recognisable by their distinctive electric guitar tone, and guitar enthusiasts would like to play or obtain their favourite guitarist’s sound on their own guitars. This thesis starts by exploring the possibilities of replicating a target guitar sound, given an input guitar signal, using a digital filter. A preliminary step is taken where a technique is proposed to transform the sound of a pickup into another on the same electric guitar. A least squares estimator is used to obtain the coefficients of a finite impulse response (FIR) filter to transform the sound. The technique yields good results which are supported by a listening test and a spectral distance measure showing that up to 99% of the difference between input and target signals is reduced. The robustness of the filters towards changes in repetitions, plucking positions, dynamics and fret positions are also discussed. A small increase in error was observed for different repetitions; moderate errors arose when the plucking position and dynamic were varied; and there were large errors when the training and test data comprised different notes (fret positions). Secondly, this thesis explored another possible way to replicate the sound of popular guitarists in order to overcome the limitations provided by the first approach. Instead of directly morphing one sound into another, replicating the sound with electric guitar synthesis provides flexibility that requires some parameters. Three approaches to estimate the pickup and plucking positions of an electric guitar are discussed in this thesis which are the Spectral Peaks (SP), Autocorrelation of Spectral Peaks (AC-SP) and Log-correlation of Spectral Peaks (LC-SP) methods. LC-SP produces the best results with faster computation, where the median absolute errors for pickup and plucking position estimates are 1.97 mm and 2.73 mm respectively using single pickup data and the errors increased slightly for mixed pickup data. LC-SP is also shown to be robust towards changes in plucking dynamics and fret positions, where the median absolute errors for pickup and plucking position estimates are less than 4 mm. The Polynomial Regression Spectral Flattening (PRSF) method is introduced to compensate the effects of guitar effects, amplifiers, loudspeakers and microphones. The accuracy of the estimates is then tested on several guitar signal chains, where the median absolute errors for pickup and plucking position estimates range from 2.04 mm to 7.83 mm and 2.98 mm to 27.81 mm respectively.
... Pitch glide is a short-lasting phenomenon where at the beginning of the string vibration the fundamental frequency is slightly larger then the nominal (desired) value. This effect can be clearly audible in some instruments, such as in the Finnish kantele [1]. Generally, this phenomenon is associated with and discussed in context with the nonlinear modulation of stiff string tension [2,3]. ...
Conference Paper
Full-text available
Interactions of a vibrating string with its supports and other spatially distributed barriers play a significant role in the physics of many stringed musical instruments. It is well known that the tone of the string vibrations is determined by the string supports, and that the boundary conditions of the string termination may cause a short-lasting initial fundamental frequency shifting. Generally, this phenomenon is associated with the nonlinear modulation of the stiff string tension. The aim of this paper is to study the initial frequency glide phenomenon that is induced only by the string–barrier interaction, apart from other possible physical causes, and without the interfering effects of dissipation and dispersion. From a numerical simulation perspective, this highly nonlinear problem may present various difficulties, not the least of which is the risk of numerical instability. We propose a numerically stable and a purely kinematic model of the string–barrier interaction, which is based on the travelling wave solution of the ideal string vibration. The model is capable of reproducing the motion of the vibrating string exhibiting the initial fundamental frequency glide, which is caused solely by the complex nonlinear interaction of the string with its termination. The results presented in this paper can expand our knowledge and understanding of the timbre evolution and the physical principles of sound generation of numerous stringed instruments, such as lutes called the tambura, sitar and biwa.
... A method based on windowed autocorrelation analysis [32] was used in order to obtain a good estimate of f 0 histories for the attack and sustain phase of the tone. The analysis shows a slight change in time of the pitch, which, however, is perceptually insignificant, with a variation of at most 1 to 2 cents, while audibility thresholds are usually much higher [33]. ...
Article
Full-text available
The Clavinet is an electromechanical musical instrument produced in the mid-twentieth century. As is the case for other vintage instruments, it is subject to aging and requires great effort to be maintained or restored. This paper reports analyses conducted on a Hohner Clavinet D6 and proposes a computational model to faithfully reproduce the Clavinet sound in real time, from tone generation to the emulation of the electronic components. The string excitation signal model is physically inspired and represents a cheap solution in terms of both computational resources and especially memory requirements (compared, e.g., to sample playback systems). Pickups and amplifier models have been implemented which enhance the natural character of the sound with respect to previous work. A model has been implemented on a real-time software platform, Pure Data, capable of a 10-voice polyphony with low latency on an embedded device. Finally, subjective listening tests conducted using the current model are compared to previous tests showing slightly improved results.
... In this figure, the dash-dotted line approximates the mean value of perceptual detection threshold of an initial pitch glide. The detection threshold in the frequency region of these tones is approximated to be 5.4 Hz[13]. The string model used in this figure had a total delay line length of 55.25 and the allpass coefficient was scaled using a constant value of 0.8 in order to simulate the behavior of the recorded sample. ...
Conference Paper
Full-text available
ABSTRACT Nonlinear behavior of a vibrating string is responsible for acousti- cal features in some plucked-string instruments, resulting in a char- acteristic and easily recognizable tone. That is also the case for the Finnish kantele, a traditional plucked-string instrument used in folk music. Earlier works have analyzed the general acous- tic properties of the kantele and discussed related sound synthesis techniques. In this study, a novel modeling and sound synthesis method,for simulating nonlinear string vibrations with spatially distributed tension modulation,is presented. The modeling,is con- ducted through a Digital Waveguide (DWG) approach, using con- trollable fractional delay elements in implementing,the distributed tension modulation,nonlinearity. The elongation of the vibrating string is estimated and the result is used in tuning the fractional de- lay values accordingly. Because of the spatially distributed nature of the approach, control of the string model parameters and obser- vation of its behavior can be implemented at any point along it, in contrast to prior digital waveguide string models. This new ap- proach is applied in constructing a physical model of a five-string kantele. Real-time sound synthesis is implemented,using an effi- cient, block-based modeling tool, the BlockCompiler.
... The fundamental frequencies of a real kantele recording and the synthesized tone are illustrated inFigure 8.Figure 7Figure 8: Illustration of the fundamental frequency glide in the synthesized tone (solid line) and in a real kantele tone, obtained via measurements (dashed line). The approximated detection threshold of a pitch drift [20], illustrated as a dotted line, suggests that the fundamental frequency drifts in both cases are audible. ...
... In spite of the results described above, determining the perceived loudness of two different instruments is not straightforward and in this case some of the reasons are the following. The amount of audible pitch glide [10] wasFigure 6. Listening test results as bar graphs for (a) strumming, (b) plucking the low strings, and (c) plucking the high strings, respectively . The y-axis indicates the percentage of cumulated answers and x-axis indicates the played dynamic level. ...
Article
Full-text available
The kantele, an ancient plucked string instrument, belongs to the family of zithers, and it is still used in traditional folk music in Finland, Northwest Russia, and the Baltic countries. Design rules for making the kantele louder are proposed and confirming analysis results are presented. A description of the main characteristics of the traditional design of the kantele is given, so that it supports the presentation of the design rules and analysis results. The design rules to make a plucked string instrument louder are (I) increase the string tension, (II) increase the radiation surface, and (III) isolate the top plate from the sound-box with an air gap. Rules (I) and (II) are almost self evident, for musical acousticians. Rule (III) on the other hand is more unique, since it enables a freely vibrating top plate. The design rules are confirmed by comparing the traditional design with the new one, through several methodological aspects. Results from the analysis are drawn from both analytical treatments and acoustical measurements. The effect that the new design has on the instrument's dynamic range and perception is investigated through a playability test with several professional kantele players. Results from two listening tests are presented that support the assumption of an increase in loudness for the new kantele design. Loud plucks of the modified design are, on the average, perceived as 3 dB louder than in the traditional design. The design ideas are also applicable to other string instruments.
... Here, the horizontal dotted line approximates the mean value of perceptual detection threshold of an initial pitch glide. The psychoacoustic detection threshold in the frequency region of these tones is about 5.4 Hz [38] . This shows that the fundamental frequency glide is an audible phenomenon in plucked string instruments such as the kantele even at modest plucking amplitudes, and thus it must be included in a synthesis model if realistic tones are desired. ...
Article
Full-text available
Nonlinearity in the vibration of a string is responsible for interesting acoustical features in many plucked string instruments, resulting in a characteristic and easily recognizable tone. For this reason, synthesis models have to be capable of modeling this nonlinear behavior, when high quality results are desired. This study presents two novel physical modeling algorithms for simulating the tension modulation nonlinearity in vibrating strings in a spatially distributed manner. The first method uses fractional delay filters within a digital waveguide structure, allowing the length of the string to be modulated during run time. The second method uses a nonlinear finite difference approach, where the string state is approximated between sampling instants using similar fractional delay elements, thus allowing run-time modulation of the temporal sampling location. The magnitude of the tension modulation is evaluated from the elongation of the string at every time step in both cases. Simulation results of the two models are presented and compared. Real-time sound synthesis of the kantele, a traditional Finnish plucked-string instrument with strong effect of tension modulation, has been implemented using the nonlinear digital waveguide algorithm.
Article
Full-text available
The clavichord is one of the oldest keyboard instruments, and it is still often used in performances and recordings of Renaissance and Baroque music. The sound of the instrument is pleasant and expressive but quiet. Consequently, the instrument can only be used in intimate performances for small audiences. This is the main reason why the clavichord was replaced by the harpsichord and finally by the modern piano, both of which produce a considerably louder output. Attempts have been made to amplify the sound of the clavichord using a piezoelectric pickup (Burhans 1973). One of our motivations in this research is to give the clavichord a new life in the digital world, where the faint sound level of the instrument can be amplified by simply turning a volume knob. The suggested synthesis model is based on digital waveguide modeling of string instruments (Smith 1992, 1998; Välimäki et al. 1996; Karjalainen, Välimäki, and Tolonen 1998) and uses the principle of commuted waveguide synthesis where the soundbox’s response is incorporated in the excitation signal
Conference Paper
Full-text available
We apply a recently developed nonlinear string synthesis algorithm to modeling and synthesis of the kantele, which is a plucked string instrument used in traditional Finnish music for the past few thou-sand years. The new model combines formerly proposed linear plucked-string models with recent nonlinear extensions. As a result, a dual-polarization nonlinear model is obtained where the differ-ence in the effective string length in two polarizations is accounted for, together with effects caused by the yielding termination of kantele strings and tension modulation. Calibration of model parame-ters is discussed. The model is also applicable to physical modeling of other plucked string instru-ments. Sound examples are available at www.acoustics.hut.fi/~vpv/publications/icmc99.htm.
Article
Full-text available
A listening experiment was conducted to study the audibility of variation of decay parameters in plucked string synthesis. A digital commuted-waveguide-synthesis model was used to generate the test sounds. The decay of each tone was parameterized with an overall and a frequency-dependent decay parameter. Two different fundamental fre-quencies, tone durations, and types of excitation signals were used totalling in eight test sets for both parameters. The results indicate that variations between 25% and 40% in the time constant of decay are inaudible. This suggests that large deviations in de-cay parameters can be allowed from a perceptual viewpoint. The results are applied in model-based audio processing.
Conference Paper
Full-text available
A synthesis model for the clavichord is developed applying the principles of digital waveguide modeling. A commuted waveguide synthesis model is used, where each tone is generated with two coupled string models that are excited with a common excitation signal that is obtained from analysis of recorded clavichord tones. The characteristic knock terminating tones played by the clavichord is reproduced by triggering a sample that is separated from a recorded tone. A simple technique enables the realistic variation of the fundamental frequency by using a control function that can be a constant value plus the impulse response of a low-order digital filter.
Conference Paper
Full-text available
This paper discusses the use of nonlinear filters that are based on linear digital filters with input-signal-dependent coefficients in the context of digital waveguide synthesis models. It is proposed that the practical uses of such filters in music synthesis may be generalized to a time-varying fractional delay (TVFD) filter with signal-dependent coeffi-cients. Three different cases are outlined: 1) simulation of a pressure-dependent wave propagation speed in a wind instrument bore, 2) nonlinear termination of a vibrating string, and 3) a tension-dependent variable speed of trans-verse vibrations on a string. Implementation structures for TVFD filters based on FIR filters are suggested.
Article
The nonlinear transfer of energy among modes of different frequencies on a vibrating string is investigated both theoretically and experimentally. The nonlinearity is associated with the well‐known variation of string tension caused by the vibration modes, but it is essential that at least one of the end supports has finite mechanical admittance if there is to be any mode coupling. If the nonrigid bridge support has zero admittance in a direction parallel to the string, the coupling is of third order in the mode amplitudes. For a more realistic model in which the string changes direction as it passes over a bridge of finite admittance there are additional coupling terms of second order. The first mechanism gives driving terms of frequency 2ω n ±ω m where ω n and ω m are, respectively, the angular frequencies of the nth and mth modes present on the string, while the second mechanism gives driving terms of angular frequencies 2ω n and 2ω m . Analysis shows that modes absent from the initial excitation of the string can be driven to appreciable amplitude by these mechanisms, reaching their maximum amplitude after a time typically of order 0.1 s. Modes that are in nearly harmonic frequency relationship behave simply but coupling of modes that are appreciably inharmonic may give rise to rapid amplitude fluctuations. A simple experiment with a wire deflected by a bridge of elastic cord and plucked so as to eliminate a particular mode from the initial excitation provided general semiquantitative confirmation of the theoretical predictions.
Article
A specific efficient implementation for the synthesis of acoustic guitar tones is described which is based on recent digital waveguide synthesis techniques. Enhanced analysis methods are used for calibrating the synthesis model based on digitized tones. The lowest resonances of the guitar body are implemented with separate parametric second-order resonators that run at a lower sampling rate than the string model.
Article
This book was written primarily as a textbook for students taking their first course in hearing, most likely in a speech and hearing program, psychology program, or neuroscience program. The book attempts to explain normal human hearing with a focus on the peripheral auditory system, coding of the basic attributes of sound, and the perception of simple sounds. . . . The way in which the nervous system codes for the basic attributes of sound and the perceptual consequences of that coding constitute the fundamentals of hearing. (PsycINFO Database Record (c) 2012 APA, all rights reserved)
Article
The book is arranged in four major sections. Part One begins with a discussion of the nature of research and theory in the behavioral sciences. Chapters 2-4 are devoted to research methods, measurement theory and procedures, and ways of controlling and manipulating variables. The underlying theme in Part One is how to ask questions by establishing the setting in which data can be collected. Part Two introduces univariate and bivariate descriptive statistics. These statistics are presented as ways of finding answers, in data, to the questions posed by virtue of the research design. Graphic methods are presented first, and then the usual univariate descriptive statistics are given careful treatment. Correlation and regression are developed as descriptive methods for two variables. When the classical experimental designs are developed in Part Three, they are shown to provide a structured way of posing questions and analyzing data. The presentation of experimental design begins with single-group studies, progresses through single-factor experiments, and concludes with the full treatment of factorial designs. Part Four presents alternative ways of carrying out research. The treatment of categorical data is covered fully here. In addition, research using single subjects, quasi-experiments, case studies, surveys, and archival and small-N research are discussed. (PsycINFO Database Record (c) 2012 APA, all rights reserved)
Chapter
Basic principles of digital waveguide modeling of musical instruments are presented in a tutorial introduction intended for graduate students in electrical engineering with a solid background in signal processing and acoustics. The vibrating string is taken as the principal illustrative example, but the formulation is unified with that for acoustic tubes. Modeling lossy stiff strings using delay lines and relatively low-order digital filters is described. Various choices of wave variables are discussed, including velocity waves, force waves, and root-power waves. Signal scattering at an impedance discontinuity is derived for an arbitrary number of waveguides intersecting at a junction. Various computational forms are discussed, including the Kelly-Lochbaum, one-multiply, and normalized scattering junctions. A relatively new three-multiply normalized scattering junction is derived using a two-multiply transformer to normalize a one-multiply scattering junction. Conditions for strict passivity of the model are discussed. Use of commutativity of linear, time-invariant elements to greatly reduce computational cost is described. Applications are summarized, and models of the clarinet and bowed-string are described in some detail. The reed-bore and bow-string interactions are modeled as nonlinear scattering junctions attached to the bore/string acoustic waveguide.
Article
Spectral modeling synthesis is an analysis-based technique capable of capturing the perceptual characteristics of a wide variety of sounds. The representation that results from the analysis is intuitive and is easily mapped to useful musical parameters. The analysis part is central to the system. It is a complex algorithm that requires the manual setting of a few control parameters. Further work may automate the analysis process, particularly if there is a specialization for a group of sounds. Some aspects of the analysis are also open to further research, in particular the peak-continuation algorithm. The synthesis from the deterministic plus stochastic representation is simple and can be performed in real time with current technology. A real-time implementation of this system would allow the use of this technnique in performance.