Conference PaperPDF Available

Audibility of Inharmonicity in String Instrument Sounds, and Implications to Digital Sound Synthesis

Authors:

Abstract and Figures

Listening tests were conducted in order to find out the audibility of inharmonicity in musical sounds produced by string instruments, such as the piano or the gui-tar. The threshold of the audibility of inharmonicity was measured as a function of the inharmonicity coef-ficient at five fundamental frequencies. It was found that the detection of inharmonicity is strongly depen-dent on the fundamental frequency. A simple model is presented for estimating the threshold as a function of the fundamental frequency. The need to implement inharmonicity in digital sound synthesis is discussed.
Content may be subject to copyright.
AUDIBILITY OF INHARMONICITY IN STRING INSTRUMENT SOUNDS, AND
IMPLICATIONS TO DIGITAL SOUND SYNTHESIS
Hanna Järveläinen, Vesa Välimäki, and Matti Karjalainen
Helsinki University of Technology
Laboratory of Acoustics and Audio Signal Processing
P.O.Box 3000, FIN-02015 HUT, Finland
hanna.jarvelainen@hut.fi, vesa.valimaki@hut.fi
matti.karjalainen@hut.fi
http://www.acoustics.hut.fi
ABSTRACT
Listening tests were conducted in order to find out the
audibility of inharmonicity in musical sounds produced
by string instruments, such as the piano or the gui-
tar. The threshold of the audibility of inharmonicity
was measured as a function of the inharmonicity coef-
ficient at five fundamental frequencies. It was found
that the detection of inharmonicity is strongly depen-
dent on the fundamental frequency. A simple model
is presented for estimating the threshold as a function
of the fundamental frequency. The need to implement
inharmonicity in digital sound synthesis is discussed.
1. INTRODUCTION
The frequencies of the partials of string instrument
sounds are not exactly harmonic. This is due to stiff-
ness of real strings, which contributes to the restoring
force of string displacement together with string ten-
sion. The stretching of the partials can be calculated
in the following way [1]:
(1)
(2)
where is the partial number, is Young’s modulus,
is the diameter, is the length and is the tension of
the string, and would be the fundamental frequency
of the string completely without stiffness.
Inharmonicity is not necessarily unpleasant. Fle-
tcher, Blackham, and Stratton [1] pointed out that a
slightly inharmonic spectrum added certain “warmth”
into the sound. They found that synthesized piano
tones sounded more natural when the partials below
middle C were inharmonic.
The effect of mistuning one spectral component in
an otherwise harmonic complex is well known. Moore
and his colleagues [2] reported that the thresholds for
detecting mistuning decreased progressively with in-
creasing harmonic number and increasing fundamen-
tal frequency. In their experiment, mistuning was ex-
pressed as percentage of the harmonic frequency, and
the test tones were complex tones with 12 harmonics
at equal levels.
Moore’s group also showed that mistuning is heard
in different ways depending on the harmonic number
[2]. Shortening the stimulus duration produced a large
impairment in performance for the higher harmonics,
while it had only little effect on the performance for
the lower harmonics. It was reasoned that particu-
larly for long durations beats provide an effective cue,
but for short durations many cyclesof beats cannot be
heard. For the lower harmonics, beats were generally
inaudible, and the detection of mistuning appeared to
be based on hearing the mistuned component stand out
from the complex. The thresholds varied only weakly
with duration.
Scalcon et al. [3] studied the bandwidth of correct
positioning of the partials of synthesized piano tones.
They found cutoff frequencies abovewhich it was un-
necessary to imitate inharmonicity. For low tones the
relevant bandwidth was smaller than for higher tones,
but on the other hand, many more partials were in-
cluded in the frequency range at low tones. They also
stated that the effect of inharmonicity was unimportant
on the highest part of the keyboard.
We studied the audibility of inharmonicity as a func-
tion of the inharmonicity coefficient with funda-
mental frequency and sound duration as parameters.
The aim is to find general rules for the need to imple-
ment inharmonicity in digital sound synthesis. If in-
harmonicity were ignored, computational savings could
be achieved, since for instance in digital waveguide
modeling, an additional allpass filter is needed to im-
plement inharmonicity [4], [5], [6].
2. LISTENING TESTS
Subjects were required to distinguish between a com-
plex tone whose partials were exactly harmonic and an
otherwise identical complex tone whose partials were
mistuned. The threshold of audible inharmonicity was
measured as a function of at fundamental frequen-
cies 55 Hz ( ), 82.4 Hz ( ), 220 Hz ( ), 392 Hz
( ), and 1108.7 Hz ( ).
2.1. Test sounds
The test signals were generated using additive synthe-
sis to enable accurate control of the frequency and am-
plitude of each partial. The sampling rate was 22.05
kHz. The spectrum of the tones had a lowpass charac-
ter of the form 1/frequency, which is similar to spectra
of many string instruments. The decay of all partials
was exponential with a time constant seconds.
The initial phase of each partial was chosen randomly.
The tones contained all partials of the fundamen-
tal frequency up to 10 kHz. A constant cutoff fre-
quency was chosen because it was impossible to use
the same number of partials for every fundamental fre-
quency. Up to 50 partials would be important to the
perception of inharmonicity at low bass tones [3]. How-
ever, less than ten partials could be generated for the
highest tone before meeting the Nyquist limit. Realiz-
ing that the variable number of partials might affect the
audibility of inharmonicity, we reasoned that a con-
stant cutoff would still be the most practical solution.
A constant upper limit was also needed to keep the
spectral width of all test sounds equal. Galembo and
Cuddy showed [7] that without control of the spec-
tral width, inharmonicity changes the balance between
high and low frequencies and creates an impression of
sharpening.
Based on initial listening, we found that the thresh-
old of audibility varies over frequency. Thus, the range
of the inharmonicity factor must also vary as a func-
tion of frequency due to the chosen test method. A
range of was found for each fundamental frequency
to cover the probable threshold of audibility. The val-
ues of were uniformly spaced within the range.
A pitch increase due to inharmonicity was heard
at the highest tone. The subjects might listen to pitch
differences instead of timbre and ignore the effect of
inharmonicity unless there was an audible change of
pitch. To prevent this, the harmonic references of
were slightly tuned up to match the pitch of each in-
harmonic test sound. The pitches were adjusted until
no annoying difference was heard.
The effect of duration of the tone on the audibility
of inharmonicity was also studied. Two tone lengths
were tested, 1.5 s and 300 ms. The short samples were
generated by cutting off most of the decay phase of the
longer sounds. The time constant remained the same
for both durations.
2.2. Subjects and test method
Four subjects participated the final listening test. The
listeners were 20-30 years old, and all of them had mu-
sical training in some string instrument, either the pi-
ano or the guitar. None of them reported any hearing
defects. One of the listeners was the author HJ. The
sound samples were played through headphones from
a Silicon Graphics workstation using the GuineaPig
software [8]. Before the test the subjects were allowed
to practise until they made firm judgments.
The method of constant stimuli [9] was used. The
subjects heard pairwise a perfectly harmonic reference
sound and a possibly inharmonic sound, and the task
was to decide whether they sounded equal or different.
Eight values of (including ) were used for
each fundamental frequency, and each sound pair was
judged four times. The playback order of the sound
pairs was randomized, and the harmonic reference was
the first sound within a pair twice and the second one
twice.
3. TEST RESULTS
The psychometric function was approximated by count-
ing how many times each subject had regarded a sound
pair as “different” [9]. If there were zero “different”
judgments out of the four trials, the subject had no-
ticed the possible inharmonicity 0% of the time, and
by making four “different” judgments100% of the time.
The 50% threshold was chosen for the threshold of
audibility. If the threshold was not directly seen in
the data, it had to be interpolated between higher and
lower percentages. If it was spread over two values of
, the mean was calculated between them.
The audibility thresholds are shown as a function
of fundamental frequency for each subject in fig. 1.
The sample duration was 1.5 seconds. The answers
of the four subjects were roughly normally distributed.
The mean thresholds over the listeners at different fun-
damental frequencies are shown in table 1.
Table 1: Inharmonicity coefficient at mean thresholds av-
eraged over the subjects for the different fundamental fre-
quencies.
at mean threshold Standard deviation
0.00000058 0.00000021
0.0000013 0.00000038
0.000033 0.000019
0.000055 0.000027
0.0012 0.00081
The thresholds show a strong linear trend when a
logarithmic scale is used both for the frequency and
the . The highest note was judged with least ac-
curacy, and the effect of inharmonicity was also small-
est. At the mean threshold the value of was more
than 2000 times higher than that for the lowest tone.
102103
10−6
10−4
10−2
Frequency Hz
Inharmonicity coefficient B
Results of the four subjects
Figure 1: The individual thresholds for the four listeners
at , , , , and . Sample duration was 1.5
seconds.
A straight line was fitted to the mean threshold
values in the least-squares sense. This way a simple
formula was derived that could be used to model the
audibility threshold as a function of fundamental fre-
quency: (3)
The natural logarithm was used. The fitted line is il-
lustrated in fig. 2. The thresholds of one subject were
especially near the estimated line (subject J, see figure
2). The thresholds of two other subjects differ from
the line at , where they were able to detect smaller
inharmonicity than at . The nonmonotonic behav-
ior is not surprising, since the subjects reported that
they used several different cues to detect inharmonic-
ity. The performance can depend on the existence of
certain cues at different fundamental frequencies and
the subject’s sensitivity to the particular cue such as
beating or roughness.
3.1. Effect of duration
The test was repeated using shorter sound samples (300
ms). The thresholds of the four subjects (see fig. 3)
showed again a linear trend, and a straight line was fit-
ted to the results as before. The following model was
derived:
(4)
Both models as well as the average thresholds over
all subjects in both cases are shown in fig. 4. The
slopes of the models suggest that at low fundamen-
tal frequencies the detection of inharmonicity becomes
harder and at high fundamental frequencies somewhat
102103
10−6
10−4
10−2
Frequency Hz
Inharmonicity coefficient B
o− Straight line fit to average data over all subjectso− Straight line fit to average data over all subjects
x−− Subject J
Figure 2: The straight line fitted to average data over all
subjects (solid line), and the thresholds of subject J (dashed
line).
102103
10−6
10−4
10−2
Frequency Hz
Inharmonicity coefficient B
Short data: Results of the four subjects
Figure 3: The individual thresholds for the four listeners at
, , , , and . Sample duration was 300 ms.
easier when the samples become shorter. To test the
significance of the linear models, t-tests were performed
on the slopes [10]. The steeper slope of the long dura-
tion model was tested against the slope of the short du-
ration model, and the slope of the short duration model
was tested against zero slope. Both results were highly
significant, suggesting that the two slopes are greater
than zero ( (slope=0) = 0.0028) and that the slope of
the long duration model is steeper than that of the short
duration model ( (slopes are equal) = 0.0008).
4. DISCUSSION
Typical values of for piano strings lie roughly be-
tween 0.00005 for low bass tones and 0.015 for the
high treble tones [11]. Though the sounds are less in-
102103
10−6
10−4
10−2
Frequency Hz
Inharmonicity coefficient B
o− Straight line fit to average long sample data
o−− Average long sample data
x− Straight line fit to average short sample data
x−− Average short sample data
Figure 4: The linear models of audibility of inharmonicity
as a function of fundamental frequency for long and short
durations (solid lines), and the corresponding mean thresh-
olds (dashed lines).
harmonic in the bass range, inharmonicity is detected
more easily than at the highest tones. Compared to the
audibility thresholds, inharmonicity would be clearly
audible at low fundamental frequencies. In the treble
range, audibility is questionable. At , the thresh-
old of audibility is closer to the possible values of
in real instruments. Furthermore, the highest tone was
judged less accurately than the others and even non-
monotonously, i.e., the performance weakened locally
when inharmonicity was increased.
Thus it would be necessary to implement the effect
of inharmonicity in digital sound synthesis systems at
least in the bass range. In the treble range computa-
tional savings could be achieved by omitting the all-
pass filter responsible for the effect of inharmonicity.
There can be several causes for the better perfor-
mance at lower frequencies. The subjects told that they
were using beats as a cue. Beats were mostly audible
at low fundamental frequencies. When most of the de-
cay phase was cut off, the performance weakend in the
bass range but improved in the treble range, where an-
other cue was possibly used.
The test results are consistent with the general as-
sumption that the effect of inharmonicity is greater
when more partials are present [7]. This could also be
a cause for the differences in performance. However,
the number of partials in our hearing range decreases
with increasing fundamental frequency also in sounds
of real musical instruments, so it would make no sense
to isolate these factors from each other. Inharmonicity
can be influenced also by many other features [7]. To
study all these in a formal listening experiment would
be too laborous.
Our future objective is to find more general rules
for the need to implement the effect of inharmonic-
ity in digital sound synthesis systems. More accurate
models are needed for the effect of duration, number
of partials and spectral width, relative level of the par-
tials, and different decay rates between them.
5. ACKNOWLEDGMENTS
This work was supported by the Academy of Finland
and the Pythagoras Graduate School. The authors ex-
press sincere thanks to the test subjects and also to
Alexander Galembo for kindly providing his recent
publications.
6. REFERENCES
[1] H. Fletcher, E. D. Blackham, and R. Stratton,
“Quality of piano tones,” J. Acoust. Soc. Am.,
vol. 34, no. 6, pp. 749–761, 1962.
[2] B. C. J. Moore, R. W. Peters, and B. C. Glasberg,
“Thresholds for the detection of inharmonicity
in complex tones,” J. Acoust. Soc. Am., vol. 77,
no. 5, pp. 1861–1867, 1985.
[3] F. Scalcon, D. Rocchesso, and G. Borin, “Subjec-
tive evaluation of the inharmonicity of synthetic
piano tones,” in Proc. Int. Comp. Music Conf.
ICMC’98, pp. 53–56, 1998.
[4] D. A. Jaffe and J. O. Smith, “Extensions of the
Karplus-Strong plucked-string algorithm,” Com-
puter Music Journal, vol. 7, no. 2, pp. 56–69,
1983.
[5] A. Paladin and D. Rocchesso, “A dispersive res-
onator in real time on MARS workstation,” in
Proc. Int Comp. Music Conf. (ICMC’92), (San
Jose, CA), pp. 146–149, Oct. 1992.
[6] S. Van Duyne and J. O. Smith, “A simplified ap-
proach to modeling dispersion caused by stiff-
ness in strings and plates,” in Proc. Int. Comp.
Music Conf. ICMC’94, pp. 407–410, 1994.
[7] A. Galembo and L. Cuddy, “String inharmonic-
ity and the timbral quality of piano bass tones:
Fletcher, Blackham, and Stratton (1962) revis-
ited.” Report to the 3rd US Conference on Mu-
sic Perception and Cognition, MIT, Cambridge,
MA, July - August 1997.
[8] J. Hynninen and N. Zacharov, “GuineaPig – a
generic subjective test system for multichannel
audio.” Presented at the 106th Convention of the
Audio Engineering Society, May 8-11 1999, Mu-
nich, Germany, preprint no. 4871, 1999.
[9] J. P. Guilford, Psychometric Methods. McGraw-
Hill, 1956.
[10] J. Milton and J. C. Arnold, Introduction to prob-
ability and statistics. McGraw-Hill, 1990.
[11] H. A. Conklin, “Generation of partials due to
nonlinear mixing in stringed instruments,” J.
Acoust. Soc. Am., vol. 105, no. 1, pp. 536–545,
1999.
... Due to the fixed points of the string these overtones gradually become more dissonant the higher the multiple. While these overtones are not prominent, they are incredibly important in the make-up of the sound [Järveläinen et al., 1999;Järveläinen and Karjalainen, 2006]. This is called the inharmonicity of strings [Järveläinen et al., 1999]. ...
... While these overtones are not prominent, they are incredibly important in the make-up of the sound [Järveläinen et al., 1999;Järveläinen and Karjalainen, 2006]. This is called the inharmonicity of strings [Järveläinen et al., 1999]. The inharmonicity is caused by the stiffness of the strings, where the stiffer the string the greater the inharmonicity [Järveläinen et al., 1999]. ...
... This is called the inharmonicity of strings [Järveläinen et al., 1999]. The inharmonicity is caused by the stiffness of the strings, where the stiffer the string the greater the inharmonicity [Järveläinen et al., 1999]. ...
... According to Elvander et al. (2020), human speech deviations are small enough to exploit harmonic structure when estimating pitch, fitting this definition. • The electric bass from 'R U Ready' (C.11.2) has inharmonic partials above the seventh position only, making it quasi-harmonic (Järveläinen et al., 1999). Figure 8: 'Boom', bass, vocals, and kick drum track, 0'16 to 0'18. ...
Preprint
Full-text available
We identify characteristic features of how pitch is manipulated for expressive purposes by Hyper Music, a mainstream commercial music company specialising in advertisement music for global corporations. The study shows that the use and organisation of pitch in the company's `Primaal' brand differs from Western classical music. Through interviews with producers and in-depth analysis of their work, we reveal that their methods centre on a conscious aim to construct a musical discourse based on pitch uncertainty, contrasting with the clear transmission of well-defined pitches in Western classical traditions. According to the Primaal producers, who acknowledge the influence of artists such as Kanye West and Daft Punk and use widely available technology, pitch uncertainty captures the listener's attention. We provide analyses of musical excerpts demonstrating their approach, alongside descriptions of the tools and methods employed to achieve their expressive goals. These goals and methods are placed in a broader historical context, contrasting with fundamental principles of pitch organisation in Western music. Techniques used by Hyper Music to introduce and control pitch uncertainty include boosting upper partials, expressive use of inharmonicity, continuous pitch distributions around 'poles' tied to specific 'modes', and continuously evolving pitch. We examine these techniques from a psychoacoustic perspective, and conduct listening tests corroborating some of the observations. The ultimate goal of the study is to introduce a set of methods suited to the analysis of pitch in contemporary popular music.
... 1.5 Hz), undulation is not perceived. Further, in contrast to tones produced by bowed strings, tones produced by plucked strings are slightly inharmonic (Järveläinen et al. 1999), and as they constantly decrease in loudness, perception of roughness and undulation becomes a enuated. Moreover, nothing is known of the precision with which such Mesopotamian tuning mechanisms as sticks and wrappings (Kilmer 1998, 14, Fig. 1;and Schauensee 1998, 24, Fig. 5) would have been adjusted to alter the fundamental frequency of a plucked string (Kilmer 1998, 14, Fig. 1 andSchauensee 1998, 24, Fig. 5). ...
Article
Full-text available
On the basis of assumptions and conclusions first advanced in 1968 concerning tuning instructions that were originally written down ca. 1800 BCE, Assyriologists have agreed that Mesopotamian tuning was diatonic. Nonetheless, Sam Mirelman (2013) has recently suggested that this consensus view is “uncomfortably familiar and Eurocentric.” As a follow-up to Mirelman’s misgiving, the present report begins by identifying flaws in the reasoning concerning Mesopotamian tuning that was disseminated more than half a century ago and have remained uncontested. The starting point of the present study is information directly recorded in Mesopotamian documents, as opposed to concepts dating from Greek Antiquity and beyond. This information includes the spatial ordering of strings and the relative fundamental frequencies of two pairs of strings on the sammû , a harp or lyre that is explicitly identified in cuneiform tablets, as well as the tuning instructions’ recursive and symmetrical patterning of prescriptions concerning the alterations of this instrument’s strings. At each step, this patterning involves loosening or tightening a string that is three or four strings away from the string that had just been tightened or loosened. Added to these observations are acoustical features of the harmonics produced by plucked strings, and the auditory roughness and smoothness produced by pairs of plucked strings that psychoacoustical studies have established as universally audible. On these bases, one can conclude that Mesopotamian tuning can be interpreted as diatonic in structure without assuming such notions as octave identity, scale degrees, and consonance, all concepts for which there is no known testimony until much later in the history of music theory.
... Більше того, вказана особливість порушує питання і перед розробниками цифрових піаніно -чи не варто враховувати ефект негармонічності фортепіанних струн задля більшої реалістичнос ті електронних імітацій? (Jarvelainen, 1999). В останні роки акустичним досліджен ням були піддані і мультифонічні звуки дерев'яних духових інструментів, наприклад в дисертації С. Ваттс представлено спектрограми мультифонічних звуків бас-кларне та, які, на думку дослідниці, допоможуть студентам в опануванні відповідними техні ками гри на інструменті (Watts, 2015). ...
Conference Paper
Full-text available
Із зростанням тембрального розмаїття європейської музики у XX столітті традиційні музикознавчі можливості для опису тембрів за допомогою епітетів та метафор стають неефективними. Натомість, спектральний аналіз звуків дозволив розробити типологію звукових об’єктів (77. Шеффер), вивчити особливості сприйняття звуків з гармонічними та негармонійними спектрами (А. Володін), обґрунтувати особливості настройки фортепіано з урахуванням негармонійності струн (Г. Ярвеляйнен та ін.), дослідити природу мультифонічних звуків дерев’яних духових (С. Ваттс та ін.). У музичній фольклористиці акустичні дослідження дозволили встановити синкретизм часового та динамічного аспектів народного виконавства на ударних інструментах (І. Фетисов).
... Most obviously many percussion instruments, in particular bells, feature large amounts of distinctly inharmonic partials and when it comes to computer-music, there is no physical limit to the extent of partial detuning. But also pitched instruments like the piano are often characterised by the slightly detuned nature of the partials of the hammered strings (Järveläinen, Välimäki and Karjalainen 1999;Schuck 1943), particularly the bass strings. The same goes for groups of instruments playing in unison in an orchestra or big band setting (Cohen 1984). ...
Article
Perceptual dimensions underlying timbre and sound-source identification have received considerable scientific attention. While these scholarly insights help us in understanding the nature of sound within a multidimensional timbral space, they carry little meaning for the majority of musicians. To help address this, we conducted two experiments to establish listeners’ perceptual thresholds (PT) for changes in sound using a staircase-procedure. Unlike most timbre perception research, these changes were sonic manipulations that are common in synthesisers, audio processors and instruments familiar to musicians and producers, and occurred within continuous sounds (rather than between discrete pairs of sounds). In experiment 1, two sounds (variants of a sawtooth oscillation) both with the same fundamental frequency (F1: 80 Hz, 240 Hz or 600 Hz) were played with no intervening gap. In each trial, the two sounds’ partials differed in amplitudes or frequencies to produce a timbre change. The sonic manipulations were varied in size to detect thresholds for the perceived timbre change – listeners were instructed to indicate whether or not they perceived a change within the sound. In experiment 2, we modified stimulus presentation to introduce the factor of transition time (TT). Rather than occurring instantaneously (as in experiment 1), the timbre manipulations were introduced gradually over the course of a 100 ms or a 1000 ms TT. Results revealed that PTs were significantly affected by the manipulations in experiment 1, and additionally by TT in experiment 2. Importantly, the data revealed an interaction between the F1 and the timbre manipulations, such that there were differential effects of timbre changes on the perceptual system depending on pitch height. Musicians ( n = 11) showed significantly smaller PTs compared to non-musicians ( n = 10). However, PTs for musicians and non-musicians were highly correlated ( r = .83) across different sonic manipulations, indicating similar perceptual patterns in both. We hope that by establishing PTs for commonly used timbre manipulations, we can provide musicians with a general perceptual unit, for each manipulation, that can guide music composition and assessment.
... Most obviously, many percussion instruments, in particular, bells, feature large amounts of distinctly inharmonic partials and when it comes to computer music, there is no physical limit to the extent of partial detuning. But also pitched instruments, like the piano, are often characterized by the slightly detuned nature of the partials of the hammered strings [18,19], particularly the bass strings. The same goes for groups of instruments playing in unison in an orchestra or big band setting [20]. ...
Article
Full-text available
Mixing music is a highly complex task. This is exacerbated by the fact that timbre perception is still poorly understood. As a result, few studies have been able to pinpoint listeners’ preferences in terms of timbre. In order to investigate timbre preference in a music production context, we let participants mix multiple individual parts of musical pieces (bassline, harmony, and arpeggio parts, all sounded with a synthesizer) by adjusting four specific timbral attributes of the synthesizer (lowpass, sawtooth/square wave oscillation blend, distortion, and inharmonicity). After participants mixed all parts of a musical piece, they were asked to rate multiple mixes of the same musical piece. Listeners showed preferences for their own mixes over random, fixed sawtooth, or expert mixes. However, participants were unable to identify their own mixes. Despite not being able to accurately identify their own mixes, participants consistently preferred the mix they thought to be their own, regardless of whether or not this mix was indeed their own. Correlations and cluster analysis of the participants’ mixing settings show most participants behaving independently in their mixing approaches and one moderate sized cluster of participants who are actually rather similar. In reference to the starting-settings, participants applied the biggest changes to the sound with the inharmonicity manipulation (measured in the perceptual distance) despite often mentioning that they do not find this manipulation particularly useful. The results show that listeners have a consistent, yet individual timbre preference and are able to reliably evoke changes in timbre towards their own preferences.
... The perception of inharmonicity in piano tones was studied from a synthesis viewpoint in [7], [8]. Another work on the perception of inharmonicity with a model-based synthesis motivation was presented in [9], [10]. The perception of the vibrato of violin tones was investigated in [11]. ...
Article
The audibility of variations of decay parameters in plucked string synthesis was studied by a listening test. The decay was described by an overall time constant and a frequency-dependent parameter. Two fundamental frequencies, tone durations, and types of excitations were used for both parameters. The results indicate that variations between 25 and 40% in the time constant are inaudible. This suggests large perceptual tolerances for the decay parameters. The results are applied in model-based audio processing.
... To compute the frequencies for the filters we used equation 2, where f 1 is the fundamental frequency and B is the inharmonicity factor. Having a correct B value is really critical in order to compute the real frequencies of the spectrum [3]; for this reason an estimation of the inharmonicity in the signal is performed evaluating the energy divergence of the spectral components from the multiple of the fundamental [7]: ...
Conference Paper
Full-text available
It is sometimes desirable, in the purpose of analyzing recorded piano tones, to remove from the original signal the noisy components generated by the hammer strike and by other elements involved in the piano action. In this article we propose an efficient method to achieve such result, based on adaptive filtering and automatic estimation of fundamental frequency and inharmonicity; the final method, applied on a recorded piano tone, produces two separate signals containing respectively the hammer knock and the harmonic components. Some sound examples to listen for evaluation are available on the web as specified in the paper.
... It was also shown that lower strings produce larger inharmonicities than higher strings. In [96] the audibility of inharmonicity is studied for guitar and piano sounds as a function of β. Results from the listening test ...
Thesis
Full-text available
This thesis addresses the development of a system for pitch-informed solo and accompaniment separation capable of separating main instruments from music accompaniment regardless of the musical genre of the track, or type of music accompaniment. For the solo instrument, only pitched monophonic instruments were considered in a single-channel scenario where no panning or spatial location information is available. In the proposed method, pitch information is used as an initial stage of a sinusoidal modeling approach that attempts to estimate the spectral information of the solo instrument from a given audio mixture. Instead of estimating the solo instrument on a frame by frame basis, the proposed method gathers information of tone objects to perform separation. Tone-based processing allowed the inclusion of novel processing stages for attack refinement, transient interference reduction, common amplitude modulation (CAM) of tone objects, and for better estimation of non-harmonic elements that can occur in musical instrument tones. The proposed solo and accompaniment algorithm is an efficient method suitable for real-world applications. A study was conducted to better model magnitude, frequency, and phase of isolated musical instrument tones. As a result of this study, temporal envelope smoothness, inharmonicty of musical instruments, and phase expectation were exploited in the proposed separation method. Additionally, an algorithm for harmonic/percussive separation based on phase expectation was proposed. The algorithm shows improved perceptual quality with respect to state-of-the-art methods for harmonic/percussive separation. The proposed solo and accompaniment method obtained perceptual quality scores comparable to other state-of-the-art algorithms under the SiSEC 2011 and SiSEC 2013 campaigns, and outperformed the comparison algorithm on the instrumental dataset described in this thesis. As a use-case of solo and accompaniment separation, a listening test procedure was conducted to assess separation quality requirements in the context of music education. Results from the listening test showed that solo and accompaniment tracks should be optimized differently to suit quality requirements of music education. The Songs2See application was presented as a commercial music learning software which includes the proposed solo and accompaniment separation method.
Conference Paper
Full-text available
The effect of inharmonicity on pitch was measured by listening tests at four fundamental frequencies. Inharmonicity was defined in a way typical of string instruments, such as the piano, where all partials are elevated in a systematic way. It was found that the pitch judgment is usually dominated by some other partials than the fundamental; however, with a high degree of inharmonicity, the fundamental also began to dominate. Guidelines are given for compensating for the pitch difference between harmonic and inharmonic sounds in digital sound synthesis.
Article
Full-text available
The GuineaPig test system has been developed as a generic, software based, multichannel audio system for subjective testing. Based around the SGI hardware platform, the system provides testing with several test categories, including stan-dardised tests, providing currently up to 8-channel output and graphical UIs for testing. The GuineaPig provides experimenters fast access to subjective testing.
Article
Full-text available
Thresholds were measured for the detection of inharmonicity in complex tones. Subjects were required to distinguish a complex tone whose partials were all at exact harmonic frequencies from a similar complex tone with one of the partials slightly mistuned. The mistuning which allowed 71% correct identification in a two-alternative forced-choice task was estimated for each partial in turn. In experiment I the fundamental frequency was either 100, 200, or 400 Hz, and the complex tones contained the first 12 harmonics at equal levels of 60 dB SPL per component. The stimulus duration was 410 ms. For each fundamental the thresholds were roughly constant when expressed in Hz, having a mean value of about 4 Hz (range 2.4-7.3 Hz). In experiment II the fundamental frequency was fixed at 200 Hz, and thresholds for inharmonicity were measured for stimulus durations of 50, 110, 410, and 1610 ms. For harmonics above the fifth the thresholds increased from less than 1 Hz to about 40 Hz as duration was decreased from 1610-50 ms. For the lower harmonics (up to the fourth) threshold changed much less with duration, and for the three shorter durations thresholds for each duration were roughly a constant proportion of the harmonic frequency. The results suggest that inharmonicity is detected in different ways for high and low harmonics. For low harmonics the inharmonic partial appears to "stand out" from the complex tone as a whole. For high harmonics the mistuning is detected as a kind of "beat" or "roughness," presumably reflecting a sensitivity to the changing relative phase of the mistuned harmonic relative to the other harmonics.(ABSTRACT TRUNCATED AT 250 WORDS)
Article
A synthesizer was constructed to produce simultaneously 100 pure tones with means for controlling the intensity and frequency of each one of them. The piano tones were analyzed by conventional apparatus and methods and the analysis set into the synthesizer. The analysis was considered correct only when a jury of eight listeners could not tell which were real and which were synthetic tones. Various kinds of synthetic tones were presented to the jury for comparison with real tones. A number of these were judged to have better quality than the real tones. According to these tests synthesized piano like tones were produced when the attack time was less than 0.01 sec. The decay can be as long as 20 sec for the lower notes and be less than 1 sec for the very high ones. The best quality is produced when the partials decrease in level at the rate of 2 db per 100 cps increase in the frequency of the partial. The partials below middle C must be inharmonic in frequency to be piano like.
Article
Nonlinear mixing appears to account for the production of 'phantom partials' in piano tones. 'Phantom partials' are those that appear at frequencies exactly harmonic to normal inharmonic string partials, and at frequencies equal to the sums of the frequencies of normal inharmonic partials. Nonlinear mixing can occur because tension varies during transverse vibration. This produces longitudinal string forces of Phantom-partial frequency that appear at the soundboard bridge and are coupled to the soundboard. 'Phantom partials' were found in piano tones, in the motion of a piano bridge, in the longitudinal vibrations of a monochord string, in the acoustical output of a soundboard coupled to a monochord string, and in the acoustical output of a guitar. Any plucked-string or struck-string instrument having appreciable acoustical response to longitudinal string forces could be expected to produce phantom partials. The relation in frequency between phantom partials and normal partials, which varies with inharmonicity, may play a part in differentiating the timbre of tones at the same fundamental frequency in pianos of different size and design.
Article
A difference equation of the digital filter is given and the noise burst is defined. The string simulator is analyzed, a single-pluck string note is synthesized. Both tuning and decay of harmonies are derived and the effect of decay string on tuning interpreted. A block diagram of the string simulator contains algorithm extensions discussed. The algorithm is a high-order digital filter, which represents the string, and a short noise burst, which represents the ″pluck″ .
String inharmonic-ity and the timbral quality of piano bass tones: Fletcher, Blackham, and Stratton (1962) revis-ited
  • A Galembo
  • L Cuddy
A. Galembo and L. Cuddy, “String inharmonic-ity and the timbral quality of piano bass tones: Fletcher, Blackham, and Stratton (1962) revis-ited.” Report to the 3rd US Conference on Mu-sic Perception and Cognition, MIT, Cambridge, MA, July - August 1997.