An Idiom-independent Representation of Chords for Computational Music Analysis and Generation

Emilios Cambouropoulos, Maximos Kaliakatsos-Papakostas, Costas Tsougras
School of Music Studies, Aristotle University of Thessaloniki
emilios@mus.auth.gr, maxk@mus.auth.gr, tsougras@mus.auth.gr
ABSTRACT
In this paper we focus on issues of harmonic representation and computational analysis. A new idiom-independent representation of chord types is proposed that is appropriate for encoding tone simultaneities in any harmonic context (such as tonal, modal, jazz, octatonic, atonal). The General Chord Type (GCT) representation allows the re-arrangement of the notes of a harmonic simultaneity such that abstract idiom-specific types of chords may be derived; this encoding is inspired by the standard roman numeral chord type labeling, but is more general and flexible. Given a consonance-dissonance classification of intervals (that reflects culturally-dependent notions of consonance/dissonance) and a scale, the GCT algorithm finds the maximal subset of notes of a given note simultaneity that contains only consonant intervals; this maximal subset forms the base upon which the chord type is built. The proposed representation is ideal for hierarchic harmonic systems such as the tonal system and its many variations, but adjusts to any other harmonic system such as post-tonal, atonal music, or traditional polyphonic systems. The GCT representation is applied to a small set of examples from diverse musical idioms, and its output is illustrated and analysed, showing its potential, especially for computational music analysis and music information retrieval.
1. INTRODUCTION
There exist different typologies for encoding note simultaneities that embody different levels of harmonic information/abstraction and cover different harmonic idioms. For instance, for tonal musics, chord notations such as the following are commonly used: figured bass (pitch classes denoted above a bass note, with no concept of ‘chord’), popular music guitar style notation or jazz notation (absolute chord), and roman numeral encoding (relative to a key) [1]. For atonal and other non-tonal systems, pc-set theoretic encodings [2] may be employed.
A question arises: is it possible to devise a ‘universal’
chord representation that adapts to different harmonic
idioms? Is it possible to determine a mechanism that,
given some fundamental idiom features, such as pitch
hierarchy and consonance/dissonance classification, can
automatically encode pitch simultaneities in a pertinent
manner for the idiom at hand?
Before attempting to answer the above question one
could ask: What might such a ‘universal’ encoding sys-
tem be useful for? Apart from music-theoretic interest
and cognitive considerations/implications, a general
chord encoding representation may allow developing
generic harmonic systems that may be adaptable to di-
verse harmonic idioms, rather than designing ad hoc sys-
tems for individual harmonic spaces. This was the prima-
ry aim for devising the General Chord Type (GCT) repre-
sentation. In the case of the project COINVENT [3], a
creative melodic harmonisation system is required that
relies on conceptual blending between diverse harmonic
spaces in order to generate novel harmonic constructions;
mapping between such different spaces is facilitated
when the shared generic space is defined with clarity, its
generic concepts are expressed in a general and idiom-
independent manner, and a common general representa-
tion is available.
In recent years, many melodic harmonisation systems have been developed: some are rule-based [4,5] or evolutionary approaches that utilise rule-based fitness evaluation [6,7]; others rely on machine learning techniques such as probabilistic approaches [8,9] and neural networks [10], on grammars [11], or on hybrid systems (e.g. [12]). Almost all of these systems model aspects of tonal harmony: from “standard” Bach-like chorale harmonisation ([4,10] among many others) to tonal systems such as “classic” jazz or pop ([9,11] among others). These systems aim to produce harmonisations of melodies that reflect the style of the idiom in question, by utilising chords and chord annotations that are characteristic of the idiom. For instance, the chord representation in studies of the Bach chorales usually includes standard roman numeral symbols, while jazz approaches encompass additional information about extensions in the guitar style encoding.
For tonal computational models, Harte’s representation [13] provides a systematic, context-independent syntax for representing chord symbols which can easily be written and understood by musicians and, at the same time, is simple and unambiguous to parse with computer programs. This chord representation is very useful for manually annotating tonal music, mostly genres such as pop, rock and jazz that use guitar-style notation. However, it cannot be automatically extracted from chord reductions and is not designed to be used in non-tonal musics.
In this paper, we first present the main concepts behind the General Chord Type representation and give an overall description; we then describe the GCT algorithm that automatically computes chord types for each chord; next, we present examples from diverse music idioms that show the potential of the representation and give some examples of applying statistical learning on such a representation; finally, we discuss problems and future improvements.
Copyright: © 2014 Emilios Cambouropoulos et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License 3.0 Unported, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
2. REPRESENTING CHORDS
Harmonic analysis focuses on describing the harmonic
content of pitch collections/patterns within a given music
context in terms of harmonic labels, classes, functions
and so on. Harmonic analysis is a rather complex musi-
cal task that involves not only finding roots and labelling
chords within a key, but also segmentation (points of
harmonic change), identification of non-chord notes, met-
ric information and more generally musical context [14].
In this paper, we focus on the core problem of labelling
chords within a given pitch hierarchy (e.g. key); thus we
assume that a full harmonic reduction is available as in-
put to the model (manually constructed harmonic reduc-
tions).
Our intention is to create an analytic system that may
label any pitch collection, based on a set of user-defined
criteria rather than on standard tonal music theoretic
models or fixed psychoacoustic properties of harmonic
tones. We intend our representation to be able to cope
with chords not only in the tonal system, but any harmon-
ic system (e.g. octatonic, whole-tone, atonal, traditional
harmonic systems, etc.).
Root-finding is a core harmonic problem addressed primarily following two approaches: the standard stack-of-thirds approach and the virtual pitch approach. The first attempts to re-order chord notes such that they are separated by (major or minor) third intervals, preserving the most compact ordering of the chord; these stacks of thirds can then be used to identify the possible root of a chord (see, for instance, the recent advanced proposal in [15]). The second approach is based on Terhardt’s virtual pitch theory [16] and Parncutt’s psychoacoustic model of harmony [17]; it maintains that the root of a chord is the pitch most strongly implied by the combined harmonics of all its constituent notes (intervals derived from the first members of the harmonic series are considered as ‘root supporting intervals’).
Both of these approaches rely on a fixed theory of
consonance and a fixed set of intervals that are consid-
ered as building blocks of chords. In the culture-sensitive
stack-of-thirds approach, the smallest consonant intervals
in tonal music, i.e. the major and minor thirds, are the
basis of the system. In the second ‘universal’ psychoa-
coustic approach, the following intervals, in decreasing
order of importance, are employed: unison, perfect fifth,
major third, minor seventh, and major second. Both of
these approaches are geared towards tonal harmony, each
with its strengths and weaknesses (for instance, the se-
cond approach has an inherent difficulty with minor har-
monies). Neither of them can be readily extended to other
idiosyncratic harmonic systems.
Harmonic consonance/dissonance has two major
components: Sensory-based dissonance (psychoacoustic
component) and music-idiom-based dissonance (cultural
component)[18]. Due to the music-idiom dependency
component, it is not possible to have a fixed universal
model of harmonic consonance/dissonance. A classifica-
tion of intervals into categories across the dissonance-
consonance continuum can be made only for a specific
idiom. The most elementary classification is into two
basic categories: consonant and dissonant. For instance,
in the common-practice tonal system, unisons, octaves,
perfect fifths/fourths (perfect consonances) and thirds and
sixths (imperfect consonances) are considered to be con-
sonances, whereas the rest of the intervals (seconds, sev-
enths, tritone) are considered to be dissonances; in poly-
phonic singing from Epirus, major seconds and minor
sevenths may additionally be considered ‘consonant’ as
they appear in metrically strong positions and require no
resolution; in atonal music, all intervals may be consid-
ered equally ‘consonant’.
Let’s examine the case of tonal and atonal harmony;
these are probably as different as two harmonic spaces
may be. In the case of tonal and atonal harmony, some
basic concepts are shared; however, actual systematic
descriptions of chord-types and categories are drastically
different (if not incompatible), rendering any attempt to
‘align’ two input spaces challenging and possibly mis-
leading (Figure 1). On one hand, tonal harmony uses a
limited set of basic chord types (major, minor, dimin-
ished, augmented) with extensions (7ths, 9ths etc.) that
have roots positioned in relation to scale degrees and the
tonic, reflecting the hierarchic nature of tonal harmony;
on the other hand, atonal harmony employs a flat mathe-
matical formalism that encodes pitches as pitch-class sets
leaving aside any notion of pitch hierarchy, tone centres
or more abstract chord categories and functions. The two seem worlds apart, their only meeting point being the fact that tones sound together (either physically sounding together or sounding close enough to one another for an implied harmony to emerge).
Figure 1. Is mapping between ‘opposing’ harmonic
spaces possible?
Pc-set theory, of course, being a general mathematical formalism, can be applied to tonal music, but then its descriptive potential is severely curtailed and the most interesting tonal harmonic relations and functions are lost. For instance, the distinction between major and minor chords is lost if Forte’s prime form is used (037 for both, as these two chords have identical interval content), and a dominant seventh chord is confused with a half-diminished seventh (prime form 0258 for both). Even if the less general normal order is used, for the dominant seventh (0368) the root of the chord is not the 0 on the left of this ordering (pc 8 is the root). Pitch-class set theory is thus not adequate for tonal music. At the same time, the roman-numeral formalism is inadequate for atonal music, as major/minor chords and tonal hierarchies are hardly relevant there.
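The loss of information described above can be checked with a short sketch. The Python below computes a zero-based normal order and a prime form using Rahn-style packing, which is an assumption on our part: Forte's own tie-breaking differs for a handful of set classes, none of which is involved here. The function names are illustrative and do not come from the paper.

```python
def normal_order_from_zero(pcs):
    """Most compact rotation of a pc-set, transposed so that it starts on 0."""
    pcs = sorted({p % 12 for p in pcs})
    rotations = []
    for i in range(len(pcs)):
        rot = pcs[i:] + pcs[:i]
        rotations.append([(p - rot[0]) % 12 for p in rot])
    # Rahn-style packing (assumption): smallest outer interval, then compare inwards.
    return min(rotations, key=lambda r: r[::-1])


def prime_form(pcs):
    """Prime form: the more compact of the set and its inversion."""
    inverted = [(-p) % 12 for p in pcs]
    return min(normal_order_from_zero(pcs),
               normal_order_from_zero(inverted),
               key=lambda r: r[::-1])


c_major, c_minor = [0, 4, 7], [0, 3, 7]
g_dom7, b_half_dim7 = [7, 11, 2, 5], [11, 2, 5, 9]

print(prime_form(c_major), prime_form(c_minor))      # both [0, 3, 7]
print(prime_form(g_dom7), prime_form(b_half_dim7))   # both [0, 2, 5, 8]
print(normal_order_from_zero(g_dom7))                # [0, 3, 6, 8]; the root G sits at 8
```

Major and minor triads collapse to the same label, as do the dominant and half-diminished sevenths, while the normal order of the dominant seventh places its root at 8 rather than at 0.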
In trying to tackle issues of tonal hierarchy, we have
devised a novel chord type representation, namely the
General Chord Type (GCT) representation, that takes as
its starting point the common-practice tonal chord repre-
sentation (for a tonal context, it is equivalent to the stand-
ard roman-numeral harmonic encoding), but is more gen-
eral as it can be applied to other non-standard tonal sys-
tems such as modal harmony and, even, atonal harmony.
This representation draws on knowledge from the domain
of psychoacoustics and music cognition, and, at the same
time, ‘adjusts’ to any context of scales, tonal hierarchies
and categories of consonance/dissonance.
At the heart of the GCT representation is the idea that the ‘base’ of a note simultaneity should be consonant. The GCT algorithm tries to find a maximal subset that is consonant; the rest of the notes, which create dissonant intervals with one or more notes of the chord base, form the chord ‘extension’. The GCT representation has common characteristics with the stack-of-thirds and the virtual pitch root-finding methods for tonal music, but has differences as well (see section 4.3). Moreover, the user can define which intervals are considered ‘consonant’, thus giving rise to different encodings. As will be shown in the next sections, the GCT representation naturally encapsulates the structure of tonal chords and at the same time is very flexible and can readily be adapted to different harmonic systems.
3. THE GENERAL CHORD TYPE REPRESENTATION
3.1 Description of the GCT Algorithm
Given a classification of intervals into conso-
nant/dissonant (binary values) and an appropriate scale
background (i.e. scale with tonic), the GCT algorithm
computes, for a given multi-tone simultaneity, the ‘opti-
mal’ ordering of pitches such that a maximal subset of
consonant intervals appears at the ‘base’ of the ordering
(left-hand side) in the most compact form. Since a tonal
centre (key) is given, the position within the given scale
is automatically calculated.
Input to the algorithm is the following:
Consonance vector: The user defines which intervals are consonant/dissonant through a 12-point Boolean vector of consonant (1) or dissonant (0) intervals. For instance, the vector [1,0,0,1,1,1,0,1,1,1,0,0] means that the unison, minor and major third, perfect fourth and fifth, and minor and major sixth intervals are consonant, whereas the dissonant intervals are the seconds, sevenths and the tritone; this specific vector is referred to in this text as the common-practice consonance vector.
Pitch Scale Hierarchy: The pitch hierarchy (if any) is given in the form of scale tones and a tonic (e.g. a D major scale is given as: 2, [0,2,4,5,7,9,11], or an A minor pentatonic scale as: 9, [0,3,5,7,10]).
Input chord: A list of MIDI pitch numbers (converted to a pc-set).
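The three inputs can be written down directly as plain data. The following Python sketch is illustrative only: the helper names are hypothetical, and the Epirus and atonal vectors are our reading of the prose in Section 2 (major seconds and minor sevenths added as consonant; all intervals consonant), not vectors printed in the paper.

```python
# Common-practice consonance vector, quoted from the text.
# Index k holds the consonance (1) or dissonance (0) of an interval of k semitones.
COMMON_PRACTICE = [1, 0, 0, 1, 1, 1, 0, 1, 1, 1, 0, 0]

# Assumed variants, derived from the prose description rather than given explicitly:
EPIRUS = [1, 0, 1, 1, 1, 1, 0, 1, 1, 1, 1, 0]  # major 2nd and minor 7th also consonant
ATONAL = [1] * 12                              # every interval treated as consonant


def to_pc_set(midi_pitches):
    """Reduce MIDI pitch numbers to a sorted set of pitch classes (0-11)."""
    return sorted({p % 12 for p in midi_pitches})


def is_consonant(pc_a, pc_b, vector):
    """Look up the interval between two pitch classes in a consonance vector."""
    return vector[(pc_b - pc_a) % 12] == 1


def scale_pcs(tonic, degrees):
    """Absolute pitch classes of a scale given as (tonic, relative scale tones)."""
    return sorted((tonic + d) % 12 for d in degrees)


print(to_pc_set([60, 62, 66, 69, 74]))        # [0, 2, 6, 9]
print(scale_pcs(2, [0, 2, 4, 5, 7, 9, 11]))   # D major: [1, 2, 4, 6, 7, 9, 11]
print(scale_pcs(9, [0, 3, 5, 7, 10]))         # A minor pentatonic: [0, 2, 4, 7, 9]
print(is_consonant(0, 2, COMMON_PRACTICE), is_consonant(0, 2, EPIRUS))  # False True
```

All three vectors used here are symmetric under interval inversion, so the order of the two pitch classes in `is_consonant` does not matter; an asymmetric vector would need an explicit convention.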
GCT Algorithm (core) - computational pseudocode
Input: (i) the pitch scale (tonality), (ii) a vector of the intervals considered consonant, (iii) the pitch class set (pc-set) of a note simultaneity
Output: the roots and types of the possible chords describing the simultaneity
1. find all maximal subsets of pairwise consonant tones
2. select maximal subsets of maximum length
3. for all selected maximal subsets do
4.   order the pitch classes of each maximal subset in the most compact form (chord ‘base’)
5.   add the remaining pitch classes (chord ‘extensions’) above the highest pitch class of the chosen maximal subset (if necessary, add an octave; pitches may exceed the octave range)
6.   the lowest tone of the chord is the ‘root’
7.   transpose the tones of the chord so that the lowest becomes 0
8.   find the position of the ‘root’ with regard to the given tonal centre (pitch scale)
9. endfor
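The nine steps above can be sketched in Python as follows. This is a minimal reading of the pseudocode, not the authors' implementation: it finds the maximal consonant subsets by brute force over combinations (adequate for at most 12 pitch classes), keeps every equally compact rotation so that ambiguous chords yield several encodings, and omits the additional disambiguation steps. The names `gct_core`, `max_consonant_subsets` and `compact_rotations` are hypothetical.

```python
from itertools import combinations

COMMON_PRACTICE = [1, 0, 0, 1, 1, 1, 0, 1, 1, 1, 0, 0]


def max_consonant_subsets(pcs, cv):
    """Steps 1-2: all largest subsets whose tones are pairwise consonant."""
    for size in range(len(pcs), 0, -1):
        found = [list(c) for c in combinations(pcs, size)
                 if all(cv[(b - a) % 12] for a, b in combinations(c, 2))]
        if found:
            return found
    return []


def compact_rotations(subset):
    """Step 4: ascending rotations with the narrowest range (ties are all kept)."""
    rots = [subset[i:] + [p + 12 for p in subset[:i]] for i in range(len(subset))]
    narrowest = min(r[-1] - r[0] for r in rots)
    return [r for r in rots if r[-1] - r[0] == narrowest]


def gct_core(pitches, tonic, cv):
    """Return a list of (relative root, base, extension) candidates."""
    pcs = sorted({p % 12 for p in pitches})
    results = []
    for subset in max_consonant_subsets(pcs, cv):
        for base in compact_rotations(subset):
            in_base = {p % 12 for p in base}
            extension = []
            for p in pcs:                      # step 5: stack the rest above the base
                if p not in in_base:
                    while p <= base[-1]:
                        p += 12
                    extension.append(p)
            root = base[0]                     # step 6
            results.append(((root - tonic) % 12,              # step 8
                            [b - root for b in base],         # step 7
                            [e - root for e in sorted(extension)]))
    return results


print(gct_core([60, 62, 66, 69, 74], 7, COMMON_PRACTICE))  # [(7, [0, 4, 7], [10])]
print(gct_core([0, 2, 5, 9], 0, COMMON_PRACTICE))          # two candidates
```

For the [0,2,5,9] pc-set the sketch returns both readings mentioned in the text, a major chord with sixth on pc 5 and a minor seventh chord on pc 2, which is the ambiguity the additional steps are meant to resolve.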
The GCT algorithm encodes most chord types ‘correctly’ in the standard tonal system. In example 1 of Table 1, the note simultaneity [C,D,F#,A] or [0,2,6,9] in a G major key is interpreted as [7,[0,4,7,10]], i.e. as a dominant seventh chord (see similar example in Section 3.3).
However, the algorithm is undecided in some cases, and even makes ‘mistakes’ in other cases. In most instances of multiple encodings, these should ideally be resolved by taking into account other harmonic factors (e.g., bass line, harmonic functions, tonal context, etc.). For instance, the algorithm gives two possible encodings for a [0,2,5,9] pc-set, namely minor seventh chord or major chord with sixth (see Table 1, example 2); such ambiguity may be resolved if tonal context is taken into account. For the [0,3,4,7] pc-set with root 0, the algorithm produces two answers, namely, a major chord with extension [0,[0,4,7,15]] and a minor chord with extension [0,[0,3,7,16]]; this ambiguity may be resolved if key context is taken into account: for instance, [0,4,7,15] would be selected in a C major or G major context and [0,3,7,16] in a C minor or F minor context. Symmetric chords, such as the augmented chord or the diminished seventh chord, are inherently ambiguous; the algorithm suggests multiple encodings which can be resolved only by taking into account the broader harmonic context (see Table 1, example 3). Since the aim of this algorithm is not to perform sophisticated harmonic analysis, but rather to find a practical and efficient encoding for tone simultaneities (to be used, for instance, in statistical learning and automatic harmonic generation; see end of Section 4), we decided to extend the algorithm so as to reach in every case a single chord type for each simultaneity (no ambiguity).

                         Example 1                     Example 3
Tonality - key           G: [7, [0,2,4,5,7,9,11]]      C: [0, [0,2,4,5,7,9,11]]
Cons. vector             [1,0,0,1,1,1,0,1,1,1,0,0]     [1,0,0,1,1,1,0,1,1,1,0,0]
Input                    [60, 62, 66, 69, 74]          [62, 68, 77, 71]
pc-set                   [0, 2, 6, 9]                  [2, 5, 8, 11]
Maximal subsets          [2, 6, 9]                     [2, 5], [5, 8], [8, 11], [2, 11]
Narrowest range          [2, 6, 9]                     [2, 5], [5, 8], [8, 11], [2, 11]
Add extensions           [2, 6, 9, 12]                 all rotations of [2,5,8,11]
Lowest is root           2 (note D)                    2, 5, 8, 11 (resp. for each rotation)
Chord in root position   [2, [0,4,7,10]]               [X, [0,3,6,9]], where X ∈ {2,5,8,11}
Relative to key          [7, [0,4,7,10]]               [X, [0,3,6,9]], where X ∈ {2,5,8,11}
Extra steps:
  Subset overlap
  Base in scale                                        [11, [0,3,6,9]]

Table 1. Examples of applying the GCT algorithm.
GCT Algorithm (additional steps) - for unique encoding
If more than one maximal subset exists:
Overlapping of maximal subsets: create a sequence of maximal subsets by ordering them so as to have maximal overlapping between them, and keep the maximal subset that appears first in the sequence (chord's base).
Chord base notes are scale notes: prefer the maximal subset that contains only pcs that appear in the given scale (tonal context), i.e. avoid non-scale notes in the chord base (this rule is rather arbitrary and is under consideration).
If neither of the above gives a unique solution, choose one encoding at random.
Additional adjustment: for dyads, in a tonal context, prefer perfect fifth over perfect fourth, and prefer seventh to second intervals.
The additional steps select chord type [2, [0,3,7,10]] in example 2 of Table 1 (maximal overlapping between two maximal subsets), and [11, [0,3,6,9]] in example 3 of Table 1 (the last pitch class is Ab, which is a non-scale degree in C major).
3.2 Formal description of the Core GCT Algorithm
The proposed algorithm for computing the GCT receives a simultaneity of pitches, which are transformed into pitch classes, and produces a chord type relative to a key, namely the root, the base and the extension, which together specify qualitative information about the chord that describes this simultaneity. A detailed description of the algorithm follows, based on an example input simultaneity. Suppose that the input set of notes results in the pc-set [0, 2, 6, 9], which could be described as a D major chord with minor seventh in the tonal music environment described by the consonance vector υ = [1, 0, 0, 1, 1, 1, 0, 1, 1, 1, 0, 0]. Therefore, the algorithm should produce an output in the form: [r, [b], [e]] = [2, [0, 4, 7], [10]].
By utilising the input pc-set and given a consonance vector that represents a selected music idiom, a binary matrix is constructed that is denoted as B. Each row and column of B represents a pitch class of the input chord, while a matrix entry is 1 or 0, signifying whether the pair of row and column pcs are consonant or dissonant respectively according to the current consonance vector. Strictly, if the consonance vector is denoted as υ and the input pc-set as p, then ∀ i, j ∈ {1, 2, ..., length(p)}

$$B_{i,j} = \upsilon\big[(p_i - p_j) \bmod 12\big] \qquad (1)$$

where υ[k] is the entry of the consonance vector for an interval of k semitones (k = 0 for the unison) and the function length(x) returns the length of vector x. The B matrix in the discussed example, where p = [0, 2, 6, 9], is the following:

$$B = \begin{pmatrix} 1 & 0 & 0 & 1 \\ 0 & 1 & 1 & 1 \\ 0 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 \end{pmatrix} \qquad (2)$$
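As a quick check of the matrix construction, the following Python sketch builds B for the running example by looking up each pairwise interval in the consonance vector. The function name is hypothetical, and the vector is indexed from 0 (unison), which is an assumption about the indexing convention.

```python
def consonance_matrix(p, v):
    """B[i][j] = 1 if pitch classes p[i] and p[j] form a consonant interval under v."""
    return [[v[(pi - pj) % 12] for pj in p] for pi in p]


v = [1, 0, 0, 1, 1, 1, 0, 1, 1, 1, 0, 0]   # common-practice consonance vector
p = [0, 2, 6, 9]                           # pc-set of the running example

for row in consonance_matrix(p, v):
    print(row)
# [1, 0, 0, 1]
# [0, 1, 1, 1]
# [0, 1, 1, 1]
# [1, 1, 1, 1]
```

The diagonal is 1 because the unison is consonant in this vector, and the matrix is symmetric because the vector is symmetric under interval inversion.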
Afterwards, a tree is constructed for each of the rows of B. The root node of each tree is the pitch class that corresponds to the respective row, while its branches from root to leaves include pitch classes that are pairwise consonant (according to υ). The construction of the tree that corresponds to the ith element of p is implemented by recursively traversing B in a depth-first search (DFS) fashion, beginning from the ith row and following the paths ‘circumscribed’ by the occurrences of units. Such a traversal is exhibited in Table 2 for the second row of the current example’s B matrix. This step’s outcome is a collection of trees, each of which corresponds to a row of B. The trees of the current example are shown in Table 3.
Table 2. The steps of the algorithm when scanning the path of the second row.
Table 3. All the trees for the current example. The maximal path is highlighted with boldface typesetting.
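The tree construction can be sketched as a recursive depth-first traversal of B. The Python below is one possible reading, under the assumption that a path is only extended with pitch classes lying to the right of the current one in p and consonant with every pitch class already on the path; the function name is hypothetical.

```python
def consonant_paths(p, B):
    """Collect all root-to-leaf paths of pairwise consonant pitch classes."""
    paths = []

    def extend(path):
        extended = False
        for j in range(path[-1] + 1, len(p)):
            # Follow a 'unit' of B only if it is consonant with the whole path so far.
            if all(B[i][j] for i in path):
                extend(path + [j])
                extended = True
        if not extended:                      # leaf reached: store the pitch classes
            paths.append([p[i] for i in path])

    for i in range(len(p)):                   # one tree per row of B
        extend([i])
    return paths


p = [0, 2, 6, 9]
B = [[1, 0, 0, 1],
     [0, 1, 1, 1],
     [0, 1, 1, 1],
     [1, 1, 1, 1]]

paths = consonant_paths(p, B)
longest = max(len(path) for path in paths)
print(paths)                                         # [[0, 9], [2, 6, 9], [2, 9], [6, 9], [9]]
print([path for path in paths if len(path) == longest])   # [[2, 6, 9]]
```

For the running example the only path of maximal length is [2, 6, 9], which agrees with the maximal path reported for Table 3.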
After the application of the above procedure, the paths from root to leaves with maximal length are kept either as the output chord candidates, or for further processing in the steps described in the remainder of this section. In the current example there is a single maximal path ([2, 6, 9]), which is highlighted with boldface typesetting (Table 3). After the longest path has been extracted, the pitch classes that constitute it are recombined in their most compact form, which in the current example is [2, 6, 9] (unaltered). The pitch class 0 of the initial [0, 2, 6, 9] pc-set is considered as an extension. Thereby, the simultaneity [0, 2, 6, 9] is circularly shifted to [2, 6, 9, 12], disregarding the fact that pitch classes can only take integer values between 0 and 11. In turn, [2, 6, 9, 12] is transformed to the following [r, [b], [e]] denotation: [2, [0, 4, 7], [10]]. This denotation clarifies that the simultaneity [0, 2, 6, 9] is actually a major chord (base [0, 4, 7]) with a minor seventh (extension [10]) and fundamental pitch class 2 (i.e. D7). As the tonal context is given as input, for instance the G major key, the absolute chord type [2, [0,4,7,10]] (i.e. D7 chord) is converted to a relative chord type, i.e. [7,[0,4,7,10]], which means dominant seventh in G major. This is equivalent to the roman numeral analytic types.
3.3 An example analysis with GCT
An example harmonic analysis of a Bach Chorale phrase illustrates the proposed GCT chord representation (Figure 2). For a tonal context, chord types are optimised such that pcs at the left-hand side of chords contain only consonant intervals (i.e. 3rds and 6ths, and perfect 4ths and 5ths). For instance, the major chord with minor seventh is written as [0,4,7,10], since the set [0,4,7] contains only consonant intervals whereas 10, which introduces dissonances, is placed on the right-hand side; this way the relationship between major chords and major chords with minor seventh remains rather transparent and is easily detectable. Within the given D major key context it is simple to determine the position of a chord type with respect to the tonic, e.g. [7,[0,4,7,10]] means a major chord with minor seventh whose root is 7 semitones above the tonic, amounting to a dominant seventh. This way we have an encoding that is analogous to the standard roman numeral encoding (Figure 2, top row). If the tonal context is changed to a chromatic scale context (arbitrary ‘tonic’ is 0, i.e. note C) and we consider all intervals equally ‘consonant’, we get the second GCT analysis in Figure 2, which amounts to normal orders (not prime forms) in a standard pc-set analysis. For tonal music this pc-set analysis is weak, as it misses out important tonal hierarchical relationships (notice that the relation of the dominant seventh chord type to the plain dominant chord is obscured). Note that the ‘roots’ relative to the ‘tonic’ 0 are preserved, as they can be used in harmonic generation tasks.
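The effect of the all-consonant setting can be illustrated with a small sketch. When every interval counts as consonant, the whole simultaneity is its own maximal subset, so the encoding reduces to choosing the most compact rotation. The Python below assumes exactly that simplification (it is not the full algorithm and breaks ties between equally compact rotations by taking the first one); the function name is hypothetical.

```python
def all_consonant_encoding(pitches, tonic=0):
    """Encoding when all intervals are 'consonant': most compact rotation only."""
    pcs = sorted({p % 12 for p in pitches})
    rotations = [pcs[i:] + [p + 12 for p in pcs[:i]] for i in range(len(pcs))]
    best = min(rotations, key=lambda r: r[-1] - r[0])   # narrowest range; first on ties
    root = best[0]
    return (root - tonic) % 12, [x - root for x in best]


d_major = [2, 6, 9]        # D, F#, A
d_dom7 = [2, 6, 9, 0]      # D, F#, A, C

print(all_consonant_encoding(d_major))   # (2, [0, 4, 7])
print(all_consonant_encoding(d_dom7))    # (6, [0, 3, 6, 8])
```

The triad keeps D as its ‘root’, but once the seventh is added the most compact ordering starts on F#, so the two labels no longer share a root or a common base. This is the obscured relation between the dominant and the dominant seventh noted in the text.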
Figure 2. Chord analysis of a Bach Chorale phrase by means of traditional roman numeral analysis, pc-sets and two versions of the GCT algorithm.
For practical reasons of space in the musical illustrations, the form [r,[b],[e]] is not preserved: the base and extension are concatenated and brackets are omitted. For instance, [7,[0,4,7],[10]] may be depicted as 7,[0,4,7,10] or even as 7.04710.
4. HARMONIC ENCODING & ANALYSIS WITH THE GCT
The GCT algorithm has been applied to tonal extracts
from standard tonal pieces, such as Bach Chorales, but
additionally it has been tested out on harmonic structures
from diverse harmonic idioms. Some examples are pre-
sented below to give an idea of the potential of the GCT
representation. Strong points of the encoding are given
along with weaknesses. Some aspects of the analysis are
difficult to judge in some idioms and further study is
required.
4.1 GCT Encoding Examples
In common-practice tonal music, GCT works very
well. Mistakes are sometimes made in case of symmetric
chords such as the diminished seventh chord or the aug-
mented triad. In the case of the half diminished seventh
chord GCT ‘prefers’ to label it as a minor chord with
added sixth instead of a diminished chord with minor
seventh. Chords that include chromatic notes such as the
German sixth, Italian sixth, Neapolitan sixth are encoded
consistently even though not necessarily coinciding with
analytic interpretations by theorists (the French sixth is
more tricky as it is a symmetric chord and GCT finds two
equally prominent ‘roots’).
Below, a number of examples are presented that illustrate the application of the GCT algorithm on diverse harmonic textures. The first example (Figure 3) is taken from the first measures of Beethoven’s Moonlight Sonata. In this example, GCT encodes classical harmony in a straightforward manner. All instances of the tonic chord (i.e., C# minor), inverted or not, are tagged as 0,[0,3,7], and [10] is added when the 7th is present; the dominant seventh is 7,[0,4,7,10] and it appears once without the fifth [7]; the fourth chord is a Neapolitan sixth and it is encoded as 1,[0,4,7], which means a major chord on the lowered second degree (D major chord in the C# minor key).
Figure 3. Beethoven, Sonata 14, op. 27-2 (reduction of
first five measures). Top row: roman numeral harmonic
analysis; bottom row: GCT analysis. GCT successfully
encodes all chords, including the Neapolitan sixth chord
(fourth chord).
In the example of Figure 4 a tonal chord progression by
G. Gershwin is presented. Chromaticism is apparent in
this passage. The GCT ‘agrees’ with the roman numeral
analysis of the excerpt including the Italian sixth chord
that is labelled as 8,[0,4,10], and it even labels the chord
that was left without a roman numeral tag by the analyst
(see question mark) encoding it as a minor chord with
sixth on the flattened sixth degree (Gb-Bbb-Db-Eb)
(Note: actually it could be even encoded as a half-
diminished 7th on the fourth degree Eb-Gb-Bbb-Db).
Figure 4. G. Gershwin, Rhapsody in Blue (reduction of
first five measures). Top row: roman numeral harmonic
analysis; bottom row: GCT analysis. GCT successfully
identifies all chords (see text).
Figure 5 illustrates an Early Renaissance example of
fauxbourdon by G. Dufay. Parallel motion of voices is
typical in this idiom. The GCT labels correctly all dyads
and triads, taking into account musica ficta that produces
rather unusual chord progressions in regards to standard
tonal harmony.
Figure 5. G. Dufay’s Kyrie (reduction) - first phrase in A phrygian mode that exemplifies parallel motion in fauxbourdon and a phrygian cadence (early Renaissance). GCT correctly identifies and labels the open fifths as well as the triadic chords.
In Figure 6 an example from the polyphonic singing tradition of Epirus is presented. This very old 2-voice to 4-voice polyphonic singing tradition is based on the anhemitonic pentatonic pitch collection, and more specifically the pentatonic minor scale, which functions as the source for both the melodic and harmonic content of the music. A unique harmonic aspect of these songs is the unresolved dissonances (major second and minor seventh intervals) at structurally stable positions of the pieces (e.g. cadences). In the example two GCT versions are presented: the first (top row) depicts the encoding for the standard consonance vector and the second (bottom row) presents the GCT labelling that additionally considers major seconds and minor sevenths as ‘consonant’ (it is the same as for the ‘atonal’ consonance vector, as no minor seconds and major sevenths exist in the idiom). It is interesting to note that for the standard consonance vector almost all chords have the drone tone as their root. On the other hand, in the second encoding different relations between chords become apparent (e.g. 10,[0,2,5] and 10,[0,2,5,7]) and an oscillation of the chord ‘root’ between the tonic and a note a tone lower is also highlighted. Polyphonic songs from Epirus are the focus of a different study [19].
Figure 6. Excerpt from a traditional polyphonic song from Epirus. Top row: GCT encoding for standard common-practice consonance vector; bottom row: GCT encoding for atonal harmony, with all intervals ‘consonant’ (this amounts to pc-set ‘normal orders’).
4.2 Learning and generation with GCT
In a current study, the GCT representation has been uti-
lised in automatically analysing and encoding scores (ac-
tually, harmonic reductions of scores) from diverse idi-
oms, and then employing this extracted information for
melodic harmonisation. In [20] the authors discuss the
utilisation of a well-studied probabilistic methodology,
namely, the hidden Markov model (HMM) methodology,
in combination with constraints that incorporate fixed
beginning and ending chords and intermediate anchor
chords. To this end, a constrained HMM (CHMM) is
developed, which allows the manual insertion of interme-
diate chords, providing alternative harmonisations that
comply with specific constraints.
The reported results indicate that the CHMM method,
combined with the novel General Chord Type (GCT)
representation, produces convincing melodic
harmonisations in diverse idioms. In Figures 7 & 8,
two examples of melodic harmonisation are illustrated for
a Bach chorale melody and for a traditional melody from
Epirus. In both cases, the system has been trained on a
corpus of harmonic reductions of pieces in the idiom,
and, then, used to generate new melodic harmonisations.
The results are very good: the Bach chorale harmonisa-
tion is typical of the style and at the same time not trivial
(uses secondary dominants that enrich the harmonisa-
tion); the Epirus melody harmonisation is close to the
style of polyphonic singing (if additional melodic and
rhythmic elements were added the phrase would become
rather typical of the idiom).
Figure 7. Automatically generated GCTs for a Bach
chorale melody employing an HMM for fixed bounda-
ries (first and last chords are given). Voice leading has
been arranged manually.
Figure 8. Automatically generated GCTs for an Epirus
melody (reduced version) employing an HMM for fixed
boundaries. Voice leading has been arranged manually.
4.3 Discussion and future development
The current version of GCT encodes only the chord type
and the relative position of its ‘root’ to the local tonic of a
given scale. However, it can readily be extended to in-
corporate explicit information on chord inversions (i.e.
bass note position), on scale degrees (chromatic notes
that do not belong to the current scale can be tagged so
that indirectly scale degrees are indicated), and, even, on
voice-leading (for instance, motion of bass, or even for
note extensions that may require resolution by down-
wards step-wise motion). A rich chord representation
should embody such information.
The organisation of tones by GCT for the ‘standard’
consonance vector gives results quite close to those pro-
duced by the stack-of-thirds technique, as implicit in the
latter is consonance of thirds and fifths (as two thirds sum
up to a fifth). Some differences are:
• The stack-of-thirds approach usually requires
traditional note names (that allow enharmonic
spellings), whereas GCT is based on pitch classes
(no direct explicit link to a scale). For instance, GCT
considers the chord CEG# or CEAb ([0,4,8]) as
consonant, since its intervals are pairwise consonant¹
(two 4-semitone intervals, i.e. major thirds, and one
8-semitone interval, i.e. a minor sixth or augmented
fifth), with any one of the three tones as root;
stack-of-thirds determines C as the root in the first
case and Ab in the second case. The GCT algorithm
misses out on sophisticated tonal scale information
but remains informative, while being simpler and
easier to implement.
• In the standard consonance vector version of GCT,
diminished fifths are not allowed, whereas in the
stack-of-thirds approach all fifths are allowed. For
instance, the root of the half-diminished chord BDFA
is B according to the stack-of-thirds, whereas GCT
considers D as the root and B as a sixth above the root
(DFAB), i.e. diminished triads are not consonant
chords according to GCT. Of course, the consonance
vector in GCT may be altered so that the tritone is
also consonant, in which case the two approaches are
closer.
• The stack-of-thirds method allows empty third
positions in the lower part of the stack, whereas GCT
always prefers to have a compact consonant set of
pitches at the bottom. For instance, a chord
comprising the notes CEFG ([0,4,5,7]) will be
arranged as FCEG by the stack-of-thirds technique
and CEGF ([0,4,7,17]) by GCT.
¹ Question: why is the augmented triad considered dissonant
when all its tones are pairwise consonant?
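The three cases above can be reproduced with a simplified sketch of the base-finding step. It is an illustration under assumptions, not the authors' implementation: the function names, the values of the STANDARD vector and the rule of keeping every most compact rotation as a candidate are choices made here, and scale-based tie-breaking is omitted.

```python
from itertools import combinations

# Assumed standard consonance vector, indexed by interval in semitones (0-11).
STANDARD = [1, 0, 0, 1, 1, 1, 0, 1, 1, 1, 0, 0]


def is_consonant(pcs, vector):
    """True if every pair of pitch classes forms a consonant interval."""
    return all(vector[(b - a) % 12] and vector[(a - b) % 12]
               for a, b in combinations(pcs, 2))


def gct_candidates(notes, vector=STANDARD, tonic=0):
    """Return (root relative to tonic, chord type) candidates for a simultaneity."""
    pcs = sorted({n % 12 for n in notes})
    bases = []
    # Largest subsets whose intervals are all pairwise consonant.
    for size in range(len(pcs), 0, -1):
        bases = [c for c in combinations(pcs, size) if is_consonant(c, vector)]
        if bases:
            break
    candidates = []
    for base in bases:
        # Each rotation of the base as intervals above its lowest note.
        rotations = [(root, sorted((p - root) % 12 for p in base)) for root in base]
        narrowest = min(intervals[-1] for _, intervals in rotations)
        for root, intervals in rotations:
            if intervals[-1] != narrowest:
                continue  # keep only the most compact arrangements
            # Remaining notes become extensions placed above the base.
            ext = [(p - root) % 12 for p in pcs if p not in base]
            ext = sorted(e if e > intervals[-1] else e + 12 for e in ext)
            candidates.append(((root - tonic) % 12, intervals + ext))
    return candidates


print(gct_candidates([0, 4, 8]))      # augmented triad C E G#
print(gct_candidates([11, 2, 5, 9]))  # half-diminished B D F A
print(gct_candidates([0, 4, 5, 7]))   # C E F G
```

In this sketch the augmented triad yields three equally compact candidates (one per tone), the half-diminished chord yields root D with type [0,3,7,9], and CEFG yields root C with type [0,4,7,17], in agreement with the examples in the list.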
In relation to the virtual pitch root finding method, the
proposed approach differs in that minor thirds are equally
consonant to major thirds allowing equal treatment of
major and minor chord (as opposed to the virtual pitch
approach that is biased towards major thirds due to the
structure of the harmonic series).
It is also possible to redesign the GCT algorithm alto-
gether so as to make use of non-binary conso-
nance/dissonance values allowing thus a more refined
consonance vector. Instead of filling in the consonance
vector with 0s and 1s, it can be filled with fractional
values that reflect degrees of consonance derived from
perceptual experiments (e.g., [21]) or values that reflect
culturally-specific preferences. Such graded values may
improve the algorithm’s performance and resolve
ambiguities in certain cases (future work).
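A graded consonance vector of this kind might be sketched as follows. The numerical values in GRADED are hypothetical placeholders chosen only for illustration (they are not taken from [21]), and the helpers to_binary and mean_consonance are assumptions about how such a vector could be used.

```python
from itertools import combinations

# Hypothetical graded consonance values per interval (0-11 semitones);
# placeholder numbers, NOT measured data.
GRADED = [1.0, 0.0, 0.2, 0.7, 0.8, 0.9, 0.1, 0.9, 0.8, 0.7, 0.2, 0.0]


def to_binary(graded, threshold=0.5):
    """Collapse graded values to a 0/1 consonance vector at a given threshold."""
    return [1 if g >= threshold else 0 for g in graded]


def mean_consonance(pcs, graded=GRADED):
    """Average graded consonance over all pairs of distinct pitch classes."""
    pairs = list(combinations(sorted({p % 12 for p in pcs}), 2))
    if not pairs:
        return graded[0]  # a single tone is treated as a unison
    return sum(graded[(b - a) % 12] for a, b in pairs) / len(pairs)


print(to_binary(GRADED))
print(mean_consonance([0, 4, 7]), mean_consonance([0, 3, 6]))
```

With these placeholder values, thresholding at 0.5 recovers a binary vector in which thirds, perfect fourths and fifths, and sixths are consonant, while the graded score ranks a major triad above a diminished triad, which is the kind of finer distinction that could help rank competing candidate bases.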
5. CONCLUSIONS
In this paper a new representation of chord types has been
presented that adapts to diverse harmonic idioms allow-
ing the analysis and labelling of tone simultaneities in
any harmonic context. The General Chord Type (GCT)
representation allows the re-arrangement of the notes of
a harmonic simultaneity such that idiom-specific types of
chords may be derived. Given a consonance/dissonance
classification of intervals (that reflects culturally-
dependent notions of consonance/dissonance), and a (set
of) scales, the GCT algorithm finds the maximal subset of
notes of a given note simultaneity that contains only con-
sonant intervals; this maximal subset forms the basis up-
on which the chord type is built. The proposed represen-
tation is ideal for hierarchic harmonic systems such as the
tonal system and its many variations, but adjusts to any
other harmonic system such as post-tonal, atonal music,
or traditional polyphonic systems.
The GCT representation was applied to a small set of
examples from diverse musical idioms, and its output was
presented and analysed showing its potential use, espe-
cially, for computational music analysis and music in-
formation retrieval tasks. The encoding provided by GCT
is not always correct according to the interpretation given
by music theorists, but, at least, it is consistent (i.e. a cer-
tain chord will always be encoded the same way) render-
ing it adequate for machine learning and generation (e.g.
melodic harmonisation) where music theoretical correct-
ness is not so important. Sometimes GCT ‘uncovers’
chordal relations that are obscured by notation and en-
harmonic spellings, and may assist a musician in harmon-
ic analysis. Overall, the proposed encoding seems to be
promising and potentially useful in computational music
applications.
Acknowledgments
The project COINVENT acknowledges the financial sup-
port of the Future and Emerging Technologies (FET)
programme within the Seventh Framework Programme
for Research of the European Commission, under FET-
Open grant number: 611553. Special thanks are due to
Andreas Katsiavalos for preparing the harmonic dataset
that has been used in the music examples in the paper.
6. REFERENCES
[1] Laitz, S. G. (2008). The complete musician: An in-
tegrated approach to tonal theory, analysis, and lis-
tening (Vol. 1). Oxford University Press, USA.
[2] Forte, A. (1973). The structure of atonal music. Yale
University Press.
[3] M. Schorlemmer, A. Smaill, K.-U. Kühnberger, O.
Kutz, S. Colton, E. Cambouropoulos, and A. Pease:
COINVENT: Towards a Computational Concept In-
vention Theory, 5th International Conference on
Computational Creativity (ICCC) 2014, Ljubljana,
Slovenia, June 2014.
[4] Ebcioglu, K. (1988) An expert system for harmoniz-
ing four part chorales, Computer Music Journal, vol.
12, no. 3, pp. 43-51.
[5] Pachet F. and Roy, P. (2001) Musical harmonization
with constraints: A survey. Constraints, vol. 6, no. 1,
pp. 7-19, Jan. 2001.
[6] Phon-amnuaisuk S. and Wiggins, G.A. (1999) The
four-part harmonisation problem: A comparison
between genetic algorithms and a rule-based system,
in Proceedings of the AISB99 Symposium on Musical
Creativity. AISB, 1999, pp. 28-34.
[7] Donnelly P. and Sheppard J. (2011) Evolving four-
part harmony using genetic algorithms, in Proceed-
ings of the 2011 International Conference on Appli-
cations of Evolutionary Computation - Volume Part
II, ser. EvoApplications’11. Berlin, Heidelberg:
Springer-Verlag, 2011, pp. 273-282.
[8] Paiement, J.-F., Eck, D. and Bengio, S. (2006) Prob-
abilistic melodic harmonization, in Proceedings of
the 19th International Conference on Advances in
Artificial Intelligence: Canadian Society for Compu-
tational Studies of Intelligence, ser. AI’06. Berlin,
Heidelberg: Springer-Verlag, pp. 218-229.
[9] Simon, I. Morris, D. and Basu, S. (2008) Mysong:
Automatic accompaniment generation for vocal
melodies, in Proceedings of the SIGCHI Conference
on Human Factors in Computing Systems, ser. CHI
’08. New York, NY, USA: ACM, pp. 725-734.
[10] Hild, H. Feulner, J. and Menzel, W. (1991) HAR-
MONET: A neural net for harmonizing chorales in
the style of J. S. Bach. J. E. Moody, S. J. Hanson,
and R. Lippmann, Eds. Morgan Kaufmann, 1991,
pp. 267-274.
[11] Granroth-Wilding, M.T. (2013) Harmonic analysis
of music using combinatory categorial grammar,
Ph.D. dissertation, Institute for Language, Cognition
and Computation School of Informatics University
of Edinburgh, Edinburgh, Scotland, Nov. 2013.
[12] Chuan C.-H. and Chew E. (2007) A hybrid system
for automatic generation of style-specific accompa-
niment, in Proceedings of the 4th International Joint
Workshop on Computational Creativity. Goldsmiths,
University of London.
[13] Harte, C., Sandler, M. B., Abdallah, S. A., &
Gómez, E. (2005). Symbolic Representation of Mu-
sical Chords: A Proposed Syntax for Text Annota-
tions. In ISMIR (pp. 66-71).
[14] Temperley, D. (2001). The cognition of basic musi-
cal structures. MIT press.
[15] Sapp, C. S. (2007). Computational Chord-Root Iden-
tification in Symbolic Musical Data: Rationale,
Methods, and Applications. Computing in Musicolo-
gy, 15, pp. 99-119.
[16] Terhardt, E. (1974). Pitch, consonance and harmony.
Journal of the Acoustical Society of America, 55, pp.
1061-1069.
[17] Parncutt, R. (1989). Harmony: A psychoacoustical
approach. Springer-Verlag Publishing.
[18] Oxenham, A.J. (2013) The Perception of Musical
Tones. In The Psychology of Music. Deutsch, D.
(Ed.). Academic Press.
[19] M. Kaliakatsos-Papakostas, A. Katsiavalos, C.
Tsougras, and E. Cambouropoulos: Harmony in the
Polyphonic Songs of Epirus: Representation, Statis-
tical Analysis and Generation, 4th International
Workshop on Folk Music Analysis 2014, Istanbul,
Turkey, June 2014.
[20] Kaliakatsos-Papakostas, M. & Cambouropoulos, E.
(2014). Probabilistic harmonisation with fixed
intermediate chord constraints. In Proceedings of the
Joint 11th Sound and Music Computing Conference
(SMC) and 40th International Computer Music Con-
ference (ICMC), (To appear), Athens, Greece.
[21] William Hutchinson & Leon Knopoff (1978) The
acoustic component of western consonance, Inter-
face, 7:1, pp. 1-29.
Article
This paper reports on a rule-based expert system called CHORAL for harmonization and Schenkerian analysis of chorales in the style of J. S. Bach. The author first briefly compares his approach with some current trends in algorithmic composition and music analysis, and then describes the CHORAL system itself.
Article
A formalism is developed for providing a measure of dissonance in a superposition of complex tones. The formalism is based on an extension of the Helmholtz-Plomp and Levelt model of beating as the cause of dissonance. For dyads this measure of dissonance gives a good fit to psychological rank orderings of dissonance and its absence (consonance), and to orderings of consonance and dissonance found in Western common practice and pedagogy. A logarithmic scale for the perception of consonance and dissonance is indicated.
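The idea of summing beating-based roughness over all pairs of partials can be sketched as follows. This is not the formalism of the cited article: it swaps in a commonly used closed-form approximation of the Plomp and Levelt curve (the parameterization usually attributed to Sethares), and the number of partials, the amplitude roll-off and all function names are assumptions chosen only for illustration.

```python
import math

def pair_roughness(f1, a1, f2, a2):
    """Roughness of two pure tones (Hz, linear amplitude).

    Uses a closed-form approximation of the Plomp-Levelt curve; the
    constants are those of the Sethares parameterization, not values
    taken from the cited article.
    """
    f_lo, f_hi = min(f1, f2), max(f1, f2)
    s = 0.24 / (0.021 * f_lo + 19.0)   # scales the curve with register
    x = s * (f_hi - f_lo)
    return a1 * a2 * (math.exp(-3.5 * x) - math.exp(-5.75 * x))

def harmonic_tone(f0, n_partials=6, rolloff=0.88):
    """Hypothetical complex tone: harmonic partials with decaying amplitude."""
    return [(f0 * k, rolloff ** (k - 1)) for k in range(1, n_partials + 1)]

def dissonance(partials):
    """Sum pairwise roughness over every pair of partials."""
    total = 0.0
    for i in range(len(partials)):
        for j in range(i + 1, len(partials)):
            total += pair_roughness(*partials[i], *partials[j])
    return total

def dyad_dissonance(f0, semitones):
    """Dissonance of an equal-tempered dyad built on f0."""
    upper = f0 * 2 ** (semitones / 12)
    return dissonance(harmonic_tone(f0) + harmonic_tone(upper))
```

Under these assumptions, comparing dyad_dissonance(440, 1) with dyad_dissonance(440, 7) ranks the minor second as rougher than the perfect fifth, in line with the qualitative ordering the abstract describes.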