ArticlePDF Available

Abstract and Figures

The application of Burnaby's similarity index is discussed by using structural data from mediterranean vegetation. The index, suggested to compare objects described by characters measured on different scales (mixed data), was applied in a fuzzy theory context. Ordinations of vegetation relevés and structural characters by joint plots have been obtained. These are very useful to map vegetation structural diversity in multidimensional spaces and to test the efficiency of an intuitive classification based on qualitative assessment.
Content may be subject to copyright.
Plant Ecology 138: 77–87, 1998.
© 1998 Kluwer Academic Publishers. Printed in the Netherlands.
77
Analysis of vegetation structural diversity by Burnaby’s similarity index
L. Carranza, E. Feoli & P. Ganis
Department of Biology, University of Trieste, I-34127 Trieste, Italy
Received 12 December 1997; accepted in revised form 6 May 1998
Key words: Character state, Classification, Eigenvectors, Fuzzy ordination, Mixed data, Probability, Weight
Abstract
The application of Burnaby’s similarity index is discussed by using structural data from mediterranean vegetation.
The index, suggested to compare objects described by characters measured on different scales (mixed data), was
applied in a fuzzy theory context. Ordinations of vegetation relevés and structural characters by joint plots have
been obtained. These are very useful to map vegetation structural diversity in multidimensional spaces and to test
the efficiency of an intuitive classification based on qualitative assessment.
Introduction
Vegetation is a complex system whose components
(plant species) are difficult to measure. In vegetation
science data are collected in different ways accord-
ing to the aim of the research (Mueller Dombois &
Ellenberg 1974), in the majority of the cases only
nominal and ordinal scales are used. Complexity has
several definitions and implications in vegetation sci-
ences (Anand 1994; Anand & Orlóci 1996). However,
Feoli & Zuccarello (1994) prefer to associate the con-
cept of complexity with the difficulty to measure the
vegetation states: a sytem is complex when at least
some of its state variables are practically unmeasur-
able by any kind of instrument or, if measures are
done, they are very imprecise. As a consequence of
vegetation complexity the description of vegetation
structure is often only qualitative. Quantitative struc-
tural description of vegetation can be done indirectly,
as was suggested by Feoli (1984), on the basis of
multiplication of the matrix relevés-species by the
matrix species-structural characters (or textural char-
acters accordingBarkman1979, 1988). In this case the
resulting matrix (relevés-structural characters) gives
for each character and rele a numerical weight. This
is the number of species, in case of binary data, or the
total cover of species, in case of cover data presenting
the character in each relevé. Feoli et al. (1985) and
De Patta Pillar & Orlóci (1991, 1993) show how the
structural characters can be arranged in hierarchical
order and how vegetation entities (relevés or synthetic
phytosociologicaltables) can be numerically analysed
in a hierarchicalprocess. The disadvantageofthis kind
of descriptionis that it does not take into consideration
the size of the plants and their vertical arrangement in
the strata. The proposal of Orlóci & Orlóci (1985) to
describe directly the vegetation structure by character
set types may have the same disadvantage and it looks
more difficult to be carried out in the field. Arrigoni
(1996a) tries to offer a standard that can be more eas-
ily followed by phytosociologists and plant ecologists
interested to collect data for mapping physiognomic
types. The method suggested by Arrigoni will not be
discussed here. What is relevant for this paper is the
type of data that it generates. This type of data is very
common when dealing with complex systems, namely
data originated by measuring the different characters
by different scales (nominal, ordinal, continuous that
is ‘mixed data’). In Arrigoni’s approach each relevé
is a vector describing the vegetation by strata. Each
stratum is described by its percentage cover value, its
average height in meters, and by the presence of the
structural (textural) character characterising it.
There are many similarity indices that can be
used to compare structural descriptions of vegetation
(Sneath & Sokal 1973; Orlóci 1978; Dale 1988; Po-
dani 1995), however only few have been proposed for
mixed data. These may be grouped into two types:
78
those that keep into consideration the association be-
tween characters and those that do not. The similarity
indices of Goodall (1964, 1966, 1993) and Gower
(1971) are of the second type. The most widely used
indexthat keeps into consideration the association pat-
tern between characters is the Mahalanobis distance
(Orlóci 1978), however it applies directly to continu-
ous data only. One similarity index accepts mixed data
is COCHIS (Feoli & Lagonegro, 1983). According
to COCHIS, variables are divided into two sets, one
which contributes to similarity and the other which
contributes to dissimilarity. For each set an index (S
1
and S
2
) is constructed by summing the probabilities
of character association. This is calculated by chi
square test or correlation coefficient depending on the
variable scale. By these two indices (S
1
,S
2
)asim-
ilarity between two relevés (S
jk
) may be computed
in many ways, COCHIS uses S
jk
= S
1
/(S
1
+ S
2
).
In updating COCHIS, to make it more efficient, we
reviewed again the literature in Gower (1971). There
we discovered that an index based on the same idea
of COCHIS has been already proposed by Burnaby
(1970) in a geological context. However, in Burnaby’s
index each character is weighted not only on the basis
of its association pattern with the other characters, as
in COCHIS, but also on its information content. In
the present paper Burnaby’s index is discussed as an
idea to base comparative analysis of vegetation struc-
ture rather than as a strict technical tool. The index is
used to produce ordination diagrams to map the struc-
tural diversity of vegetation based on fuzzy set theory
(Zimmerman 1984; Feoli & Zuccarello 1986; Roberts
1986; Marsili-Libelli 1989). The performance of the
index is discussed on the basis of its application to
data from Arrigoni (1996a). We use Arrigoni’s data
since they give a clear example of a structural descrip-
tion of relevés based on a critical revision of plant
growth form classifications (Arrigoni 1996b). A com-
puter program is offered to apply the Burnaby index
(P. Ganis, BURNY-COCHIS).
Burnaby’s similarity index
As COCHIS, Burnaby’s idea opens many alternatives.
We focus only on two, the one suggested by Burnaby
(1970), giving weight to independent characters and
the other implemented by COCHIS giving weight to
associated characters.
Given N objects described by M characters mea-
sured on different scales, Burnaby’s similarity index
between two objects j and k is defined by the following
formula:
K
v
jk
=
M
P
i=1
w
i
v
i
jk
I
i
jk
M
P
i=1
w
i
I
i
jk
, (1)
where w
i
is the weight assigned to the ith character
based on its independencewith all the other characters.
It is defined as:
w
i
=
N
2
N
2
+
M
P
h6=i
2
hi
)
2
, (2a)
where χ
2
hi
is the chi square statistic computed for the
M-1 pairs of characters. w
i
ranges between 1/M for
complete association to 1 for complete independence.
For weighting association the following formula is
used:
W
i
=
1
Mw
i
, (2b)
v
i
jk
assumes differentvalues accordingto the scale
of the ith examined character. For nominal scale v
i
jk
is
1 if the character states agree, 0 if not. For ordinal and
continuous scale, v
i
jk
is calculated according to:
v
i
jk
= 1
x
0
ij
x
0
ik
x
0
max
x
0
min
!
2
, (3)
where x
0
ij
, x
0
ik
, x
0
max
and x
0
min
are quantile class marks
(1, 2, 3, 4, 5 if quintiles are used, as suggested by
Burnaby), all calculated using ranks of the character
values.
I
i
jk
is the information weight of the states of the ith
character in the jth and kth rele calculated according
to the following formula:
I
i
jk
=−
logp(x
ij
) + logp(x
ik
)
, (4)
where p(x
ij
) and p(x
ik
) are respectively the proba-
bilities to find the character i in the states of j and
k relevés. I
i
jk
is the information carried out by the
probability of finding the two states of character i in
the couple of relevés under comparison. For ordinal
and continuous variables there is no weight since the
probabilities of quantiles are equal.
Burnaby suggests to compute w
x
i
always on 2 ×
2 contingency tables independently of the character
scale of measure. To achieve this, the qualitative
79
characters with m states are transformed into m di-
chotomouscharactersdescribingthe presence-absence
of each character state. Ordinal and continuous scales
are transformed in quintiles and the chi square statis-
tic is computed on 2 × 2 contingency tables obtained
by removing the median classes and by grouping the
remaining cells by tetrads. This suggestion has been
strongly criticised by Gower (1970, 1971). However,
several trials carried out by us have demonstrated that,
when compared to other ways of obtaining contin-
gencytables from continuousvariables, it gives results
closest to the product moment correlation coefficient.
The use of quantiles instead of other types of classes
(i.e., the classical equal rangefrequency classes) guar-
anteesthe equidistributionof the objectsin the classes,
avoiding empty classes (i.e., in the case of bi- or multi-
modal distributions) or classes with few elements (i.e.,
in the tails of the normal distributions).
Data and methods
The data from Arrigoni (1996a) are given in Table 1.
They correspond to 23 relevés described by 40 char-
acters, 26 of them are nominal with two states (binary
characters), 4 are nominal with more than 2 states and
10 are continuous (height of plants and percentage of
cover of the strata). The data have been collected on
the calcareous mountain of Sardinia between the sea
level to 1400 m above sea level.
The 23 relevés are classified by Arrigoni into 4
main plant formations of Mediterranean vegetation,
namely evergreen forest (relevés 1 to 6), evergreen
maquis (relevés 7 to 11), garrigue (relevés 12 to 18)
and a mixed formation on rocks (relevés 19 to 23).
Burnaby’s index is compared to Gower’s and
Goodall’s index. Two options are considered: one
giving weight to character association and the other
giving weight to character independence. Ordination
axes have been obtained by averaging the similar-
ity values of relevés within the four plant formations
of the Arrigoni’s classification. According to Feoli
& Zuccarello (1986) and Zhao (1986) (see Marsili-
Libelli 1989) these average values are the degrees
of belonging of the relevés to the fuzzy sets corre-
sponding to the plant formations. In this case the axes
represent independent fuzzy sets and not a fuzzy par-
tition. Ordination axes have been also obtained by the
eigenvectors of the similarity matrices calculated by
Burnaby’s, Gower’s and Goodall’s indices. The ordi-
nations are non-centred (Noy-Meir 1973; Feoli 1977)
since they use similarity matrices with values ranging
between0 and 1. The first eigenvectoris alwaysunipo-
lar, other unipolar eigenvectors are obtained if the
data matrix presents disjoint submatrices. Noy-Meir
(1973) and Feoli (1977) show the advantage of us-
ing non-centred ordination for interpreting clusters by
eigenvectors. According to an algebraic theorem, al-
ready presented in ecological context by Feoli (1977),
each disjoint submatrix of a similarity matrix has its
independent set of eigenvalues and eigenvectors. The
magnitude of eigenvaluesdepends on the combination
between the size of the submatrix and its average sim-
ilarity. From this it follows that if a similarity matrix
has submatrices ‘tendencially’ disjoint the elements of
the submatrices with highest dimension and/or highest
average similarity, have higher scores in the corre-
sponding eigenvectors. This allows to interpret the
eigenvectors in terms of clusters of elements corre-
sponding to the submatrices, i.e. to characterise the
eigenvectors by clusters of elements. Thus, the co-
efficient of correlation calculated between the fuzzy
axes and the eigenvectors of the similarity matrices
gives a measure of the correspondence between the
eigenvectors and the clusters of elements.
The Jancey’s relocation method (Anderberg1973),
by using the 4 fuzzy axes and the first eigenvectors of
the similarity matrices capable to represent the clusters
of relevés, was applied to test the best classification in
terms of separation between the clusters suggested by
Arrigoni’s classification. The within sum of squares is
used as the optimality criterion: the lower the sum of
squares the higher the separation between clusters.
Joint plots have been obtained by binarising all the
qualitative data and by the multiplication of the matrix
so obtained by the matrix of fuzzy sets and by the ma-
trix of eigenvectors. The matrix multiplication is done
in such a way that the scores of each character state
and/or each continuous variable in the ordination axes
are the weighted averages according to the following
formula:
S
ih
=
N
P
j=1
x
ij
b
jh
N
P
j=1
x
ij
, (5)
where x
ij
is the score in the data matrix, b
jh
is the
score of the jth rele in the h fuzzy set or eigenvector.
This multiplication is done in analogywith the method
of reciprocal averaging (Orlóci 1978; ter Braak 1995).
However the use of one of the similarity indices for
80
Table 1. Table of 23 relev
´
es described by 40 structural characters from Arrigoni 1996a. Symbols: Char. type = character type: B.binary,Q. qualitative, C. continuous. Codes for strata:
S1 (0–0.5 m), S2 (0.5–2 m), S3 (2–5 m), S4 (5–12 m), S5 (12–25 m). State of qualitative characters: Herbaceous: 1. Polymorphous 2. Tuberous 3. Graminoid 4. Succulent (crassulent) 5.
Caulescent Life cycle: 1. Perennial 2. Annual 3. Annual and perennial Vegetative cycle: 1. Vernal 2. Latevernal 3. Estival 4. Mixed Leaf consistency: 1. Sclerophyllic 2. Laurel-leaf 3.
Other Leaf cycle: 1. Evergreen 2. Deciduous and semideciduous.
81
Figure 1. Ordination of the 23 relev
´
es in Table 1 according to the fuzzy sets (f.s.) corresponding to evergreen forest and vegetation on rocks,
based on Burnaby similarity index weighting character independence.
Table 2. Evaluation of the efficiency of the similarity indices in producing clusters of relev
´
es by means of sum of squares from centroids;Weight
ind.: weight based on character independence, Weight ass.: weight based on character association (see text).
mixed data (Burnaby’s, Gower’s and Goodall’s in-
dices) has the advantage that the continuous variables
may be left untransformed. The structural diversity of
vegetation is evaluated by the number of qualitative
structural characters in the relevés and by the number
of strata.
Results
Table 2 shows the efficiency of the Burnaby’s index
with respect to Gower’sand Goodall’s indices in terms
of sum of squares calculated directly and calculated
by the Jancey’s relocation method. This table shows
that the Burnaby’s index gives rise to more efficient
classifications especially if we consider the reloca-
tion. Table 3(a,b) shows respectively the scores of the
relevés in the ordination axes given by 4 fuzzy sets
(fuzzy axes) and by the first 3 eigenvectors (capa-
ble to represent the clusters of relevés). In this table
the relevés that, according to the Jancey’s method,
are relocated in other clusters are also indicated. The
Burnaby’s option ‘weighting character independence’
is more efficient than the one weighting character
association. In terms of number of relocations the
Burnaby option weighting character independence is
the most efficient. In fact only three relocations are
suggested.
Table 3 is a very useful table since it shows the
degree of belonging of the relevés to the sets corre-
sponding to the four plant formations and their scores
82
Table 3. Fuzzy axes (a) and eigenvectors (b) for the ordinations of the relev
´
es. Symbols: Arrigoni’s Plant formations: A. Evergreen forest, B. Evergreen maquis, C. Garrigue, D. Vegetation
on rocks. In (a) 1, 2, 3, 4 indicate fuzzy sets corresponding to the plant formations, in (b) 1, 2, 3 indicate the eigenvectors corresponding to the plant formations. The shaded cells indicate
the allocation of relev
´
es according to Jancey’s method.
(a) (b)
83
Figure 2. Ordination of some selected character states by fuzzy axes (f.s.) corresponding to those in Figure 1. (a): ordination of the main life
forms of the strata, (b): ordination of the type of leaf cycles of the strata, (c): ordination of the height of the vegetation and cover of the strata.
84
Table 4. Correlation coefficients between the fuzzy sets and the eigenvectors in Table 3. The shaded cells indicate the maximum positive or negative
correlation.
in the first three eigenvectors. The fact that three
eigenvectors are enough to represent the four clusters
of relevés proves the gradual change in the structure
of the considered vegetation. The continuity of the
change is well reflected by the high number of relo-
cated relevés of the two intermediate plant formations,
namely evergreen maquis and garrigue.
Table 4 presents the correlation coefficients be-
tween the fuzzy ordination axes and the eigenvectors
for the three similarity matrices (Burnaby, Gower and
Goodall); it is clear that eigenvector 1 corresponds
to the plant formation on rocks, the negative side
of eigenvector 2 corresponds to the evergreen for-
est and the positive side to garrigue, eigenvector 3
corresponds to evergreen maquis.
All the jointplots of the main ordination axesshow
more or less similar ordination patterns. There is a
clear structural gradient from the evergreen forest to
the vegetation formationon rocks. For this reason only
theordinationbased on the fuzzysets correspondingto
evergreen forest and the rock vegetation is presented
(Figure 1). The evergreen forest shows the highest
structuraldiversity, the rock vegetation the lowest one.
The average number of qualitative characters in the
relevés and the number of different strata are respec-
tively: evergreen forest 19.5, 4.5; evergreen maquis
18, 3.4; garrigue 15.5, 2; vegetation on rocks 15, 2.2.
The number of strata in the last plant formation is
some time higher than that in garrique since trees can
grow within rocks.
By superimposing the fuzzy ordination of struc-
tural characters (Figure 2a–c) to the relevés ordination
in Figure 1, the correspondence between the structural
characters and the gradient becomes evident. Table 5
presents the degrees of belonging of the structural
charactersto the sets correspondingto Arrigoni classi-
fication calculated according to formula (5) and shows
also their weight in terms of association and indepen-
dence calculated according to formulas (2a) and (2b).
The table shows that the extremeformations(A and D)
are characterised by more characters than the interme-
diate formations (B and C) and that in C and D there
are many characters with similar degree of belonging.
Discussion
According to Gower (1970), Burnaby formulated his
index already in 1965, but he was reluctant to publish
it because he wanted to analyse in more details some
of its properties and because Goodall (1964, 1966)
had anticipated some of his ideas. Probably nobody
applied Burnaby’s index in ecological work because it
was presented in a geological journal, without an ac-
cessible computer program and because Gower (1970)
stressed many points that he was judging weak. We do
not want to discuss here the criticism of Gower, since
the computer program BURNY-COCHIS lets the user
to apply the idea of Burnaby (1970) and of Feoli &
Lagonegro (1983) in a very flexible way. This means
that the variables may be transformed in different
ways and that the contingency tables for association
measures can be obtained also according to Gower’s
suggestions (i.e., beside using only 2 × 2 contingency
tables).
Tests to measure the best performance on the ba-
sis of class separation based on internal or external
criteria are largely available and the Jancey method
is one of them! What we want to stress here is the
idea of weighting the characters by their informa-
tion content and by their association. The context is
pragmatic: weighting characters by their information
content means to give more weight to rare characters,
i.e. to give more similarity to vegetation states that
have rare characters in common. This may be useful
from a conservation point of view. Weighting charac-
85
Table 5. Degree of belonging of the character states to the four fuzzy sets corresponding to the plant formation of Arrigoni based on Burnaby index
weighting character independence (see Table 3 for symbols). The weight of character is also indicated; Weight ind.: weight based on character
independence, Weight ass.: weight based on character association (see text). The shaded cells show the characters that are more linked to the four
plant formations (arbitrary threshold 0.6 has been chosen).
86
ters by their association or independence (in this case
the association is also considered before to compute
the independence) is particularly useful to limit the
negative influence of many redundant non-predictive
characters over few predictive ones. This is in line
with what Intersection Analysis (Feoli & Lagonegro
1979; Feoli et al. 1981) is supposed to do, namely
to give the same importance to groups with many
characters and to character groups with few charac-
ters. As was shown by Feoli et al. (1981), polythetic
classifications, based on the Adansonian principle of
equal weight, may not be free from ecological mis-
classifications. Polythetic classifications may be less
meaningfulthanmonotheticones in the sense that may
be less predictive with respect to chemical physical
factors of ecological relevance. The Burnaby’s idea
of weighting characters may integrate monothetic and
polythetic classifications as supported also by Feoli &
Lagonegro (1983) by COCHIS. The fact that the clas-
sification based on Burnaby’s index is more similar to
the one given by Arrigoni (1996a) suggests that the
logic of Burnaby’s index is more close to the process
of classification worked out by our brain, namely,
given a system to be analysed, the analysis of char-
acter variation is done considering their association
network rather than by considering only one character
at a time. The association pattern between characters
is a very important character itself that in vegetation
system analysis should not be neglected. The possibil-
ity to quantify relationships in a clear fuzzy set theory
context is one advantage of using Burnaby’s index in
vegetation studies.
Acknowledgements
This study was sponsored by Italian C.N.R. and by
Italian M.U.R.S.T. (funds of 40%). We would like to
thank M. Dale, C. Candian, E. Pitacco and one anony-
mous reviewer for useful comments and suggestions.
References
Anand, M. 1994. Pattern, process and mechanism. The fundamen-
tals of scientific inquiry applied to vegetation science. Coenoses
9 (2): 81–92.
Anand, M. & Orlóci, L. 1996. Complexity in Plant Communities:
the Notion and Quantification. J. Theor. Biol. 179: 179–186.
Anderberg, M. R. 1973. Cluster analysis for applications. Academic
Press. New York.
Arrigoni, P. V. 1996a. Documenti per la carta della vegetazione delle
montagne calcaree della Sardegna centro-orientale. Parlatorea 1:
5–33.
Arrigoni, P. V. 1996b. A classification of plant growth forms ap-
plicable to the Floras and Vegetation types of Italy. Webbia 50
(2): 193–203.
Barkman, J. J. 1979. The investigation of vegetation texture and
structure. Pp. 123–160. In: M. J. A. Wreger (ed.), The Study of
Vegetation. Junk, The Hague.
Barkman, J. J. 1988. New system of plant growth forms and pheno-
logical plant types. In: Werger, M. J. A. et al. (eds), Plant form
and vegetation structure. SPB Acad. Publ. The Hague.
Burnaby, T. P. 1970. On a method for character weighting a similar-
ity coefficient, employing the concept of information. J. Int. Ass.
Math. Geol. 2: 25–38.
Dale, M. 1988. Mutational and Nonmutational similarity Measures:
A Preliminary Examination. Coenoses 3: 121–133.
De Patta Pilar, V. & Orlóci, L. 1991. Fuzzy Components in
Community Level Comparisons. Pp. 87–93. In: Feoli, E. & Or-
lóci, L. (eds), Computer Assisted Vegetation Analysis. Kluwer
Academic Publishers, Dordrecht.
De Patta Pilar, V. & Orlóci, L. 1993. Character-based community
analysis: the theory and an application program. SPB Academic
Publishing bv.
Ezcurra, E. 1987. A comparison of reciprocal averaging and non-
centred principal component analysis. Vegetatio 71: 41–47.
Feoli, E. 1977. On the resolving power of principal component
analysis in plant community ordination. Vegetatio 33 (2/3):
119–125.
Feoli, E. 1984. Some aspects of classification and ordination of
vegetation data in perspective. Studia Geobot. 4: 7–21.
Feoli, E. & Lagonegro, M. 1979. Intersection analysis in phy-
tosociology: computer program and application. Vegetatio 40:
55–59.
Feoli, E. & Lagonegro, M. 1983. A resemblance function based on
probability: Applications to field and simulated data. Vegetatio
53: 3–9.
Feoli, E., Lagonegro, M. & Biondani, F. 1981. Strategies in syntax-
onomy: A discussion of two classifications of grasslands of Friuli
(Italy). Pp. 95–107. In: Dierske, H. (ed.), Syntaxonomy. Cramer,
Vaduz.
Feoli, E., Orlóci, L., & Scimone, M. 1985. Measuring structural
convergence of vegetation types on the basis of floristic data.
Abstr. Bot. 9: 17–32.
Feoli, E. & Zuccarello, V. 1986. Ordination based on classification:
yet another solution? Abstr. Bot. 10: 203–219.
Feoli, E. & Zuccarello, V. 1994. Naiveté of fuzzy system spaces in
vegetation dynamics? Coenoses 9 (1): 25–32.
Gower, J. C. 1970. A note on Burnaby’s character-weighted similar-
ity coefficient. J. Int. Ass. Math. Geol. 2: 39–45.
Gower, J. C. 1971. A general coefficient of similarity and some of
its properties. Biometrics 27: 857–871.
Goodall, D. W. 1964. A probabilistic similarity index. Nature 203:
1098.
Goodall, D. W. 1966. A new similarity index based on probability.
Biometrics 22: 882–907.
Goodall, D. W. 1993. Probabilistic indices for classification - Some
extensions. Abstr. Bot. 17 (1–2): 125–132.
Marsili Libelli, S. 1989. Fuzzy clustering of ecological data.
Coenoses 2: 95–106.
Mueller Dombois, D. & Ellenberg, H. 1974. Aims and Methods of
Vegetation Ecology. Wiley, New York.
Orlóci, L. 1978. Multivariate analysis in vegetation research. Junk,
Den Haag.
Orlóci, L. & Orlóci, M. 1985. Comparison of communities without
the use of the species: model and example. Ann. Bot. 43: 275–
285.
87
Noy-Meir, I. 1973. Data transformation in ecological ordination. I.
some advantages of non-centering. J. Ecol. 61: 329–341.
Podani, J. 1995. Multivariate data analysis in ecology and system-
atics. A methodological guide to the SYN-Tax 5.0 package. SPB
Academic Publishing bv.
Roberts, D. W. 1986. Ordination on the basis of fuzzy set theory.
Vegetatio 66: 123–131.
Sneath, P. H. A. & Sokal, R.R. 1973. Numerical taxonomy. W.H.
Freeman and Company.
ter Braak, C. J. F. 1995. Ordination. Pp. 91–173. In: Jongman,
R.F.G., C.J.F. ter Braak & O.F.R. Van Tongeren (eds), Data
Analysis in Community and Landscape Ecology. Cambridge
University Press, Cambridge.
Zhao, S.X. 1986. Discussion on Fuzzy Clustering. Pp. 612–614. 8
th
int. Conf. On Pattern Recognition, IEEE Press, New York.
Zimmermann, H.J. 1984. Fuzzy Set Theory and its Applications.
Kluwer-Nijhoff Publishing, Boston.
... The sets are given by the list of species present in the OGUs. Several methods of hierarchical clustering have been applied with SYNTAX package (Podani 2001), as optimal classification we have chosen the one with the dendrogram showing the most clear separation between the clusters in the space defined by the eigenvectors of the Jaccard similarity matrix (Carranza et al. 1998). In particular, we have applied complete linkage, single linkage, average linkage between the groups and average linkage within the groups ; the optimal classification according to the method proposed by Carranza et al. (1998) was obtained with average linkage within the groups. ...
... Several methods of hierarchical clustering have been applied with SYNTAX package (Podani 2001), as optimal classification we have chosen the one with the dendrogram showing the most clear separation between the clusters in the space defined by the eigenvectors of the Jaccard similarity matrix (Carranza et al. 1998). In particular, we have applied complete linkage, single linkage, average linkage between the groups and average linkage within the groups ; the optimal classification according to the method proposed by Carranza et al. (1998) was obtained with average linkage within the groups. Once a classification is selected, the clusters can be characterized in several ways. ...
... Average linkage clustering within the groups has produced the optimal classification according to the method suggested by Carranza et al. (1998) by showing 9 well separated clusters of OGUs. The description of the clusters is given in Table 1 according to the concentration values of the species groups (Wittig et al. 1985). ...
Article
Full-text available
We propose a method that has general relevance to the digital representation of spatial variation of multivariate landscape data. It is based on the average similarity that operational geographic units (OGU) have with the adjacent ones according to characters relevant understanding landscape patterns and dynamics. The method is flexible and easily executable within the technological framework of geographic information systems (GIS) that today is available even free of charge or at very low cost. An example shows how the method, applied to spatial data of a floristic project for the urban area of Trieste (NE-Italy), can identify floristically homogeneous patches and can quantify the heterogeneity of the transition zones between such patches.
... Assuming soil as a complex natural body organized in components of increasing complexity that are recorded in terms of the hierarchical organization of soil categories (Hoosbeek et al., 1999), local relationships between soil and biological communities may be investigated by focusing on discrete, lower-level classes like soil series and phases (Soil Science Division Staff, 2017) that delineate mutually exclusive soil categories and map their spatial distribution as a set of non-overlapping polygons. However, the partition of the soil continuum into lower-level classes has to deal with the transition of classification criteria from clear-cut genetic relationships to rules assessing the influence of local factors and processes on soil characteristics and behaviour (Butler, 1980), and the shift from clear discontinuities in landforms to diffuse vertical and lateral subsurface variations in the feature space (Burrough et al., 1997). A further question is the lack of knowledge about local relationships between soil classes and biological communities, which can be addressed by delineating soil classes with no predefined diagnostic characteristics, thus avoiding to neglect meaningful soil-biological community relationships. ...
... Catena 180 (2019) 169-182 weight to rare events and the use of uncorrelated (Goodall, 1966) or correlated but automatically weighted attributes (Burnaby, 1970). Taking account of Gower's observations and the progress of fuzzy logic, Goodall andCarranza et al. (1998) set up software tools that successfully tested both indexes in quantitative ecology investigations. ...
... The overall pairwise similarity S jk is in the end calculated as the complement to 1 of the χ 2 value probability. Burnaby's similarity coefficient (Burnaby, 1970) was reconsidered and improved by Carranza et al. (1998). It is defined by the equation ...
Article
Land use assessment is among the practical purposes of soil classification. Several researches has been specifically focused on the use of conventional surveys in the evaluation of soil suitability for agriculture products whose quality is influenced by the interactions between soil, plants and the biological stock of the rhizosphere. Our aim was to expand the application of the soil suitability protocol by relating soil types with biodiversity and the ecological equilibria of biological communities. This goal can be pursued by combining the approaches to soil type delineation with the techniques of quantitative ecology that are based on the similarity theory. Given the qualitative scale of several field-recorded attributes make their numerical processing difficult, we focused on numerical techniques for multivariate sets of data to combine with geostatistics. We thought these techniques would originate local soil classes meaningful in terms of both soil processes and soil suitability evaluation. Since auger boring data are formally comparable to vegetation data, we tested Goodall's and Burnaby's pairwise similarity indexes: the former assumes that pairs of observations sharing an infrequent value are more similar than pairs which share more frequent values; the latter considers associations among attributes, giving higher weights to independent ones. We did an intensive soil survey in a 1200-ha flood plain of the Istria region, Croatia. The morphological characteristics of soil cores were recorded and analysed to produce pairwise similarities that were partitioned by hierarchical clustering into similarity vectors. Such vectors were in the end submitted to geostatistical analysis for the drawing up of similarity maps. Both similarity measures originated five partially overlapping clusters that were consistent with the main soil forming processes present in the investigated area. Goodall's index gave the most meaningful results, fulfilling three compulsory requirements for soil mapping: i) similarity vectors were meaningful in terms of fluvial dynamics; ii) similarities displayed a structured spatial variability; and, iii) similarity maps were consistent with the soil forming factors acting in the investigated area. The results obtained indicate that Goodall's similarity index could be currently used in soil suitability evaluation, allowing to better exploit field-recorded data and to extend its application to the relations existing between soil types and the ecological equilibria of biological communities.
... Theoretical Ecology 2020). In addition, among the several dissimilarity coefficients that have been developed to handle mixed data sets (Gower 1971;Carranza et al. 1998;Podani 1999) some of them (see, e.g., Pavoine et al. 2009) allow the inclusion of variable weights for the various data sources such that the different variables may contribute differently to the calculation of multivariate landscape complexity (see Rocchini et al. 2017 for an example). In this way, if some variables are more important than others in determining landscape complexity and functioning, then they should be given greater relevance for the calculation of quadratic diversity (Pavoine et al. 2009). ...
Article
Full-text available
Spatio-ecological heterogeneity has a significant impact on various ecosystem properties, such as biodiversity patterns, variability in ecosystem resources, and species distributions. Given this perspective, remote sensing has gained widespread recognition as a powerful tool for assessing the spatial heterogeneity of ecosystems by analyzing the variability among different pixel values in both space and, potentially, time. Several measures of spatial heterogeneity have been proposed, broadly categorized into abundance-related measures (e.g., Shannon’s H) and dispersion-related measures (e.g., Variance). A measure that integrates both abundance and distance information is the Rao’s quadratic entropy (Rao’s Q index), mainly used in ecology to measure plant diversity based on in-situ based functional traits. The question arises as to why one should use a complex measure that considers multiple dimensions and couples abundance and distance measurements instead of relying solely on simple dispersion-based measures of heterogeneity. This paper sheds light on the spatial version of the Rao’s Q index, based on moving windows for its calculation, with a particular emphasis on its mathematical and statistical properties. The main objective is to theoretically demonstrate the strength of Rao’s Q index in measuring heterogeneity, taking into account all its potential facets and applications, including (i) integrating multivariate data, (ii) applying differential weighting to pixels, and (iii) considering differential weighting of distances among pixel reflectance values in spectral space.
... There are several methods to select the most informative classification of a set of VUs (e.g. Feoli & Lausi 1980;Dale 1988;Pillar 1996Pillar , 1999Pillar and Orló ci 1996;Carranza et al. 1998;Feoli et al. 2006Feoli et al. , 2009); among the many methods the one proposed by Feoli et al. (2009) looks the most coherent with the concept of diversity. It calculates for each hierarchical level the evenness of the eigenvalues of the corresponding similarity Diversity patterns of vegetation systems 799 matrix given by within-between clusters' average similarity and tests the value with a permutation technique (see Manly 1997). ...
Article
Full-text available
In this article, I suggest a simple method, based on similarity theory that can be used to generate diversity patterns of vegetation systems at different hierarchical levels of their description. It is a very flexible method that measures the diversity according to hierarchical classifications of vegetation units (VUs) sampled from the vegetation system under study. The VUs may be either individual plants or vegetation relevès (preferably of the Braun Blanquet approach). It follows that the diversity measures of a vegetation system can be “individual plant based diversity measures” or “plant community based diversity measures”. The two kinds of measures of vegetation diversity are complementary and the choice to calculate both or a single one of them depends on the aim of the study.
... Besides to be the basis for numerical classifications and ordinations, similarity matrices can be very useful to correlate two or more sets of characters that describe the same set of objects, as was proposed by Mantel (1967), i.e. by calculating the correlation between similarity matrices, or as was proposed by Zerihun Woldu & al. (1989), by the correlation between fuzzy sets obtained by similarity matrices, or, as was proposed by Feoli & Ganis (1986), by the autocorrelation of single or composite variables in spaces defined by the similarity matrices. These ways of finding correlations between sets (representing classes of objects and or measures on single of groups of variables), overcomes all the problems related to different scales of measuring variables that we may find in traditional "parametric" methods treated in Mardia & al. (1979) and Jongman & al. (1995), because we can use different types of similarity functions including those suitable for mixed data such Goodall's (Goodall 1964, 1966, Burnaby's (1970) and Gower's (1970Gower's ( , 1971) ones (see Carranza & al. 1998). ...
Article
Full-text available
The role of similarity-dissimilarity matrices is discussed within a conceptual framework that shows the strict connections between similarity, classification and diversity in vegetation studies. Examples of application of the evenness of the eigenvalues of similarity matrices (E()) to define classes, to measure correlation between biological communities and environmental factors and to measure diversity of vegetation systems as a parameter , by the formula k, where k is the number of classes and E(), are given by considering two data sets regarding beech forests of the Italian peninsula.
... Climate regulation, Recreation and Tourism, and Existence value of biodiversity; Appendix G). In order to maximise signal (Paracchini et al., 2011), in some cases the range within which each indicator value was normalised was restricted based on its distribution on the quintiles of the observed values (Burnaby, 1970;Carranza et al., 1998). ...
Preprint
Coastal sand dunes are complex transitional systems hosting high levels of biodiversity and providing important benefits to society. In this paper we aimed to evaluate the multi-service nature of ecosystem services (ES) supply in the dunes of the Italian Adriatic coastwithin Natura 2000 (N2K) sites. We i) identified ES indicators and assessed the supply capacity (Climate regulation, Protection fromwind and aerosol, Erosion regulation, Recreation and Tourismand Existence value of biodiversity) of natural dune ecosystems of European conservation concern; ii) upscaled this data to create an inventory of ES supply for all dune N2K sites in the study area; iii) explored the trade-offs among ES; and iv) summarized and spatially compared the overall multi-service value of the N2K sites.The study provides a method for quantifying the role of N2K sites in supplying benefits for our society. We found that the multi-service capacity of coastal dunes is uneven within sites and within administrative regions. This variability is related to both ecological (e.g. distribution, ecological integrity, extent and conservation status of dune habitats) and administrative (e.g. local implementation of the Habitats Directive) characteristics of the analysed area. ES are not coupled as several sites with high values for one ES show very low values for others. The results suggest that conservation actions should favour restoration of the natural dune zonation, since this underpins multi-service ES supply. The approach can distinguish regions with high ES values and regions where the paucity of protected areas represents a gap in ES supply, fact that offers an incentive to enhance the protection strategy butalso suggests an urgent need to improve the N2K network by enlarging existent sites and including new ones.
... Climate regulation, Recreation and Tourism, and Existence value of biodiversity; Appendix G). In order to maximise signal (Paracchini et al., 2011), in some cases the range within which each indicator value was normalised was restricted based on its distribution on the quintiles of the observed values (Burnaby, 1970;Carranza et al., 1998). ...
Preprint
Coastal sand dunes are complex transitional systems hosting high levels of biodiversity and providing important benefits to society. In this paper we aimed to evaluate the multi-service nature of ecosystem services (ES) supply in the dunes of the Italian Adriatic coast within Natura 2000 (N2K) sites. We i) identified ES indicators and assessed the supply capacity (Climate regulation, Protection from wind and aerosol, Erosion regulation, Recreation and Tourism and Existence value of biodiversity) of natural dune ecosystems of European conservation concern; ii) upscaled this data to create an inventory of ES supply for all dune N2K sites in the study area; iii) explored the trade-offs among ES; and iv) summarized and spatially compared the overall multi-service value of the N2K sites.The study provides a method for quantifying the role of N2K sites in supplying benefits for our society. We found that the multi-service capacity of coastal dunes is uneven within sites and within administrative regions. This variability is related to both ecological (e.g. distribution, ecological integrity, extent and conservation status of dune habitats) and administrative (e.g. local implementation of the Habitats Directive) characteristics of the analysed area. ES are not coupled as several sites with high values for one ES show very low values for others. The results suggest that conservation actions should favour restoration of the natural dune zonation, since this underpins multi-service ES supply. The approach can distinguish regions with high ES values and regions where the paucity of protected areas represents a gap in ES supply, fact that offers an incentive to enhance the protection strategy but also suggests an urgent need to improve the N2K network by enlarging existent sites and including new ones.
... Climate regulation, Recreation and Tourism, and Existence value of biodiversity; Appendix G). In order to maximise signal (Paracchini et al., 2011), in some cases the range within which each indicator value was normalized was restricted based on its distribution on the quintiles of the observed values (Burnaby, 1970;Carranza et al., 1998). ...
Article
Coastal sand dunes are complex transitional systems hosting high levels of biodiversity and providing important benefits to society. In this paper we aimed to evaluate the multi-service nature of ecosystem services (ES) supply in the dunes of the Italian Adriatic coast within Natura 2000 (N2K) sites. We i) identified ES indicators and assessed the supply capacity (Climate regulation, Protection from wind and aerosol, Erosion regulation, Recreation and Tourism and Existence value of biodiversity) of natural dune ecosystems of European conservation concern; ii) upscaled this data to create an inventory of ES supply for all dune N2K sites in the study area; iii) explored the trade-offs among ES; and iv) summarized and spatially compared the overall multi-service value of the N2K sites. The study provides a method for quantifying the role of N2K sites in supplying benefits for our society. We found that the multi-service capacity of coastal dunes is uneven within sites and within administrative regions. This variability is related to both ecological (e.g. distribution, ecological integrity, extent and conservation status of dune habitats) and administrative (e.g. local implementation of the Habitats Directive) characteristics of the analysed area. ES are not coupled as several sites with high values for one ES show very low values for others. The results suggest that conservation actions should favour restoration of the natural dune zonation, since this underpins multi-service ES supply. The approach can distinguish regions with high ES values and regions where the paucity of protected areas represents a gap in ES supply, fact that offers an incentive to enhance the protection strategy but also suggests an urgent need to improve the N2K network by enlarging existent sites and including new ones.
... There are several methods to select the most informative classification of a set of VUs (e.g. Feoli & Lausi 1980;Dale 1988;Pillar 1996Pillar , 1999Pillar and Orló ci 1996;Carranza et al. 1998;Feoli et al. 2006Feoli et al. , 2009); among the many methods the one proposed by Feoli et al. (2009) looks the most coherent with the concept of diversity. It calculates for each hierarchical level the evenness of the eigenvalues of the corresponding similarity Diversity patterns of vegetation systems 799 matrix given by within-between clusters' average similarity and tests the value with a permutation technique (see Manly 1997). ...
Article
Full-text available
In this article, I suggest a simple method, based on similarity theory that can be used to generate diversity patterns of vegetation systems at different hierarchical levels of their description. It is a very flexible method that measures the diversity according to hierarchical classifications of vegetation units (VUs) sampled from the vegetation system under study. The VUs may be either individual plants or vegetation releve`s (preferably of the Braun Blanquet approach). It follows that the diversity measures of a vegetation system can be ‘‘individual plant based diversity measures’’ or ‘‘plant community based diversity measures’’. The two kinds of measures of vegetation diversity are complementary and the choice to calculate both or a single one of them depends on the aim of the study. The method consists in a procedure that computes the similarity between the VUs on the basis of a set of characters that can be defined from single different disciplines (taxonomy, evolution, chemistry, chorology, etc.) or combinations of them. The VUs are hierarchically classified by any logical hierarchical classification method (a dendrogram can be used to suggest the hierarchical classification when there are no other logical alternatives) and the diversity is calculated for each hierarchical level by using the frequency of the VUs in the classes. T
... For example, one can evaluate the compactness and between-cluster separation in the multivariate space (e.g. Carranza et al. 1998;Aho et al. 2008;Roberts 2015) Similar internal heterogeneity Evaluation of the similarity of vegetation types in their internal heterogeneity (e.g. compositional variability) Classification stability ...
Article
Full-text available
Aims: Classification of vegetation is an essential tool to describe, understand, predict and manage biodiversity. Given the multiplicity of approaches to classify vegetation, it is important to develop international consensus around a set of general guidelines and purpose-specific standard protocols. Before these goals can be achieved, however, it is necessary to identify and understand the different choices that are made during the process of classifying vegetation. This paper presents a framework to facilitate comparisons between broad-scale plot-based classification approaches. Results: Our framework is based on the distinction of four structural elements (plot record, vegetation type, consistent classification section and classification system) and two procedural elements (classification protocol and classification approach). For each element we describe essential properties that can be used for comparisons. We also review alternative choices regarding critical decisions of classification approaches; with a special focus on the procedures used to define vegetation types from plot records. We illustrate our comparative framework by applying it to different broad-scale classification approaches. Conclusions: Our framework will be useful for understanding and comparing plot-based vegetation classification approaches, as well as for integrating classification systems and their sections.
Article
Full-text available
A method measuring convergence of vegetation structure based on floristic data is presented. A convergence model in different spaces of different hierarhical meaning is described. The use of mutual information of contingency tables, describing the spaces, seems to be a suitable function on which to base measures of convergence .
Article
Full-text available
Ordination and classification have always been important stages in ecological data analysis. This paper presents a clustering technique based on fuzzy sets to obtain both ordination and classification particularly well suited for ecological analyses. Three algorithms are presented to categorize data, classify new ones, and produce fuzzy dendrograms. Some examples demonstrate the algorithms’ capabilities to yield graded classification of data and show that this approach has more flexibility compared to classical methods.
Conference Paper
Full-text available
This is the original version of quantitative character-based community analysis implemented on a hierarchical model. The modus operandi is a must in statistical community level convergent evolutionary studies based on taxa other than species, such as functional types, life-form types, growth form-types, or any other character set types.
Chapter
Ecological data has several special properties: the presence or absence of species on a semi-quantitative abundance scale; non-linear relationships between species and environmental factors; and high inter-correlations among species and among environmental variables. The analysis of such data is important to the interpretation of relationships within plant and animal communities and with their environments. In this corrected version of Data Analysis in Community and Landscape Ecology, without using complex mathematics, the contributors demonstrate the methods that have proven most useful, with examples, exercises and case-studies. Chapters explain in an elementary way powerful data analysis techniques such as logic regression, canonical correspondence analysis, and kriging.
Article
The distinction between pattern, process and mechanism is rarely explicit in accounts of vegetation study, albeit always implied. The need for a clear distinction of these three concepts is stressed as a pre-requisite for clarity in studying vegetation. Vegetation science is examined in terms of its components, namely, pattern recognition, process definition and mechanism isolation, and the sources of difficulty are identified. In doing so, the evolution of thought and methods are summarized, reflecting upon the works of some major contributors. The importance of scale is emphasized and some approaches to its study reviewed. It is concluded that the problems identified may be ill-posed, rendering them more difficult to solve than previously thought. -from Author
Chapter
An examination of many of the indices proposed as numerical measures of pairwise similarity shows that they have strong relationships to string-to-string measures variously known as ‘Levenshtein distance’, ‘longest common subsequence’ or ‘minimal mutation distance’. The variations among coefficients are created in several ways, including changing the set of operations, using a richer structural pattern, modifying weights, limiting the extent of operations and varying the basis for normalisation. In total these measures provide a very flexible means of assessing similarity and can be extended to similarities based on collections of strings. While not denying the interest to the user of other properties, such as metricity or embedding in a euclidean space, examining the coefficients as variations on the Levenshtein theme provides a common basis for their comparison and provides the user with a means of choosing between coefficients in a rational manner. But however interesting this array of coefficients might be, it remains true that only some features of similarity will be captured in a minimal mutational measure. These features may be more or less than are actually required by the user. In this paper I have made a preliminary examination of various measures, some of which are related to the Levenshtein metric, and some of which appear to capture other aspects of similarity (i.e. topological, functional, analogic and/or conceptual). These latter are all measures which I have been unable to relate to the Levenshtein distance, although I have not pursued this very far as yet. All measures were applied to vegetation data, classifying both plots and attributes into a two-way table. The SAHN algorithm has been used for most of the clusterings, so that differences between measures of similarity are the primary cause of differences in results. In a few cases other clustering algorithms have been used and the data has been converted to presence/absence when this was necessary with the particular coefficient.
Conference Paper
The application of fuzzy set theory for describing vegetation dynamics is discussed on the basis of results obtained with forest data of NE Italy. -Authors
Article
A new simple method of ordination is proposed. It is based on classification of relevés, or variables, and matrix multiplication. An example of application is given with data from a simulated coenocline with different noise levels. The performance of the method may be considered as good as that of the most popular methods, or even better if the monotonous pattern is considered as an advantage
Article
A comparison of centered and non-centered principal component ordinations of various data sets shows that not always does non-centering merely add a trivial (`general') unipolar first component. This happens only in an internally homogeneous or continuous data set. If the set contains one or more disjunctions (complete discontinuities), each of the resulting submatrices has its own unipolar component and an independent set of bipolar components. Various intermediate cases may occur. This property of non-centered ordinations has two advantages in applications to the description of vegetation: (a) it allows an assessment of the number and sharpness of discontinuities in the sample; (b) in discontinuous data it minimizes the interference between the variation on the two sides of the break. Each subset of stands and species contributing to the same unipolar component is interpreted as a `vegetation series', which is disjunct in its composition from other series. Within each series so defined variation is continuous, and expressed by the bipolar components differentiating `compositional phases' within it.
Article
Since the very beginning in plant geography, man felt the need of creating vegetal groups sharing common growth or development patterns, according to their morphology, that is the physiognomic expression of ecological adaptations to the environmental conditions. Sucessively several classifications have been worked out aiming at building a rational synthesis of the multiplicity of plants forms. These classifications offer many useful information but they do not embrace to advantage the variegate typologies of floras and vegetations which may be found in different geographical districts.