ArticlePDF Available

A new diversity index

IOP Publishing
Physical Biology
Authors:

Abstract and Figures

We introduce here a new index of diversity based on consideration of reasonable propositions that such an index should have in order to represent diversity. The behaviour of the index is compared with that of the Gini–Simpson diversity index, and is found to predict more realistic values of diversity for small communities, in particular when each species is equally represented and for small communities. The index correctly provides a measure of true diversity that is equal to the species richness across all values of species and organism numbers when all species are equally represented, as well as Hill’s more stringent ‘doubling’ criterion when they are not. In addition, a new graphical interpretation is introduced that permits a straightforward visual comparison of pairs of indices across a wide range within a parameter space based on species and organism numbers.
This content is subject to copyright. Terms and conditions apply.
Phys. Biol. 18 (2021) 066004 https://doi.org/10.1088/1478-3975/ac264e
OPEN ACCESS
RECEIVED
8 March 2021
REVISED
5 July 2021
ACCEPTED FOR PUBLICATION
13 September 2021
PUBLISHED
11 October 2021
Original content from
this work may be used
under the terms of the
Creative Commons
Attribution 4.0 licence.
Any further distribution
of this work must
maintain attribution to
the author(s) and the
title of the work, journal
citation and DOI.
PAPER
Anewdiversityindex
ATAugousti
1,,NAtkins
1,ABen-Naim
2,SBignall
3,GHunter
1, M Tunnicliffe1
and A Radosz4
1Faculty of Science, Engineering and Computing, Kingston University London, United Kingdom
2Department of Physical Chemistry,The Hebrew University, Jerusalem 91904, Israel
3The Portland Hospital, London, United Kingdom
4Wroclaw University of Science and Technology, Wroclaw, Poland
Author to whom any correspondence should be addressed.
E-mail: augousti@kingston.ac.uk
Keywords: diversity index, Gini– Simpson index, biodiversity, species richness, entropy
Abstract
We introduce here a new index of diversity based on consideration of reasonable propositions that
such an index should have in order to represent diversity. The behaviour of the index is compared
with that of the Gini–Simpson diversity index, and is found to predict more realistic values of
diversity for small communities, in particular when each species is equally represented and for small
communities. The index correctly provides a measure of true diversity that is equal to the species
richness across all values of species and organism numbers when all species are equally represented,
as well as Hill’s more stringent ‘doubling’ criterion when they are not. In addition, a new graphical
interpretation is introduced that permits a straightforward visual comparison of pairs of indices
across a wide range within a parameter space based on species and organism numbers.
1. Introduction
The issue of measuring diversity in an ecosystem has
been a topic of vigorous discussion for many years.
Although many measures for measurement of this
diversity have been introduced over the years, it is fair
to say that most of these have been imported from
other fields. For example, a commonly used index,
the Gini– Simpson index, was originally introduced
by Gini in 1912 [1] as a statistical measure to indi-
cate the variability in distributions of both contin-
uous and discrete variables, the latter including, for
example, four categories of hair colour. Simpson [2]
later presented essentially the same index to repre-
sent ‘...a measure of concentration in terms of pop-
ulation constants’, in order to overcome difficulties
introduced by measuresdefined by Yule [3]andFisher
et al [4] which depended on sample data rather than
population constants.
The diversity has also been related to Shannon’s
measure of information (SMI) based on the idea that
the greater the diversity in the number of species and
the evenness of the distribution of organisms between
them, then the greater is the uncertainty in selection,
the latter being measured by the SMI (see, for instance
Macarthur [5]). The use of the SMI for such an appli-
cation is not without controversy [6] (as indeed is the
application of the SMI in a wide variety of applica-
tions with varying degrees of suitability (for a discus-
sion of this in the context of entropy see, for example,
Ben-Naim [7]). The concept of diversity has also been
extended to relate to diversity that is measured at mul-
tiple locations and then combined [8,9], and appli-
cations of the concept of diversity range across many
areas of biology, information theory, physics, eco-
nomics and even psychology [10], where the term
emodiversity has been used to describe the variety and
relative abundance of individuals’ emotional experi-
ences. At a microscopic level, the idea of diversity has
also been used to determine the impact of a class of
antibiotics known as macrolides used for therapeutic
purposes on the gut microbiota of children [11], for
instance.
In a very clear analysis Jost [12]distinguishes
between the use of uncertainty measures, which he
terms entropies, and diversities, which are intended
to provide a measure of relative abundance, and he
defines the term ‘true diversity’—an ‘effective num-
ber of species’ which would give the same value as the
diversity index determined for a specific distribution
if all of the species present were equally represented
by organisms. His use of the term ‘true diversity’ is
somewhat in opposition to the views of Hoffman and
Hoffman [13]. Despite the plethora of indices that
© 2021 The Author(s). Published by IOP Publishing Ltd
Phys. Biol. 18 (2021) 066004 ATAugoustiet al
have been developed [14], new indices continue to be
introduced [15], since most of the existing indices suf-
fer from weaknesses that cause them to be misleading
under particular situations, typically when the num-
ber of species or organisms tends to very large or small
limits. Xu et al [16] provide a helpful review that iden-
tifies the connections and differences in a wide range
of indices and entropies.
The aim of this work was to address the develop-
ment of an index by consideration of first principles,
beginning with a series of ‘desirable’ features that such
an index would possess. Some of these desirable fea-
tures seek to address weaknesses perceived in existing
measures, as well as those based on uncertainty.
Section 2details the features expected in such
an index, and offers a possible functional form that
exhibits these features. Section 3compares the results
calculated using this index with that of the widely-
used Gini–Simpson index, with which it shares some
characteristics and introduces a novel graphical rep-
resentation that permits a visual comparison of any
index in a convenient manner. Section 4presents and
discusses these results, and section 5concludes with
a summary and possible future developments. Refer-
ences are presented in reference section, appendix A
provides details of an improved algorithm for iden-
tification of acceptable distributions given particu-
lar values for the number of organisms and species.
Appendix Bprovides a derivation of the form of the
‘true’ diversity (see section 4for details) for this new
index and appendix Cprovides a proof supporting the
discussion in section 4, namely compliance with Hill’s
doubling criterion (see section 4for details).
2. Considerations for construction of a
new index
The purpose of the index is to provide a measure of
diversity— based on the number of species and organ-
isms that inhabit an environment—that gives rela-
tively intuitive results, especially with regard to partic-
ular limits, such as very large or very small numbers
ofspeciesororganisms,aswellasforparticularlyeven
distributions. The following terms will be used
mnumber of species
ninumber of organisms of species i(i=1,
2, .., m)
Ntotal number of organisms
Γdiversity index
Γifunctional form of Γwith respect to species i
n0the ‘ideal’ average number of organisms
per species if all organisms are equally distributed
(=N/m)
kϕscaling factor for azimuthal angle in graphical
representation
kθscaling factor for elevation angle in graphical
representation
pNm
It is clear that
m
i=1
ni=N.(1)
Let the diversity function have the following prop-
erties.
(a) It should depend on the distribution of organ-
isms, the total number of organisms and the total
number of species
Γ=Γ(ni,m,N).
(b) It should have a finite range—this can permit
comparison with other indices
0Γ1.
(c) The index should be maximised when the organ-
isms are evenly distributed among species, so for
ni=N/m=n0
Γ=Γ
max
n0and Γ
ni
=0.
(d) When organisms are evenly distributed among
species, the index should be maximal as the num-
ber of organisms increases
lim
N→∞ Γ=1.
Given these initial constraints, what are the
possible functional forms for Γ?Aformthat
fulfills condition (c) is
Γi=f(ni)e(nin0),(2)
where fdenotes an arbitrary polynomial func-
tion of the argument (ni). The simplest choice
here is to a scaled form in which variable niis
replaced by ni/n0and a linear form is chosen for
f,hence
Γi=ni
n0
e1ni
n0.(3)
Such a functional form for Γihas a peak value
of 1 at ni=n0and diminishes monotonically
to zero as nitends to zero or infinity. This sug-
gests a form for Γitself, which is simply a nor-
malised sum of these contributions arising from
each species Γi
Γ= 1
m
m
i=1
ni
n0
e1ni
n0=
m
i=1
ni
NeNmni
N.(4)
Such an index is determined primarily by the
distribution of organisms between species rather
than the total number of species themselves. This
dependence on the number of species can be
incorporated by the inclusion of two further con-
straints, thus
2
Phys. Biol. 18 (2021) 066004 ATAugoustiet al
(e) If a single species is present only, then the
index should indicate there is no diversity, hence
lim
m1Γ=0.
(f) And, in the case of equally distributed organisms
(see equation (B.3))
lim
m→∞ Γ=1.
Both of these constraints can be accommo-
dated by including an additional factor of m1
m
giving the final form of the index as
Γ= m1
m
m
i=1
ni
NeNmni
N.(5)
3. Representation of the index and
comparison with the GiniSimpson
index
One of the challenges in this area is to find an appro-
priate representation which displays compactly values
of Γfor a wide variety of values of N,m,andni.Such
a representation would also permit an easier compar-
ison with other diversity indices, once they have been
normalised to unity if necessary.
Even for relatively low values of both Nand
m, the possible range and enumeration of distribu-
tions becomes a challenging combinatorial problem
to represent adequately. For example, even for eight
species with 15 organisms to be distributed between
them, this represents 3432 distinct combinations.
This paper introduces, to the best of our knowledge,
a novel graphical transformation which permits the
compression and visualisation of the vast range of
combinations within the positive octant of a three-
dimensional spherical coordinate space.
The starting point is a unit line, along which will
be represented values of the index for a systematic list-
ing of distributions defined by niand m. The system-
atic listing is described further below. The line will be
oriented in the positive octant using a spherical coor-
dinate system, such that the azimuthal angle ϕand the
elevation θare defined by
ϕ=kϕtan1(m1) (6)
θ=kθtan1(N1).(7)
The values of kϕand kθare chosen to provide a
broader fill of the available space, otherwise if these
are omitted then incrementing mor Nfrom starting
values of 1 will result in the next lines being oriented
at 45to the x-axis and the xyplane respectively.
The specific value of the index for each distribution
is colour coded according to a defined colour scale,
with specific colours representing values between 0
and 1, occasionally referred to as a ‘heat map’. So
overall each possible configuration is a set of discrete
coloured points along a notional line whose orien-
tation depends on the total number of species and
organisms. In this way, an unbounded range of val-
ues for these quantities can be mapped and visualised
relatively easily.
The representation of each distribution along each
line is achieved systematically using the following
scheme. Firstly, the unit line is divided by the total
number of distributions which are possible given spe-
cific numbers of species and organisms. One way
to perform this systematic listing is by using mod-
ulo arithmetic and performing a checksum on the
sum of digits. This is most easily illustrated by way
of an example using a low number of species and
organisms.
Consider an environment with three species and a
total of five organisms. Since three species are defined,
there must exist at least one organism representing
each of these species. This leaves two further organ-
isms to be distributed among the three species. The
difference between the number of organisms and
species can be represented by p,hencep=2 here. If
one represents each distribution as an m-digit num-
ber to modulo p+1, then the possible distributions
of remaining organisms are enumerated by counting
as
000
001
002
010
011
012
020
021
022
100
101
102
110
111
120
120
122
200
Where the possible distributions whose checksum is
equal to p=2 are shown in bold. Hence there are
six ways in total to distribute five organisms between
three species. In this instance the unit line would be
subdivided into steps of 1/6, and the value of the diver-
sity index plotted at each of these locations along the
line for the distributions shown below
113
122
131
212
221
311
This describes the initial strategy that was adopted
to enumerate and display the possible distributions.
3
Phys. Biol. 18 (2021) 066004 ATAugoustiet al
However, such an approach based on incrementing
a count uniformly and then checking for the cor-
rect digit sum is highly inefficient as the values of m
and pincrease. For example, for ten species and 20
organisms, the count would need to be in excess of
2.3 ×1010 in order to identify the 92 378 accept-
able distributions. Appendix Aprovides details of
an improved counting strategy which, by using non-
uniform increments, reduces the count to the exact
number of acceptable distributions; in the example
cited above this represents an improvement of over
255 000-fold.
4. Results and discussion
The algorithms described above were coded in Mat-
lab, and figure 1displays an example of a comparison
between the present diversity index (PDI), and a well-
known conventional index used in environmental sci-
ence—the Gini–Simpson index (GSDI)—which is
given by
Gini Simpson index =1
m
i=1
ni(ni1)
N(N1) .(8)
Theimagesshowarangeofvaluesofmbetween
3 and 8, and organism numbers ranging between m
+1 and 15. The coloured dots representing values in
lines for larger numbers of possible distributions are
reduced in size in order to avoid overlap and for ease
of viewing. Such an image is best navigated using a
3D interactive format, as is available in Matlab, for
example
Even in a 2D representation as an image a clear
comparison between values calculated according to
the two indices can be made, as well as the variation
of the value within a single index as the organisms
are distributed among them. For ease of viewing, the
size of the coloured dot representing each value has
been scaled, so that unit lines containing more distri-
butions are represented by smaller dots. The values of
kϕand kθselected for use in figure 1are 1/6and1/8
respectively.
The figure primarily serves to illustrate the fact
that the GSDI generally provides larger values for the
diversity index than the PDI, particularly so at lower
values for the number of species and organisms. An
advantage of the PDI is that it gives lower values than
the GSDI in distributions close to or equal to uniform
representation in small communities.
It follows from the definition above that the limit-
ing value for the new DI in the case of equally repre-
sented species is simply m1
m, contrasting with a value
of N(m1)
m(N1) for the GSDI. In the latter case, when all
species have a single organism representing them the
value of the GSDI is 1, which clearly does not present
a realistic picture of diversity, being both indepen-
dent of the number of species present and suggesting
‘total’ diversity. The value of the new DI in this case
depends on the number of species (as required by the
conditions provided for its construction) but is inde-
pendent of the number of organisms present; it may
be argued that this gives a more realistic value that
represents the distribution of the organisms between
the species irrespective of how many (equal) repre-
sentatives there are for each species. For example, it
might reasonably be considered that three species rep-
resented by ten organisms each in fact represents no
greater diversity than three species represented by a
single member each, and this is the sense of diversity
captured in the new DI. In this sense, it is closer to a
measure of ‘species richness’.
Jost [12] introduces the term ‘true
diversity’—referred to with different names by
different authors, for instance the ‘effective number
of species’ by Macarthur [5]andthe‘numbers
equivalent’ by Patil and Tailee [17]ineconomics.
Thus for a given value of the index calculated for
a particular distribution, this corresponds to the
number of species present if all organisms are equally
represented. In this case, the ‘true diversity’ can easily
be shown to be (see appendix B)
1
1Γ(9)
and this corresponds, in the limit of large Nand ni,
to the same value for the Gini–Simpson index. It is
interesting to note this expression, while true for all
values of species number niand number of organisms
Nfor the new diversity index, is only true in the ‘large
community’ limit for the GS index, and this might
be considered a point in its favour. Indeed, most of
the indices introduced to date may be represented by
expressions of the form
m
i=1ni
Nq
, (10)
where qis an exponent that varies from one index to
another. In consequence, different indices that reduce
to an expression of this kind with the same value of
qare equivalent, and measures of the true diversity
would be the same for such indices, and given by [12]
qD=m
i=1ni
Nq1
1q
.(11)
These are a generalization of Hill’s numbers [18],
and the exponent qmay be termed the order of the
diversity. Thus different functional forms for a diver-
sity index may be connected in this way, with the true
diversity being dependent only on the order of the
diversity index.
The DI proposed here is not represented in this
way, and therefore is not immediately equivalent to
indices thus far introduced. It therefore also does not
fall under the category of a generalised weighted GS
index as proposed by Guiasu and Guiasu [15,19], and
4
Phys. Biol. 18 (2021) 066004 ATAugoustiet al
Figure 1. Comparison of the new diversity index (left) versus the Gini–Simpson index (right). The colour indicates that, for low
values of the number of organisms and species (larger circles, m=6, p=1– 3) the PDI gives values in the 0.53– 0.63, while the
corresponding values for the GSDI are in the range 0.6– 0.83.
hence their analysis of such generalised functions may
not apply here.
Jost [12] introduces a desirable ‘scaling’ test for
an index, proposing that the true diversity calculated
for sixteen equally represented species is represented
by an index which predicts a value of true diversity
twice as large as that for 8 equally represented species.
In other words, the true diversity should be propor-
tional to the number of species present. In common
with indices defined by expression (10)above,the
new DI passes this test. A stiffer requirement is Hill’s
‘doubling property’ [18], which states that for any
particular distribution, halving the number of rep-
resentatives of each species while doubling the num-
ber of species (the example that Jost [12]providesis
to split each species into two equal groups of males
and females and then treat each of these as a separate
species) should double the measure of true diversity.
A proof provided in appendix Bshows that the new DI
also passes this more stringent test, although it could
be argued that strict doubling in this way is not so
meaningful, providing nothing more than a measure
akin to species richness and failing to properly take
into account the relative representation of common
and rare species.
5. Conclusion
A new diversity index has been introduced which is
based on an intuitive set of properties. Values for
the new index have been calculated and compared
with the popular Gini–Simpson index, and it has
been shown that the new index compares favourably
with it in providing more realistic values, in partic-
ular in predicting the ‘true diversity’ at low counts
of organisms and species. In this instance, these low
counts—defining a ‘small community’—are when
the number of species mis 10 or less and the number
of organisms Nis 20 or less. At high counts of both
of these parameters (the ‘large community’ limit) the
two indices tend to coincide for equally represented
species.
The index passes the desirable test of predict true
diversity that is equivalent to species richness in the
case of equally represented species, and passes also a
more stringent test when the species are not equally
represented. As noted above, it could be argued that
this behaviour is similar to a behaviour equivalent
to species richness, which may not be so meaningful
when rare species are only minimally represented.
We introduce here also a novel way of represent-
ing the space of DI values in a convenient form which
permits closer comparison between different formu-
lations of DI in a visual format.
Future work will seek to establish a more effi-
cient counting algorithm for representing the possi-
ble distributions, a comparison with a broader range
of diversity indices, and application to a broader
range of practical problems in order to characterise
the behaviour of the index more fully, including also
potential use in language applications [20]. We will
also seek to extend the work to explore the behaviour
of indices where the linear term in equation (3)is
replaced by a higher order polynomial, as well as to
5
Phys. Biol. 18 (2021) 066004 ATAugoustiet al
extend the application to include measures of phy-
logenetic distance. A further aim of this work is to
explore the behaviour of indices of this kind when
sampling distributions whose frequencies appear to
be predicted on a combinational basis, and which
appear to follow a universal distribution [21].
Data availability statement
All data that support the findings of this study are
included within the article (and any supplementary
files).
Appendix A. Improved algorithm to
identify acceptable distributions for
specific values of pand m
It is probably easiest to describe the algorithm in the
first instance by way of an example. Consider the fol-
lowing example illustrating the distribution of eight
organisms among four species. As described above,
we need only consider here the remaining organisms
noting that four organisms must already respectively
represent each species, hence p=4.
The enumerated distributions follow the pattern
below. Note that these values correspond to digit val-
ues in a modulo-5 counting base. In general, the val-
ues are incremented by a decimal value of 4, and the
decimal increase between the previous row and the
current one is shown only for those values where the
increment differs from 4.
0004
0013
0022
0031
0040
0103 8
0112
0121
0130
0202 12
0211
0220
0301 16
0310
0400 20
1003 28
1012
1021
1030
1102 12
1111
1120
1201 16
1210
1300 20
2002 52
2011
2020
2101 16
2110
2200 20
3001 76
3010
3100 20
4000 120
Recall that these numbers are written in modulo-
(p+1)—in this case modulo-5—and therefore the
digits, counting from the least significant to the
most significant, represent respectively units, p+1,
(p+1)2,(p+1)3etc. Note that each non-uniform
jump of 4 occurs when the units digits is zero, as well
as when this is zero along with higher place value
digits also being zero. Let jrepresent the digit num-
ber, beginning with j=1 as the units digit, and label
the highest non-zero digit as j=qwhen the non-
uniform increment takes place. It may be observed
that the increment in these cases corresponds to
(p+1)qZq(p+1)q1+Zq1whereZqrepre-
sents the value in the qth digit prior to increment-
ing. This forms the basis of the algorithm coded in
Matlab, whereby the sequence of zero-occupied digits
is checked prior to incrementing, and it enumerates
precisely the exact number of acceptable distributions
without any redundant counts.
Using the example above, it should be clear that,
in general, if one unit increments are used (without
the zero-checking procedure noted above), the count
will reach a maximum of p(p+1)m1.Usingastars-
and-bars method (used as early as 1915 by Ehrenfest
and Onnes [22]), it is easy to show that the number
of distributions that contain the correct digit sum (p)
is given by (m+p1)!
p!(m1)!.Inthecasegiveninthetextof
m=10 and p=10 (thus N=20) then the total count
corresponds to 2.357 947 691 ×1010 and the number
of acceptable distributions is 92 378.
Appendix B. Derivation of the form of
the true diversity for the PDI
We follow here an algorithm based on the generalised
one provided by Jost [12]inordertocalculatethe
‘true diversity’ for any diversity index based on a sum
of powers of ni, the number of organisms of species i,
simplified in this instance due to the specified form
of the PDI. By formulating an expression for the
diversity index Γof mequally represented species
(hence ni=n0=N/m)and then solving for m,the
required formula for the true diversity (termed Dby
Jost and equivalent to min this case) is obtained.
Thus
Γ= m1
m
m
i=1
ni
NeNmni
N(B.1)
and using the equal representation condition
ni=N/m=n0
6
Phys. Biol. 18 (2021) 066004 ATAugoustiet al
Γ=m1
m
m
i=1
1
me(1m
m).(B.2)
The exponent is zero, and the summation total is
1. Hence
Γ= m1
m(B.3)
which is easily rearranged to give the required formula
for D, the true diversity.
m=1
1Γ=D.(B.4)
Appendix C. Proof of compliance with
Hill’s doubling property
Beginning from the definition of the DI
Γ= m1
m
m
i=1
ni
NeNmni
N(C.1)
one may label the sum in the formula as S,hence
Γ= (m1)S
m.(C.2)
The true diversity as defined by Jost is given by
D=1
1Γ(C.3)
which in the case of equally distributed species (i.e.
when Γ= m1
m,andS=1) easily reduces to a value of
m.
If ‘Hill doubling’ is conducted in the manner
described by Jost (splitting each species into two equal
groups of males and females and then treating each
of these as a separate species) then this may be repre-
sented in the following way
m2m
nini
2
ni+m=ni
and Nis unchanged.
Then
Γdouble =2m1
2m
2m
i=1
ni
2NeN2mni
/2
N
Γdouble =2m1
2m
2m
i=1
ni
2NeNmni
N
Γdouble =2m1
2mm
i=1
ni
2NeNmni
N
+
2m
i=m+1
ni
2NeNmni
N(C.4)
and using the property that
ni+m=ni
this may be rewritten
Γdouble =2m1
2mm
i=1
ni
2NeNmni
N
+
m
i=1
ni
2NeNmni
N(C.5)
Γdouble =2m1
2mS
2+S
2
hence
Γdouble =(2m1)S
2m.(C.6)
Since
Ddouble =1
1Γdouble
(C.7)
then simple substitution gives
Ddouble =2m
2m(2m1)S.(C.8)
Since Dis defined to be the diversity equivalent
to a distribution of equally represented species, then
S=1andDbecomes simply 2mi.e. double the orig-
inal value and hence the Hill doubling criterion is
fulfilled.
ORCID iDs
ATAugousti https://orcid.org/0000-0003-3000-
9332
References
[1] Gini C 1912 Var iabilita e mutabilita Memorie di
Metodologica Statistica ed E Pizetti and T Salvemini (Roma,
Italy: Liberia Eredi Virgilio Veschi)
[2] Simpson E H 1949 Measurement of diversity Nature 163 688
[3] Yule G U 1944 Statistical Study of Literary Vocabulary
(Cambridge: Cambridge University Press)
[4] Fisher R A, Corbet A S and Williams C B 1943 The relation
between the number of species and the number of
individuals in a random sample of an animal population J.
Anim. Ecol. 12 42– 58
[5] MacArthur R H 1965 Patterns of species diversity Biol. Rev.
40 510–33
[6] Pielou E C 1966 Shannon’s formula as a measure of specific
diversity: its use and misuse Am. Nat. 100 463 –5
[7] Ben-Naim A 2017 Information Theory Part I: An
Introduction to the Fundamental Concepts (Singapore: World
Scientific)
[8] MacArthur R H and Wilson E O 1967 The Theory of Island
Biogeography (Princeton, NJ: Princeton University Press)
[9] Whittaker R H 1972 Evolution and measurement of species
diversity Ta x o n 21 213 51
[10] Benson L, Ram N, Almeida D M, Zautra A J and Ong A D
2018 Fusing biodiversity metrics into investigations of daily
life: illustrations and recommendations with emodiversity J.
Gerontol. B Psychol. Sci. Soc. Sci. 73 75–86
[11] Wei S, Mortensen M S, Stokholm J, Brejnrod A D,
Thorsen J, Rasmussen M A, Trivedi U, Bisgaard H and
7
Phys. Biol. 18 (2021) 066004 ATAugoustiet al
Sørensen S J 2018 Short- and long-term impacts of
azithromycin treatment on the gut microbiota in children: a
double-blind, randomized, placebo-controlled trial
EBioMedicine 38 265–72
[12] Jost L 2006 Entropy and diversity Oikos 113 363–75
[13] Hoffmann S and Hoffmann A 2008 Is there a ‘true’
diversity? Ecol. Econ. 65 213–5
[14] Ricotta C 2005 Through the jungle of biological diversity
Acta Biotheor. 53 29–38
[15] Guiasu1 R C and Guiasu S 2012 The weighted
Gini– Simpson index: revitalizing an old index of
biodiversity Int. J. Ecol. 2012 478728
[16] Xu S, Böttcher L and Chou T 2020 Diversity in biology:
definitions, quantification and models Phys. Biol. 17 031001
[17] Patil G P and Taillie C 1982 Diversity as a concept and its
measurement J. Am. Stat. Assoc. 77 548– 61
[18] Hill M 1973 Diversity and evenness: a unifying notation and
its consequences Ecology 54 427432
[19] Guiasu R C and Guiasu S 2010 The Rich– Gini–Simpson
quadratic index of biodiversity Nat. Sci. 21130–7
[20] Tunnicliffe M and Hunter G 2021 The predictive capabilities
of mathematical models for the type-token relationship in
English language corpora Comput. Speech Lang. 70
101227
[21] Hatton L L and Warr G 2019 Strong evidence of an
information-theoretical conservation principle linking all
discrete systems R. Soc. Open Sci. 6191101
[22] Ehrenfest P and Onnes H K 1915 XXXIII. Simplified
deduction of the formula from the theory of combinations
which Planck uses as the basis of his radiation theory
London, Edinburgh Dublin Phil. Mag. J. Sci. 29
297–301
8
... The biological indices used to analyze the community structure of fouling organisms are the Shannon-Wiener diversity index (Odum 1993), Evenness index (Krebs 1985), and Simpson's dominance index (Odum 1993 Biodiversity signifies the quantity of organisms present within an ecosystem, and the vitality of an ecosystem is evident when it exhibits a robust diversity index. Conversely, when an ecosystem displays a diminished diversity index, it indicates a state of decline or degradation (Augousti et al. 2021). The results of the data analysis of fouling organisms obtained diversity index values (H'), including Station 1 (cage frames) with H': 1.82, Station 2 (concrete blocks) with H': 0.69, and Station 3 (ship hulls) with H': 1.52. ...
Article
Full-text available
Putro SP, Haqi MDA, Muhammad F, Hariyati R, Helmi M. 2024. The influence of different substrate types on the diversity of macrofouling organisms at the submerged coastal ecosystem of Karimunjawa Islands, Indonesia. Biodiversitas 25: 3394-3400. Colonizing fouling organisms exhibit taxonomic diversity and distribution in response to environmental variability of the submerged marine ecosystem. This study compares macrofouling organisms found on various substrate types at Menjangan Besar Island, Karimunjawa, Central Java, Indonesia which were varied from coarse to smooth surfaces in the water column. The consisted of High-Density Polyethylene (HDPE) cage framework (3 stations) and lower part of a wooden ship hull are the smooth ones (3 stations), and concrete block at the Menjangan Besar Island dock is the coarse ones (3 stations). Sampling of macrofouling organisms was collected from the substrate within 0.25 m2 area at each station with the purposive random sampling method. Identification of macrofouling organisms was done under Canon MZ25 stereomicroscope. We identified 15 species from 7 classes from Menjangan Besar Island. The most dominant species on concrete blocks was Septifer sp. (96 ind.m-2), on the ship’s hull was Littoraria sp. (416 ind.m-2), and on the cage framework was Crassostrea angulata Thunberg 1793 (240 ind.m-2). The diversity index (H’) of fouling organisms ranged from 0.69 to 1.82, while the evenness index (e) ranged from 0.25 to 0.67, and the dominance index (C) ranged from 0.24 to 0.51. Differences in index values are likely due to factors related to food source distribution and substrate types. Salinity, temperature, and turbidity were the physical-chemical water parameters significantly affecting fouling organism distribution (BIO-ENV, Primer 6.1.5; Corr. value (r)=0.722). Anthropogenic activities such as aquaculture, fishing and tourism in the area are suspected to influence hydrodynamic changes, subsequently affecting the fouling organism attachment and colonization processes, especially organic matter generated from marine culture activities.
... We may refer to Equation 8 as the Gini-Simpson Index for Design or GSID. Equation 8 is also known as the Gini-Simpson Diversity Index and is widely used by ecologists as a well-known conventional index for measuring diversity in an ecosystem (Chen et al., 2018;Augousti et al., 2021). The unbiased estimate of λ used in Equation 8 is also equivalent to the normalised Herfindahl-Hirschman Index (Cracau & Lima, 2016). ...
Preprint
Full-text available
Past research relates design creativity to 'divergent thinking,' i.e., how well the concept space is explored during the early phase of design. Researchers have argued that generating several concepts would increase the chances of producing better design solutions. 'Variety' is one of the parameters by which one can quantify the breadth of a concept space explored by the designers. It is useful to assess variety at the conceptual design stage because, at this stage, designers have the freedom to explore different solution principles so as to satisfy a design problem with substantially novel concepts. This article elaborates on and critically examines the existing variety metrics from the engineering design literature, discussing their limitations. A new distance-based variety metric is proposed, along with a prescriptive framework to support the assessment process. This framework uses the SAPPhIRE model of causality as a knowledge representation scheme to measure the real-valued distance between two design concepts. The proposed framework is implemented in a software tool called 'VariAnT.' Furthermore, the tool's application is demonstrated through an illustrative example.
... Its value is less affected or influenced by the existence of species with smaller values of the relative abundance ( ) in a sample. Augousti et al have recently proposed a new index whose value is close to Simpson index value for communities with large number of individuals and large number of species with equal relative abundance [23]. Zhou et al have shown how Simpson's index can be used for the purpose of diversifying multi-aspect search results [24]. ...
Article
Full-text available
In the present article, we have proposed a new biodiversity index based on the standard deviation of the number of individuals belonging to a species in a collection of biological organisms of different species. Using this index of biodiversity, we have proposed an index for the measurement of evenness of the distribution of individuals among different species in the collection of organisms. Using a hypothetical dataset representing the distribution of individuals among six species, for six different samples, we have calculated the indices defined by us and compared their values with the values of Shannon-Wiener diversity index, Simpson diversity index and their corresponding evenness indices calculated from the same dataset. It is observed that, our new indices undergo much greater changes, compared to the changes of the most commonly used indices, due to any change in the relative proportions of species present in a sample. This observation indicates that any change in the number of organisms of a species, in an area or habitat, is better reflected in the values of these new indices.
Preprint
Full-text available
This report summarizes the first five years of data from a continuing citizen science project examining the effects of wild-land fuel reduction treatments (i.e., prescribed fire and shrub-layer mastication) on bird communities in dry, mixed conifer forests in southwestern Colorado, USA. In addition to surveying bird community species composition, the authors discuss feeding guilds; nesting behavior; and migratory strategies of the 88 species that have been observed in the study.
Preprint
Full-text available
Over the past four-years, volunteers from the Weminuche Audubon Society chapter in Pagosa Springs, CO, have conducted a citizen science project examining the response of bird communities in dry, mixed-conifer forests to wildland fuel reduction treatments. This report details the findings from this study.
Technical Report
Full-text available
Citizen Science Project assessing the response of bird communities to prescribed fire and shrub-layer mastication in ponderosa pine forests in the San Juan Mountains of CO, USA.
Article
Full-text available
Diversity indices are useful single-number metrics for characterizing a complex distribution of a set of attributes across a population of interest. The utility of these different metrics or sets of metrics depends on the context and application, and whether a predictive mechanistic model exists. In this topical review, we first summarize the relevant mathematical principles underlying heterogeneity in a large population, before outlining the various definitions of ‘diversity’ and providing examples of scientific topics in which its quantification plays an important role. We then review how diversity has been a ubiquitous concept across multiple fields, including ecology, immunology, cellular barcoding experiments, and socioeconomic studies. Since many of these applications involve sampling of populations, we also review how diversity in small samples is related to the diversity in the entire population. Features that arise in each of these applications are highlighted.
Article
Full-text available
Diverse discrete systems share common global properties that lack a unifying theoretical explanation. However, constraining the simplest measure of total information (Hartley-Shannon) in a statistical mechanics framework reveals a principle, the conservation of Hartley-Shannon information (CoHSI) that directly predicts both known and unsuspected common properties of discrete systems, as borne out in the diverse systems of computer software, proteins and music. Discrete systems fall into two categories distinguished by their structure: heterogeneous systems in which there is a distinguishable order of assembly of the system's components from an alphabet of unique tokens (e.g. proteins assembled from an alphabet of amino acids), and homogeneous systems in which unique tokens are simply binned, counted and rank ordered. Heterogeneous systems are characterized by an implicit distribution of component lengths, with sharp unimodal peak (containing the majority of components) and a power-law tail, whereas homogeneous systems reduce naturally to Zipf's Law but with a drooping tail in the distribution. We also confirm predictions that very long components are inevitable for heterogeneous systems; that discrete systems can exhibit simultaneously both heterogeneous and homogeneous behaviour; and that in systems with more than one consistent token alphabet (e.g. digital music), the alphabets themselves show a power-law relationship.
Article
Full-text available
Background Macrolides are commonly prescribed for respiratory infections and asthma-like episodes in children. While their clinical benefits have been proved, concerns regarding the side-effects of their therapeutic use have been raised. Here we assess the short- and long-term impacts of azithromycin on the gut microbiota of young children. Methods We performed a randomized, double-blind, placebo-controlled trial in a group of children aged 12–36 months, diagnosed with recurrent asthma-like symptoms from the COPSAC2010 cohort. Each acute asthma-like episode was randomized to a 3-day course of azithromycin oral solution of 10 mg/kg per day or placebo. Azithromycin reduced episode duration by half, which was the primary end-point and reported previously. The assessment of gut microbiota after treatment was the secondary end-point and reported in this study. Fecal samples were collected 14 days after randomization (N = 59, short-term) and again at age 4 years (N = 49, long-term, of whom N = 18 were placebo treated) and investigated by 16S rRNA gene amplicon sequencing. Findings Short-term, azithromycin caused a 23% reduction in observed richness and 13% reduction in Shannon diversity. Microbiota composition was shifted primarily in the Actinobacteria phylum, especially a reduction of abundance in the genus Bifidobacterium. Long-term (13–39 months after treatment), we did not observe any differences between the azithromycin and placebo recipients in their gut microbiota composition. Interpretation Azithromycin treatment induced a perturbation in the gut microbiota 14 days after randomization but did not have long-lasting effects on the gut microbiota composition. However, it should be noted that our analyses included a limited number of fecal samples for the placebo treated group at age 4 years. Fund Lundbeck Foundation, Danish Ministry of Health, Danish Council for Strategic Research, Capital Region Research Foundation, China Scholarship Council.
Article
Full-text available
Objectives: Functionalist emotion and ecological systems theories suggest emodiversity-the variety and relative abundance of individuals' emotion experiences-is beneficial for psychological and physical health and may change with age. This paper examines and provides recommendations for operationalization of diversity-type intraindividual variability (IIV) constructs using intensive longitudinal data, and demonstrates the utility of emodiversity by examining its links to physical health moderated by mean levels of emotion and age. Method: Using data from a daily diary study of 138 adults (age 40 to 65 years), we consider how item selection, response scale, choice of diversity index, and number of occasions enable/constrain mapping to theory, measurement reliability, and empirical inquiry. Results: Item selection and response scale had limited influence on rank-order differences in diversity. Reliable measurement (r ≥ .8) required a minimum of 6 to 12 occasions depending on choice of index, theoretical conception, study design, and distribution of diversity scores. The empirical findings suggest mean level of negative affect, rather than age, moderates the relation between negative emodiversity and health. Discussion: This study provides recommendations for the calculation of diversity-type IIV constructs and illustrates the potential for study of emodiversity to contribute to understanding of successful aging.
Article
Full-text available
The Gini-Simpson quadratic index is a classic measure of diversity, widely used by ecologists. As shown recently, however, this index is not suitable for the measurement of beta diversity when the number of species is very large. The objective of this paper is to introduce the Rich-Gini-Simpson quadratic index which preserves all the qualities of the classic Gini-Simpson in-dex but behaves very well even when the num-ber of species is very large. The additive parti-tioning of species diversity using the Rich-Gini-Simpson quadratic index and an application from island biogeography are analyzed.
Article
We investigate the predictive capability of mathematical models of the type-token relationship applied to the vocabulary growth profiles of selected of English language documents. We compare the existing Good-Toulmin and Heaps formulae with an alternative approach based on Bernoulli trial word selection from a fixed finite vocabulary using the Zipf and Zipf-Mandelbrot probability distributions. We make two major observations: firstly, while the Zipf-Mandelbrot model makes better predictions of vocabulary growth than the Zipf model, the optimized parameters of the latter correlate better than those of the former with statistics gleaned independently from the data. Secondly, the mean of the Zipf-Mandelbrot, Good-Toulmin and Heaps models provides a more consistent and unbiased prediction of vocabulary than any individual model alone.
Article
This paper puts forth the view that diversity is an average property of a community and identifies that property as species rarity. An intrinsic diversity ordering of communities is defined and is shown to be equivalent to stochastic ordering. Also, the sensitivity of an index to rare species is developed, culminating in a crossing-point theorem and a response theory to perturbations. Diversity decompositions, analogous to the analysis of variance, are discussed for two-way classifications and mixtures. The paper concludes with a brief survey of genetic diversity, linguistic diversity, industrial concentration, and income inequality.