ArticlePDF Available

Abstract and Figures

Polish Konik remains one of the most important horse breeds in Poland. The primitive, native horses with a stocky body and mouse-like coat color are protected by a conservation program, while their Polish population consists of about 3,480 individuals, representing 16 dam and six sire lines. To define the population's genetic structure, mitochondrial DNA and Y chromosome sequence variables were identified. The mtDNA whole hypervariable region analysis was carried out using the Sanger sequencing method on 233 Polish Koniks belonging to all dam lines, while the Y chromosome analysis was performed with the competitive allele-specific PCR genotyping method on 36 horses belonging to all sire lines. The analysis of the mtDNA hypervariable region detected 47 SNPs, which assigned all tested horses to 43 haplotypes. Most dam lines presented more than one haplotype; however, five dam lines were represented by only one haplotype. The haplotypes were classified into six (A, B, E, J, G, R) recognized mtDNA haplogroups, with most horses belonging to haplogroup A, common among Asian horse populations. Y chromosome analysis allocated Polish Koniks in the Crown group, condensing all modern horse breeds, and divided them into three haplotypes clustering with coldblood breeds (28 horses), warmblood breeds (two horses), and Duelmener Pony (six horses). The clustering of all Wicek sire line stallions with Duelmener horses may suggest a historical relationship between the breeds. Additionally, both mtDNA and Y chromosome sequence variability results indicate crossbreeding before the studbooks closure or irregularities in the pedigrees occurred before the DNA testing introduction.
Content may be subject to copyright.
Mitochondrial DNA and Y chromosome
reveal the genetic structure of the native
Polish Konik horse population
Adrianna Dominika Musiał
1
, Lara Radović
2,3
, Monika Stefaniuk-
Szmukier
1
, Agnieszka Bieniek
1
, Barbara Wallner
2
and
Katarzyna Ropka-Molik
1
1Department of Animal Molecular Biology, National Research Institute of Animal Production,
Balice, Poland
2Institute of Animal Breeding and Genetics, University of Veterinary Medicine Vienna, Vienna,
Austria
3Vienna Graduate School of Population Genetics, University of Veterinary Medicine Vienna,
Vienna, Austria
ABSTRACT
Polish Konik remains one of the most important horse breeds in Poland.
The primitive, native horses with a stocky body and mouse-like coat color are
protected by a conservation program, while their Polish population consists of about
3,480 individuals, representing 16 dam and six sire lines. To dene the populations
genetic structure, mitochondrial DNA and Y chromosome sequence variables were
identied. The mtDNA whole hypervariable region analysis was carried out using the
Sanger sequencing method on 233 Polish Koniks belonging to all dam lines, while the
Y chromosome analysis was performed with the competitive allele-specic PCR
genotyping method on 36 horses belonging to all sire lines. The analysis of the
mtDNA hypervariable region detected 47 SNPs, which assigned all tested horses to
43 haplotypes. Most dam lines presented more than one haplotype; however, ve
dam lines were represented by only one haplotype. The haplotypes were classied
into six (A, B, E, J, G, R) recognized mtDNA haplogroups, with most horses
belonging to haplogroup A, common among Asian horse populations. Y
chromosome analysis allocated Polish Koniks in the Crown group, condensing all
modern horse breeds, and divided them into three haplotypes clustering with
coldblood breeds (28 horses), warmblood breeds (two horses), and Duelmener Pony
(six horses). The clustering of all Wicek sire line stallions with Duelmener horses may
suggest a historical relationship between the breeds. Additionally, both mtDNA and
Y chromosome sequence variability results indicate crossbreeding before the
studbooks closure or irregularities in the pedigrees occurred before the DNA testing
introduction.
Subjects Biodiversity, Conservation Biology, Genetics, Molecular Biology, Population Biology
Keywords Polish Konik, mtDNA, Y chromosome, SNPs, Diversity, Horse breeding, Primitive horse
breed, Hypervariable region, MSY region
How to cite this article MusiałAD, RadovićL, Stefaniuk-Szmukier M, Bieniek A, Wallner B, Ropka-Molik K. 2024. Mitochondrial DNA
and Y chromosome reveal the genetic structure of the native Polish Konik horse population. PeerJ 12:e17549 DOI 10.7717/peerj.17549
Submitted 28 December 2023
Accepted 20 May 2024
Published 20 June 2024
Corresponding author
Adrianna Dominika Musiał,
adrianna.musial@iz.edu.pl
Academic editor
Syed Ahmad
Additional Information and
Declarations can be found on
page 18
DOI 10.7717/peerj.17549
Copyright
2024 Musiałet al.
Distributed under
Creative Commons CC-BY 4.0
INTRODUCTION
Polish Konik (Polish Primitive Horse; Fig. 1) is one of the most important native horse
breeds of Poland. The Polish Konik horse, like many other primitive breeds, presents traits
such as good health and high fertility, resilience, and adaptability to life in forest conditions
(Hendricks, 2007;Doboszewski et al., 2017). Its body is strong and stocky, with a primitive
mouse-grey coat color and characteristic black stripe along the back (Janczarek, Pluta &
Paszkowska, 2017). The Polish Koniks are good working horses and are especially valued
in gardening farms (Hendricks, 2007), where the benecial effects of grazing have been
proven for plants and rare bird populations (Doboszewski et al., 2017). They also take a
part in nature conservation being used for rewilding processes and habitat protection
(Reke, Zarina & Vinogradovs, 2019). On the other hand, due to its gentleness and docility,
this breed is also used for riding (Hendricks, 2007). Thanks to its advantages, Polish Konik
studs can be found not only in Poland but also abroad, under stable and reserve
management or used as recreational mounts in countries such as the Netherlands,
Germany, Latvia, Belgium, and the UK (Gorecka-Bruzda et al., 2020).
Polish Konik horses are bred in traditional studs and semi-feral/free-roaming groups
like in the forest sanctuary of Popielno Research Station (Poland) and are considered a
valuable genomic resource (Gorecka-Bruzda et al., 2020;Wolc & Bali
nska, 2010). Although
the population has increased in recent years, it is still limited. Due to their low census
population size, genetic resource conservation programs are established to maintain
population size, genetic diversity, and preserve this endangered breed (Jaworski &
Tomczyk-Wrona, 2019;Pasicka, 2013).
Despite the importance of this native breed, the origin of the Polish Konik has not been
clearly explained so far. The most common theory assumes that Polish Koniks are the
direct descendent of the European wild horsethe Tarpan, a species that inhabited the
forests of Europe and became extinct in the 19th century (Pasicka, 2013). However, in
recent years, the direct descent of the Polish Konik from the Tarpan has been highly
debated. Lovasz, Fages & Amrhein (2021) suggest that it is a manmade myth that hinders
effective breed conservation management. Recent advances in the eld of ancient DNA
analysis showed that the Tarpan, referred to as the last wild horse, was also a representative
of a cluster of domestic horses that spread from 2,200 BC onwards to the second
millennium BC, and were a source of modern horses. The results shattered the common
hypothesis of Tarpans as the wild ancestors of modern horses or hybrids with Przewalski
horses (Librado et al., 2021), making the uncovering of Polish Konik origin an important
and difcult challenge to solve.
Although a century has passed since the Polish Konik began to be the object of interest
of researchers, there is still no clear information about the origin of the breed. The rst
research conducted on the Polish Koniks was carried out in 1914 and suggested the
presence of primitive horses, resembling Tarpans, near the city of Biłgoraj (Lublin
Voivodeship, Poland). In the 1920s, the name Polish Konikwas introduced into
literature by Prof. Tadeusz Vetulani, who played an outstanding role in starting the stud
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 2/22
breeding of horses. The history of breeding dates back to 1923, when Polish Koniks were
placed in the Janów Podlaski stud, and the following years saw the opening of several new
studs. Unfortunately, the progress in breeding was interrupted by World War II, when
most of the Polish Koniks were taken to Germany or were lost (Tomczyk-Wrona, 2022).
After the end of the war, plenty of efforts were made to restore the population of Polish
Koniks and resume their breeding; however, most of the horses did not have documented
origins. The supervision of the Polish Koniks breeding led to the publication of the rst
Studbook of Polish Koniksin 1962, and in 1984 the studbooks became closed, which
eliminated the possibility of crossing with individuals of other horse breeds, and breeding
is conducted in a pure breed (Mackowski et al., 2015;Tomczyk-Wrona, 2022).
The current population of the Polish Konik registered in the studbook (data for 2022)
consists of ~3,480 individuals: ~1,760 mares, 183 stallions, and ~1,540 foals. Over the last
15 years, the number of stallions has not changed signicantly; a slight increase in their
numbers was noted at the beginning of the 21
st
century. In the case of Polish Konik mares,
the number of horses has doubled since 2010 and has almost quadrupled since 2005 due to
the breeding approach and subsidies granted mainly for breeding mares (Polish Horse
Breeders Association, 2022). All Polish Koniks come from 35 female lines, of which 19
became extinct and only 16 are currently active: Liliputka I (1920), Karolka (1933), Zaza
(1933), Urszulka (1934), Tarpanka I (1937), Traszka (unknown), Tunguska (1949),
Figure 1 Polish Konik in Roztocza
nski National Park, Poland (photo credit: Adrianna Musiał).
Full-size
DOI: 10.7717/peerj.17549/g-1
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 3/22
Tygryska (1928), Popielica (1937), Wola (1943), Białka (1944), Ponętna (1946),
Misia II (1948), Dzina I (unknown), Bona (1954), and Geneza (1965); (Jaworski, 1997).
The rst seven lines showed a clear progression in the last years, while the remaining
lines show only slight progression or stagnation. Horses belonging to the three most
numerous dam lines (Traszka, Tarpanka I, and Zaza) make up more than half of all the
individuals. Due to the persistent state of stagnation, the lines originating from the
mares Bona, Ponętna, Misia II, Geneza, and Białka are threatened with extinction. The
Polish Konik males are distinguished into six lines, which were founded by the
stallions: Chochlik (1940), Goraj (1935), Glejt I (1944), Wicek (<1930), Liliput (1918),
and Myszak (1937), of which the Liliput and Glejt I lines are the least numerous
(Tomczyk-Wrona, 2022).
Recently, mitochondrial DNA and Y chromosome diversity studies using Single
Nucleotide Polymorphism (SNP) have started to play an increasing role in population
genetics including horse genetics. The genetic information is transmitted as a single
haplotype (HT) block, which mirrors dam and sire line history. Thus, we can easily detect
pedigree incongruences since all individuals from a certain line (sire or dam) should have
the same haplotype (Hutchison et al., 1974;Liu, 2010).
The polymorphisms present in the mitochondrial DNA D-loop sequence made it
possible to distinguish the dam lines and identify the genetic diversity, as well as maternal
ancestry and relationships, in many different horse breeds, like Arabian horses (Khanshour
& Cothran, 2013), Thoroughbreds (Yoon et al., 2018), Cleveland Bay horses (Dell et al.,
2020), Tibetan horses (Yang et al., 2018), Holstein horses (Engel et al., 2021), Polish Draft
horses (Myćka et al., 2022), or Hucul horses (Czerneková, Kott & Majzlík, 2013). The part
of the mitochondrial DNA used in most diversity research is hypervariable region 1
(HVR1), localized in the D-loop. Since the early 2000s, single individuals of Polish Konik
have started to be included in multi-breed studies of mtDNA variation (Cothran, Juras &
Macijauskiene, 2005;Kusza et al., 2013;Cieślak et al., 2017). However, a complete analysis
of all present Polish Konik dam lines has not been performed so far.
Regarding paternal ancestry, the researchers focus on the Y chromosome male-specic
region (MSY), which is inherited without recombination from fathers to sons (Jobling &
Tyler-Smith, 2017) and allows tracing the sire lines (Felkel et al., 2019;Radovic et al., 2021).
Previous efforts established the possibility of exploiting the variation present in the MSY
region to identify the diversity of sire lines as well as illustrate male lineagesdemographic
history (e.g., Remer et al., 2022). Currently, MSY variation (>3,000 variants) groups
modern domesticated stallions tested so far into one 1,500-year-old haplogroupthe
Crown (daC). Only a handful of remote horse breeds carry haplotypes beyond the Crown
(Bozlak et al., 2023;Felkel et al., 2018), classied as Non Crown.
Here, we investigate the current genetic diversity of Polish Konik maternal and paternal
lines using uniparental markers. The aim of this research was the investigation of the
Polish Konik population structure in terms of dam and sire lines, based on the analysis of
the mtDNA whole D-loop sequence as well as the analysis of selected Y chromosomal
markers.
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 4/22
MATERIALS AND METHODS
Founder lines sampling
The variability of mitochondrial DNA D-loop sequence was examined in 233 Polish
Koniks belonging to all 16 dam founding lines: Białka (nine individuals), Dzina I (14),
Geneza (six), Karolka (17), Liliputka I (19), Misia II (six), Ponętna (10), Popielica (11),
Tarpanka I (20), Traszka (44), Tunguska (13), Tygryska (seven), Urszulka (22), Wola (11),
Zaza (21), and Bona (three). The number of analysed horses constituted approximately
7.3% of the total active Polish Konik population, and the share of individual dam lines
included in the research was also proportional to the total number of horses belonging to
each dam line. MSY variability was analysed in 36 stallions selected from the individuals
above and belonged to all six sire founding lines: Goraj (six individuals), Liliput (six),
Wicek (six), Glejt I (six), Chochlik (six), and Myszak (six). The lines were assigned to the
horses based on pedigree data, and all analysed animals were available in the pedigree
database of the Polish Horse Breeders Association. The material used in this study were
hair follicles and blood samples that came from a genetic material bank of the National
Research Institute of Animal Production, Poland.
DNA isolation
The DNA extraction was made using the Sherlock AX kit (A&A Biotechnology, Gda
nsk,
Poland) according to the manufacturers protocol, and DNA quality was determined by
checking concentration and purity on the NanoDrop 2000 spectrophotometer (Thermo
Fisher Scientic, Waltham, MA, USA). DNA samples with concentration >40 ng/µl and
purity (A260/A280) between 1.82.0 were qualied for the next steps and stored at 20 C.
Mitochondrial DNA analysis
For the amplication of the mitochondrial DNA D-loop hypervariable region 1 and
hypervariable region 2, three pairs of primers were designed based on the GenBank Equus
caballus reference sequence: NC_001640 (Table 1). PCR products covered a total region of
1,062 bp. Amplication was performed on all 233 individuals using the Phanta Ready Mix
(Vazyme Biotech, Nanjing, China) according to the instructions. To check the specicity of
the obtained products, the separation on the 3% agarose gel with the ethidium bromide
addition, along with a DNA length markerMarker1 1001,000 bp (A&A Biotechnology,
Gda
nsk, Poland), was performed (120 V, 30 min.). PCR products were cleaned from
primers and free nucleotides with the enzymatic method using the EPPiC Fast reagent
(A&A Biotechnology, Gda
nsk, Poland) and used as a template for sequencing by the
Sanger method. The PCR for the sequencing reaction was performed on 233 individuals
for each amplicon (699 samples in total) with the BigDye Terminator v3.1 Cycle
Sequencing Kit (Thermo Fisher Scientic, Waltham, MA, USA) according to the
instructions, and the products were repuried with the BigDye XTerminator Purication
Kit (Thermo Fisher Scientic, Waltham, MA, USA) according to the protocol. The
capillary electrophoresis was performed on 3500xL Genetic Analyzer (Thermo Fisher
Scientic, Applied Biosystems, Foster City, CA, USA) using POP-7TM Polymer for 3500/
3500xL (Thermo Fisher Scientic, Applied Biosystems, Foster City, CA, USA). The results
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 5/22
in the form of chromatograms were analysed using BLAST (Altschul et al., 1990), FinchTV
1.3.0 (Geospiza, Inc., Seattle, WA, USA), DnaSP 5.10.01 (Librado & Rozas, 2009), and
Variant Analysis application (Thermo Fisher Scientic Cloud, Waltham, MA, USA) and
compared with a GenBank reference sequence NC_001640.
The SNPs made it possible to identify individual haplotypes and all the sequences were
subjected to phylogenetic analysis. The phylogenetic tree illustrating the variability of
analysed mitochondrial DNA fragments was constructed using Mega 11.0.13 software
(Tamura, Stecher & Kumar, 2021) and iTOL 6.8.1 (Letunic & Bork, 2021) with the
neighbor-joining method (Saitou & Nei, 1987), including 1,000 bootstrap replications.
To visualize the connections between all mtDNA HTs of Polish Koniks, the
median-joining network (Bandelt, Forster & Röhl, 1999) was constructed in the PopART
phylogenetic software (https://popart.maths.otago.ac.nz/, accessed September 2023).
Haplotypes found within the studied Polish Konik population were compared with
sequences stored in the GenBank database using BLAST (Altschul et al., 1990), and the
similarities between Polish Konik population HTs with other breeds have been described.
The sequences representing haplogroups published by Achilli et al. (2012) were found
among the BLAST results. The sequence with the highest similarity percentage was
selected, and its haplogroup was checked to categorize the identied Polish Konik
haplotypes into the haplogroups.
Y chromosome analysis
Y chromosomal haplotypes were determined for 36 Polish Koniks, representing all six
founding sire lines. For haplotyping, a downscaled haplotype structure of the most recent
horse Y phylogeny was constructed (Bozlak et al., 2023) based on 114 selected variants (92
in the Crown and 22 Non-Crown, see Table S1) that determine 115 HTs. In a hierarchical
manner (as described by Remer et al. (2022)), allelic states of variants in the downscaled
haplotype structure were determined via the competitive allele-specic PCR (KASPTM,
www.lgcgroup.com) genotyping method under the standard KASP
TM
genotyping protocol
(lgcgroup.com). The analysis was performed on a CFX96 Touch
Ò
BioRad Real-Time PCR
machine. In addition to Polish Konik samples, each reaction included two positive controls
with known allelic states and two negative controls, including female DNA and water.
In two individuals, the tetranucleotide microsatellite fBVB (GATA14/GATA15) was
screened following Felkel et al. (2019) and Remer et al. (2022).
Table 1 Primer sequences and length of the amplicons.
Primer Sequences (53) Length (bp)
amp1 F: AACGTTTCCTCCCAAGGACT
R: GTAGTTGGGAGGGTTGCTGA
397
amp2 F: ACCCCATCCAAGTCAAATCA
R: CAGGTGCACTTGTTTCCTATG
462
amp3 F: ACCTACCCGCGCAGTAAGCAA
R: ACGGGGGAAGAAGGGTTGACA
306
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 6/22
Raw data were analysed with Bio-Rad CFX Manager 3.1
Ò
software (BioRad). The allelic
states of all tested variants were catenated, while the allelic states of variants that were not
tested were imputed according to the published HT structure (Bozlak et al., 2023, see
Table S2). For visualization, a median-joining haplotype network was constructed with the
program Network 10.2 (Bandelt, Forster & Röhl, 1999), and the output was redrawn as a
haplotype frequency plot with Canva Pro (Canva, https://www.canva.com/pro/, accessed
August 2023).
RESULTS
Mitochondrial DNA variability
The whole mitochondrial DNA hypervariable region 1 and hypervariable region 2
sequences were determined in 233 Polish Konik horses using Sanger sequencing of three
PCR products (Table S3). The alignment of three Sanger sequencing products revealed the
entire mitochondrial DNA hypervariable region for each horse, allowing comparison with
a publicly available reference sequence (NC_001640) and between all samples. Cutting the
obtained sequences from the location of the rst identied SNP (15,542 bp) to the last one
(16,611 bp) led to the receiving of 1,070 bp fragments, whose analysis detected 47 variable
sites. The analysis in DnaSP indicated a nucleotide diversity result of 0.010, and the average
number of nucleotide differences of 11.104. The 43 mtDNA HTs were dened in Polish
Konik samples and reported to the GenBank database, where received accession numbers
OR827103OR827145 (Table 2). The haplotype diversity amounted to 0.96.
Among the considered dam lines, 11 of 16 lines presented more than one HT (Table 2).
The exceptions were Białka, Bona, Dzina I, Popielica, and Tunguska maternal lines, in
which all examined individuals presented the same, unique haplotype. On the other hand,
in both Traszka and Zaza lines, there were six different HTs. Some of the haplotypes were
recognized in two different lines: HT9 in Karolka and Ponętna lines, HT33 in Tygryska
and Zaza, HT6 in Karolka and Liliputka I, HT20 in Tarpanka I and Zaza, and HT19 in
Tarpanka I and Popielica. Compared to the reference sequence, the most nucleotide
differences were found in HT18 (Ponętna) and HT39 (Wola) HTs22 SNPs. Only one
nucleotide difference from the reference sequence was found in HT22 (Tarpanka) and
HT30 (Tunguska) HTs. Out of 47 detected SNP positions, there was one insertion position
(g.16557_16558insC), one transversion position (g.16543 T > A), one position with both
transversion and transition (g.15776 T > C, T > G; Fig. 2), and transitions in the other 44
positions.
The obtained Polish Konik HTs were compared with sequences available in GenBank
showing that none of them presents 100% coverage. The similarity to Polish Konik HTs of
at least 99% has been presented by a whole range of different horse breeds, such as Arabian
horses (KU575096.1), Polish Draft horses (ON052712.1), Akhal Teke horses (JN398385),
Thoroughbred horses (KC202956.1), Selle Francais horses (OR909772.1), East Asian
horses (JQ340108.1), Chinese Native horses (JQ710930.1), and also Przewalski horses
(ON393916.1).
To construct the neighbor-joining tree (Fig. 3), supported by bootstrap percentages (BP)
computed with 1,000 replicates, we used 233 sequences representing all 16 Polish Konik
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 7/22
Table 2 Mitochondrial DNA variants located between positions 15,542 and 16,611 detected in all Polish Konik haplotype sequences compared to the reference
sequence NC_001640.The number of horses belonging to each haplotype is included. The HTs presented by more than one dam line are underlined. All obtained HTs
were deposited in the GenBank database and received accession numbers OR827103OR827145.
Dam line/HT GenBank
acc. no.
HG*
No. of horses
15542
15585
15597
15598
15601
15602
15604
15615
15616
15617
15635
15650
15659
15666
15667
15672
15703
15762
15766
15769
15770
15771
15775
15776
15777
15806
15807
15810
15811
15826
15827
15868
15870
15871
15956
15974
15995
15996
16007
16022
16540
16543
16546
16551
16557_16558insC
16559
16611
Ref: NC_001640 CGATT CGAAT CATGAGTGCTCCCTACC ACAAT CCACATT TC TT G- CA
Białka HT1 OR827103 E 9T.G..T.....G.A..................T.GT...........
Bona HT2 OR827104 R 3...C.T.GG...C...C...T.TC.T....G...GT.C..TA.....
Dzina I HT3 OR827105 A 14.A..CT.......................G...........AC..T.
Geneza HT4 OR827106 E 5T.G..T.....G.A.......T..........T.GT...........
Geneza HT5 OR827107 A/B1T.G..T.....G.A...............G.................
Karolka HT6 OR827108 A 10.............................G..............C..
Karolka HT7 OR827109 A 2.A..CT...........ATCT.TG..T.T.G.T.G.G.CC.AC..T.
Karolka HT8 OR827110 G 2T.G..T....TG.A.AC............G.................
Karolka HT9 OR827111 G 2T.G..T....TG.A.AC...............TT.............
Karolka HT10 OR827112 A 1.A...T..........C............G...........A.....
Liliputka I HT11 OR827113 B 13.A.........G.A.............G.G.................
Liliputka I HT6 OR827108 A 3.............................G..............C..
Liliputka I HT12 OR827114 A/B3.........C.G.A...............G.................
Misia II HT13 OR827115 A 3.A...T...............T....T...G.TT.............
Misia II HT14 OR827116 R 2...C.T.GG.......C...T.T..T....G..TGT.C..TA.....
Misia II HT15 OR827117 A 1...C.T.GG.......C.............G..T.....CTA.....
Ponętna HT16 OR827118 A 4.A...TA.........C............G...........A..C..
Ponętna HT17 OR827119 A 3.A...TA.........C...............T........A..C..
Ponętna HT18 OR827120 A 2.A...TA.........CATCT.TG..T.T.GCT.G.G.CC.A..C.G
Ponętna HT9 OR827111 G 1T.G..T....TG.A.AC...............T..............
Popielica HT19 OR827121 J 11.A...T...............T....T...G.TT.............
Tarpanka I HT20 OR827122 A 9.A...TA.........C............G...........A.....
Tarpanka I HT21 OR827123 A 6.A..CT.......................G...........AC..T.
Tarpanka I HT22 OR827124 A 3.............................G.................
Tarpanka I HT19 OR827121 J 1.A...T...............T....T...G.TT.............
Tarpanka I HT23 OR827125 A/B1T.G..T.....G.A...............G...........A.....
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 8/22
Table 2 (continued)
Dam line/HT GenBank
acc. no.
HG*
No. of horses
15542
15585
15597
15598
15601
15602
15604
15615
15616
15617
15635
15650
15659
15666
15667
15672
15703
15762
15766
15769
15770
15771
15775
15776
15777
15806
15807
15810
15811
15826
15827
15868
15870
15871
15956
15974
15995
15996
16007
16022
16540
16543
16546
16551
16557_16558insC
16559
16611
Ref: NC_001640 CGATT CGAAT CATGAGTGCTCCCTACC ACAAT CCACATT TC TT G- CA
Traszka HT24 OR827126 A 26.....T...C..C................G...........AC..T.
Traszka HT25 OR827127 A 9.....T...C..C............................AC..T.
Traszka HT26 OR827128 A 3.....T...C..C....ATCT.TG..T.T.GCT.G.G.CC.AC..T.
Traszka HT27 OR827129 G 3T.G..T....TG.A.AC............G...........AC..T.
Traszka HT28 OR827130 A 2.....T...C..C........T....T.....T........AC..T.
Traszka HT29 OR827131 A 1.A...TA.........C.........T....C......CC.A.....
Tunguska HT30 OR827132 A 13.............................G.................
Tygryska HT31 OR827133 A/B5.........C.G.A...............G.................
Tygryska HT32 OR827134 A 1.A...TA.........CAT.T.T...T.T.G.T.G.G....A.....
Tygryska HT33 OR827135 A 1.A...TA.........CATCT.TG..T.T.GCT.G.G.CC.A.....
Urszulka HT34 OR827136 A/B10T.G..T.....G.A.AC............G.................
Urszulka HT35 OR827137 G 9T.G..T....TG.A.AC...............TT.............
Urszulka HT36 OR827138 A 2..G..T.....G.A.AC...............T..............
Urszulka HT37 OR827139 A/B1.A.........G.A...ATCT.TG....T.GCT...G.CC.......
Wola HT38 OR827140 A 6..G..TA.......G.C............G...........A..C..
Wola HT39 OR827141 A 5..G..TA.......G.CATCT.TG..T.T.GCT.G.G.CC.A..C..
Zaza HT20 OR827122 A 7.A...TA.........C............G...........A.....
Zaza HT33 OR827135 A 5.A...TA.........CATCT.TG..T.T.GCT.G.G.CC.A.....
Zaza HT40 OR827142 G 4TAG..T.G..TG.A..C...............T...........C..
Zaza HT41 OR827143 A 3.A...TA.........CA.C...G.......C......CC.A.....
Zaza HT42 OR827144 A 1.A.........G.A.......T..G...T.....G......A.....
Zaza HT43 OR827145 A 1.A...TA.........C....T..G...T.....G......A.A...
Note:
*
Achilli et al. (2012)
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 9/22
dam lines and the Equus caballus (NC_001640) sequence as the root. The NJ tree presents
the clustering of the analysed horses in separate groups with varied horse numbers
(Table 2). Although most of the lines form clear clusters, some different sequences can be
found in them; for example, the Tarpanka I_19 horse is embedded within the Popielica
cluster. This nding suggests that the ofcial Polish Konik pedigree data does not agree
with the mtDNA HTs revealed from genetic analysis.
The median-joining network was constructed using all Polish Konik sequences to
demonstrate the genetic distance between the obtained haplotypes (Fig. 4). The network
indicated clustersseparation based on the number of polymorphic sites. Several dam lines
Figure 2 Chromatograms showing three Polish Konik mtDNA HTs with marked 15,776 variable site
position (FinchTV software). The transition g.15776T>C was observed in HT3, the transversion
g.15776T>G in HT7, and nucleotide T (in accordance with the reference sequence) in HT1.
Full-size
DOI: 10.7717/peerj.17549/g-2
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 10/22
formed HTs separated by a single polymorphism, which points out that their haplotypes
are very similar. However, in some dam lines (including Karolka and Traszka lines), we
observed several different, only distantly related HTs. The overview of both the NJ tree and
Figure 3 The neighbor-joining tree of the 233 Polish Konik mtDNA sequences with Equus caballus reference sequence (NC_001640) used as a
root. Each colour represents one of sixteen dam lines. Full-size
DOI: 10.7717/peerj.17549/g-3
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 11/22
Figure 4 The median-joining network of the identied Polish Konik haplotypes (Table 2) with each dam line represented by circles coloured
according to the dam line info from the pedigree. The size of the nodes corresponds to the number of samples and the strokes on the branches
correspond to the number of polymorphisms. Full-size
DOI: 10.7717/peerj.17549/g-4
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 12/22
Figure 5 The median-joining network of the identied Polish Konik haplotypes (Table 2) with their belonging to the generally known
haplogroups described by Achilli et al. (2012).The size of the nodes corresponds to the number of samples. Six haplogroups are detected
among the Polish Konik population: A, B, E, G, J, and R. The percentage share of detected haplogroups is presented in a pie chart.
Full-size
DOI: 10.7717/peerj.17549/g-5
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 13/22
median-joining network highlights the wide range of variability among different mtDNA
haplotypes in Polish Koniks.
The median-joining network above was also constructed in a way to presents all
recognized haplotypes and their belonging to the generally known haplogroups described
by Achilli et al. (2012);(Fig. 5). Among the Polish Konik population were six haplogroups
detected: A, B, E, G, J, and R. Similarly to the research conducted by Cieślak et al. (2017)
haplogroup L, which is the most frequent in Europe and the Middle East, was not found.
The haplogroup A observed as common among Asian horse populations (Achilli et al.,
2012), turned out to be the most frequent within the population including 6372% of all
analysed horses. The presence of the other haplogroups ranged from several to a dozen
percent. It should be noted that 9% of horses could not be clearly assigned to a specic
Figure 6 Magnied view on the MSY HTs in Polish Konik sire lines. The simplied haplotype structure based on 114 (92 Crown) variants (see
variant details in Table S2), according to the published MSY topology (Bozlak et al., 2023). Variant names are placed on each branch, in red when
tested in the sample set and in gray if their allelic states were imputed. Genotyping results from 36 males are shown as pies, sized according to
frequency and colored with regard to the sire line. Undetected HTs are shown in gray and collapsed, and *HTsare placed on the respective
branching points. Previously reported horse breeds that carry the same HTs as Polish Konik males are denoted in proximity in black, while
gray-colored breeds indicate horse breeds reported in neighboring HTs (Wallner et al., 2017;Felkel et al., 2019;Remer et al., 2022;Radovic et al., 2022;
Bozlak et al., 2023). Full-size
DOI: 10.7717/peerj.17549/g-6
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 14/22
haplogroup because of sharing the same similarity to the haplogroup A and haplogroup B.
It is the result of a smaller number of nucleotides used in our study than analysed in studies
by Achilli et al. (2012).
Y chromosome variability
We investigated the MSY HTs in 36 Polish Konik males, representing all six sire lines.
All samples were allocated to the Crown haplogroup (daC), where 4 HTs were
distinguished. Notably, eight analysed males carried previously dened HTs (daC_Tb-
oB1*and daC Ao-aA1D1), whereas 28 males were placed at internal nodes of the backbone
topology (daC_Ad-h*and daC_Ad-hA1*; see Fig. 6 and Table S2), and their private HTs
are not yet denitely dened in the genotyping backbone. The HT name of a sample that
clustered internally was marked with an asterisk (*), e.g., daC_Ad-h*(Fig. 6).
The most abundant HT was daC_Ad-h*, detected in 16 horses. It was noted, that all
representatives of the Chochlik and Myszak sire lines clustered into daC_Ad-h*, while four
individuals from the Glejt I stallion line carried this HT. A subsequent daC_Ad-hA1*HT
was carried by twelve individuals, accounting for all analysed males of Liliput and Goraj
sire lines. Interestingly, daC_Ad-h*and daC_Ad-hA1*HTs were most frequently reported
in Coldbloods (Felkel et al., 2019;Remer et al., 2022;Bozlak et al., 2023). The remaining
part of the dataset (22%) carried daC_Ao-a1D1 (six samples from the Wicek sire line) and
daC_Tb-oB1*(two samples from the Gejt I sire line) HTs, respectively. Previously, HT
daC_Ao-a1D1 was reported only in Duelmener Ponies (Remer et al., 2022). In contrast,
daC_Tb-oB1*HT was characterized in Thoroughbreds (via Byerley Turk, 1680. founder),
Arabians, Akhal Teke, Barbs, and many other horse breeds (Wallner et al., 2017;Felkel
et al., 2019;Remer et al., 2022;Radovic et al., 2022;Bozlak et al., 2023).
DISCUSSION
The analysis of population structure in unique, native horse breeds such as Polish Konik is
extremely important because it allows the attempt to increase their genetic diversity and
keeps all lines large enough to prevent them from extinction. Currently, in a situation
where more than half of the Polish Konik lines are already extinct and several subsequent
lines are threatened with extinction, an especially important task is to control the
population structure and stop the regression of the least numerous lines in the population.
Tools used for this purpose are mitochondrial DNA and Y chromosome sequence analysis.
The variability of mtDNA in modern horse breeds is generally high. For the rst time,
the mtDNA variation within the Polish Konik maternal lines was examined in 2017
(Cieślak et al., 2017). The research was conducted on 173 horses and included 510 bp,
containing hypervariable region 1. The 33 variable sites were used to segregate Polish
Koniks in the form of 19 haplotypes. In our study, the number of polymorphic sites used
was 14 sites greater (47 SNPs), and this resulted in more than twice the number of detected
haplotypes (43). This comparison may indicate that applying the entire hypervariable
region gives us more detailed results than using only hypervariable region 1. Moreover,
Cieślak et al. (2017), for the rst time, suggested that mtDNA results do not fully
correspond to the ofcial pedigree data. Another study focused on assessing the genetic
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 15/22
diversity and structure of the Polish Konik breed based on 17 microsatellite markers
(Fornal et al., 2021), and this presented no signs of inbreeding and the assignment of the
horses using Structure software into (most likely) three clusters. However, the study
conducted by Mackowski et al. (2015) found 8.6% of inbreeding. The above studies showed
the diversity of the Polish Konik population in terms of STR variability.
The occurrence of a small number of differentiating sites between HTs observed within
a dam line may be the result of spontaneous mutations naturally occurring in the sequence
over the years. On the other hand, the presence of clearly different haplotypes in one dam
line can be the consequence of migration between different Polish Konik populations
before the establishment of the studbooks or the presence of errors in the pedigrees.
Further, both the neighbor-joining tree and the median-joining network demonstrated
moderate variability among different mitochondrial DNA sequences and overall diverse
variability of mitochondrial DNA sequence in Polish Konik dam lines.
The median-joining network constructed in the way to present the Polish Konik
haplotypes belonging to the generally known haplogroups (Achilli et al., 2012) revealed the
presence of six haplogroups (A, B, E, G, J, and R) among the Polish Koniks, which indicates
multiple origins of dam lines. The most numerous haplogroup A including 6372% of
analysed horses was observed as most common among Asian horse populations. Taking
into consideration the geographical attribution of the haplogroups, the A, E, G, and J
haplogroups were the ones the most common in Asia, including 8392% of tested Polish
Koniks. The remaining two haplogroups B and R were most common for the North
American and European populations (Achilli et al., 2012), respectively, and included
615%, and 2% of horses. These results, with an outstanding percentage predominance of
haplogroups most common in Asia, may shed light on the unexplored importance of Asian
horses in the development of the Polish Konik breed.
Another interesting observation noticed also by Cieślak et al. (2017) is that almost all of
the dam lines represented by more than one haplotype belonged to (at least) two distinct
haplogroups, which strengthens the assumptions regarding the possible presence of errors
in the Polish Konik pedigrees.
In the case of paternal inheritance, 17 autosomal microsatellite loci were previously
investigated among Polish Konik sire lines (Fornal et al., 2020). The microsatellite analysis
showed low genetic diversity and vague population structure among sire lines, where the
Polish Konik population can be described with only two clusters. The diversity of Y
chromosome in modern horses is also very low, however, recent advances in horse MSY
phylogeny (Felkel et al., 2019;Bozlak et al., 2023) enabled investigation and ne-scaled
insights into the horse patrilines. Numerous studies used the stable structure of MSY
phylogeny and investigated genetic variation (e.g., Wallner et al., 2017), population history
(e.g., Remer et al., 2022), and breeding practices development (e.g., Castaneda et al., 2019)
of diverse horse breeds. Thus, the investigation of Y chromosomal markers is crucial to
understanding the history and genetic patterns of Polish Konik patrilines.
Here, the MSY HT spectrum in Polish Koniks was investigated, and all sire lines were
reported within the Crown group, which reects the very recent breeding history (last
1,500 years) of the population (Bozlak et al., 2023). Among all analysed stallions, four
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 16/22
different haplotypes were noted, which exhibit a greater resolution of patriline
differentiation than observed with microsatellite loci (Fornal et al., 2020). Interestingly,
members of ve out of six sire lines carried haplotypes across the daC_A clade (daC_Ao-
a1D, daC_Ad-h*, and daC_Ad-hA1*). Among those, all Wicek line stallions carried
daC_Ao-a1D1, previously described in Duelmener horses. This nding sheds new light on
the close kinship between the breeds. Right after the end of World War II, Polish Konik
stallions were used to eliminate Duelmener poniesdomesticated traits. A notable example
was the stallion Nugat XII, who covered Duelmener mares in the years 19571963 (Opora,
2006). The haplotypes of Chochlik, Myszak, and Glejt I line in daC_Ad-h*, as well as the
joint allocation of Goraj and Liliput males to daC_Ad-hA1*, indicated private, not yet
resolved HTs in these lines. Previous research traced these HTs in Coldbloods (e.g., Felkel
et al., 2019;Remer et al., 2022), which is in line with the Polish Koniksbreeding history,
phenotypic characteristics, and horse type itself since the breed is considered a light draft
breed(Hendricks, 2007). It is also noteworthy that a few males in the Glejt I lineage carried
the daC_Tb-oB1*haplotype (Fig. 6), previously described in various lighter breeds.
daC_Tb-oB1*was primarily attributed to the Thoroughbred founder Byerley Turk (1680)
(Felkel et al., 2019) but later also detected in Arabians, Kuhailan Afas, and Latifsires
(Remer et al., 2022), as well as three patrilines in North African horses (Radovic et al.,
2022). The MSY patterns and narrative on the vast use of Thoroughbred and Arabian
stallions for the improvement of local stocks (Hendricks, 2007) could indicate
undocumented crossbreeding, probably to expand the degree of genetic variability of the
Polish Konik breed.
The described mitochondrial DNA and the Y chromosome genetic diversity also
indicated a certain probability of pedigree errors in the Polish Konik due to the unique
parental inheritance mechanism of mtDNA and the Y chromosome. Mitochondrial DNA
analysis allowed us to highlight individuals whose haplotype ts perfectly into haplotypes
occurring more frequently in a different dam line. In addition, MSY analysis emphasized
the accuracy and information content of sire line tracking in the Wicek line. In contrast, we
also observed two very distinct HTs in the Glejt I patriline (two males in daC_Tb-oB1*and
four in daC_Ad-h*). This nding cannot be explained by de novo mutations but rather
shows a clear sign of two different sire lines among analysed individuals from this line.
CONCLUSIONS
Genetic analysis of Polish Konik dam and sire lines has revealed their diversity in the form
of the identied haplotype number. The identication of mtDNA haplotypes among
individual dam lines showed that in some of them only one single haplotype was
presented, but several showed a wide range of haplotypes within the line. The haplotypes
were also classied into six (A, B, E, J, G, R) haplogroups, with an outstanding percentage
predominance of haplogroups most common in Asian horse populations. Y chromosome
analysis revealed that most of the Polish Konik sire lines share their haplotype with
Coldblood breeds, but also uncovered intriguing historical relationships and potential
crossbreeding events. The possibility of errors in the Polish Konik pedigrees was also
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 17/22
reported. The ndings emphasize the importance of genetic monitoring to maintain and
control the genetic diversity of Polish Koniks and ensure their long-term survival.
ADDITIONAL INFORMATION AND DECLARATIONS
Funding
This research was funded by Diamentowy Grantno. 0211/DIA/2019/48Ministry of
Science and Higher Education, Poland. The funders had no role in study design, data
collection and analysis, decision to publish, or preparation of the manuscript.
Grant Disclosures
The following grant information was disclosed by the authors:
Diamentowy: 0211/DIA/2019/48.
Ministry of Science and Higher Education, Poland.
Competing Interests
The authors declare that they have no competing interests.
Author Contributions
.Adrianna Dominika Musiałconceived and designed the experiments, performed the
experiments, analyzed the data, prepared gures and/or tables, authored or reviewed
drafts of the article, funding acquisition, and approved the nal draft.
.Lara Radovićconceived and designed the experiments, performed the experiments,
analyzed the data, prepared gures and/or tables, authored or reviewed drafts of the
article, and approved the nal draft.
.Monika Stefaniuk-Szmukier conceived and designed the experiments, authored or
reviewed drafts of the article, and approved the nal draft.
.Agnieszka Bieniek conceived and designed the experiments, authored or reviewed drafts
of the article, material collection, and approved the nal draft.
.Barbara Wallner conceived and designed the experiments, analyzed the data, authored or
reviewed drafts of the article, and approved the nal draft.
.Katarzyna Ropka-Molik conceived and designed the experiments, performed the
experiments, analyzed the data, authored or reviewed drafts of the article, and approved
the nal draft.
Data Availability
The following information was supplied regarding data availability:
All obtained mitochondrial DNA haplotypes are available at GenBank:
OR827103OR827145.
Supplemental Information
Supplemental information for this article can be found online at http://dx.doi.org/10.7717/
peerj.17549#supplemental-information.
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 18/22
REFERENCES
Achilli A, Olivieri A, Soares P, Lancioni H, Kashani BH, Perego UA, Nergadze SG, Carossa V,
Santagostino M, Capomaccio S, Felicetti M, Al-Achkar W, Penedo MCT, Verini-Supplizi A,
Houshmand M, Woodward SR, Semino O, Silvestrelli M, Giulotto E, Pereira L, Bandelt H-J,
Torroni A. 2012. Mitochondrial genomes from modern horses reveal the major haplogroups
that underwent domestication. Proceedings of the National Academy of Sciences of the United
States of America 109(7):24492454 DOI 10.1073/pnas.1111637109.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool.
Journal of Molecular Biology 215(3):403410 DOI 10.1016/S0022-2836(05)80360-2.
Bandelt H-J, Forster P, Röhl A. 1999. Median-joining networks for inferring intraspecic
phylogenies. Molecular Biology and Evolution 16(1):3748
DOI 10.1093/oxfordjournals.molbev.a026036.
Bozlak E, Radovic L, Remer V, Rigler D, Allen L, Brem G, Stalder G, Castaneda C, Cothran G,
Raudsepp T, Okuda Y, Moe KK, Moe HH, Kounnavongsa B, Keonouchanh S, Van NH,
Vu VH, Shah MK, Nishibori M, Kazymbet P, Bakhtin M, Zhunushov A, Paul RC,
Dashnyam B, Nozawa K, Almarzook S, Brockmann GA, Reissmann M, Antczak DF,
Miller DC, Sadeghi R, von Butler-Wemken I, Kostaras N, Han H, Manglai D,
Abdurasulov A, Sukhbaatar B, Ropka-Molik K, Stefaniuk-Szmukier M, Lopes MS, da
Câmara Machado A, Kalashnikov VV, Kalinkova L, Zaitev AM, Novoa-Bravo M,
Lindgren G, Brooks S, Rosa LP, Orlando L, Juras R, Kunieda T, Wallner B. 2023. Rening the
evolutionary tree of the horse Y chromosome. Scientic Reports 13:8954
DOI 10.1038/s41598-023-35539-0.
Castaneda C, Juras R, Khanshour A, Randlaht I, Wallner B, Rigler D, Lindgren G, Raudsepp T,
Cothran EG. 2019. Population genetic analysis of the estonian native horse suggests diverse and
distinct genetics, ancient origin and contribution from unique patrilines. Genes 10(8):629
DOI 10.3390/genes10080629.
Cieślak J, Wodas L, Borowska A, Cothran EG, Khanshour AM, Mackowski M. 2017.
Characterization of the Polish Primitive Horse (Konik) maternal lines using mitochondrial
D-loop sequence variation. PeerJ 5(4):e3714 DOI 10.7717/peerj.3714.
Cothran EG, Juras R, Macijauskiene V. 2005. Mitochondrial DNA D-loop sequence variation
among 5 maternal lines of the Zemaitukai horse breed. Genetics and Molecular Biology
28(4):677681 DOI 10.1590/S1415-47572005000500006.
Czerneková V, Kott T, Majzlík I. 2013. Mitochondrial D-loop sequence variation among Hucul
horse. Czech Journal of Animal Science 58(10):437442 DOI 10.17221/6992-CJAS.
Dell AC, Curry MC, Yarnell KM, Starbuck GR, Wilson PB, Rogers C. 2020. Mitochondrial
D-loop sequence variation and maternal lineage in the endangered Cleveland Bay horse. PLOS
ONE 15(12):e0243247 DOI 10.1371/journal.pone.0243247.
Doboszewski P, Doktor D, Jaworski Z, Kalski R, Kulakowska G, Lojek J, Plachocki D, Rys A,
Tylkowska A. 2017. Konik polski horses as a mean of biodiversity maintenance in
post-agricultural and forest areas: an overview of Polish experiences. Animal Science Papers and
Reports 35:333347.
Engel L, Becker D, Nissen T, Russ I, Thaller G, Krattenmacher N. 2021. Exploring the origin and
relatedness of maternal lineages through analysis of mitochondrial DNA in the holstein horse.
Frontiers in Genetics 12:632500 DOI 10.3389/fgene.2021.632500.
Felkel S, Vogl C, Rigler D, Dobretsberger V, Chowdhary BP, Distl O, Fries R, Jagannathan V,
Janečka JE, Leeb T, Lindgren G, McCue M, Metzger J, Neuditschko M, Rattei T, Raudsepp T,
Rieder S, Rubin C-J, Schaefer R, Schlötterer C, Thaller G, Tetens J, Velie B, Brem G,
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 19/22
Wallner B. 2019. The horse Y chromosome as an informative marker for tracing sire lines.
Scientic Reports 9(1):6095 DOI 10.1038/s41598-019-42640-w.
Felkel S, Vogl C, Rigler D, Jagannathan V, Leeb T, Fries R, Neuditschko M, Rieder S, Velie B,
Lindgren G, Rubin C-J, Schlötterer C, Rattei T, Brem G, Wallner B. 2018. Asian horses
deepen the MSY phylogeny. Animal Genetics 49(1):9093 DOI 10.1111/age.12635.
Fornal A, Kowalska K, Ząbek T, Piestrzynska-Kajtoch A, MusiałAD, Ropka-Molik K. 2020.
Genetic diversity and population structure of polish konik horse based on individuals from all
the male founder lines and microsatellite markers. Animals 10(9):1569
DOI 10.3390/ani10091569.
Fornal A, Kowalska K, Ząbek T, Piestrzynska-Kajtoch A, MusiałAD, Ropka-Molik K. 2021.
Genetic variability and population structure of polish konik horse maternal lines based on
microsatellite markers. Genes 12(4):546 DOI 10.3390/genes12040546.
Gorecka-Bruzda A, Jaworski Z, Jaworska J, Siemieniuch M. 2020. Welfare of free-roaming
horses: 70 years of experience with Konik Polski breeding in Poland. Animals 10(6):1094
DOI 10.3390/ani10061094.
Hendricks BL. 2007. International Encyclopedia of Horse Breeds. Norman, OK, USA: University of
Oklahoma Press.
Hutchison CA, Newbold JE, Potter SS, Edgell MH. 1974. Maternal inheritance of mammalian
mitochondrial DNA. Nature 251(5475):536538 DOI 10.1038/251536a0.
Janczarek I, Pluta M, Paszkowska A. 2017. Pochodzenie, hodowla i użytkowanie koników
polskich. Przegląd hodowlany nr 4/2017. Available at http://ph.ptz.icm.edu.pl/wp-content/
uploads/2017/07/2-Janczarek.pdf.
Jaworski Z. 1997. Genealogical tables of the polish primitive horse. Popielno, Poland: Polish
Academy of Sciences, Research Station for Ecological Agriculture and Preserve Animal
Breeding.
Jaworski Z, Tomczyk-Wrona I. 2019. Program ochrony zasobów genetycznych koni rasy konik
polski. Program Hodowli Koni rasy Konik Polski, Warszawa; Załącznik nr 2 do Zarządzenia Nr
70/09 z dn. 30.12.2009. Available at https://www.pzhk.pl/wp-content/uploads/pr-hodow-kn-
2019-04-01.pdf.
Jobling MA, Tyler-Smith C. 2017. Human Y-chromosome variation in the genome-sequencing
era. Nature Reviews Genetics 18(8):485497 DOI 10.1038/nrg.2017.36.
Khanshour AM, Cothran E. 2013. Maternal phylogenetic relationships and genetic variation
among Arabian horse populations using whole mitochondrial DNA D-loop sequencing. BMC
Genetics 14(1):83 DOI 10.1186/1471-2156-14-83.
Kusza S, Priskin K, Ivankovic A, Jędrzejewska B, Podgorski T, Jávor A, Mihók S. 2013. Genetic
characterization and population bottleneck in the Hucul horse based on microsatellite and
mitochondrial data. Biological Journal of the Linnean Society 109(1):5465
DOI 10.1111/bij.12023.
Letunic I, Bork P. 2021. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree
display and annotation. Nucleic Acids Research 49:W293W296 DOI 10.1093/nar/gkab301.
Librado P, Khan N, Fages A, Kusliy MA, Suchan T, Tonasso-Calvière L, Schiavinato S,
Alioglu D, Fromentier A, Perdereau A, Aury J-M, Gaunitz C, Chauvey L, Seguin-Orlando A,
Der Sarkissian C, Southon J, Shapiro B, Tishkin AA, Kovalev AA, Alquraishi S,
Alfarhan AH, Al-Rasheid KAS, Seregély T, Klassen L, Iversen R, Bignon-Lau O, Bodu P,
Olive M, Castel J-C, Boudadi-Maligne M, Alvarez N, Germonpré M, Moskal-del Hoyo M,
Wilczy
nski J, Pospu1a S, Lasota-KuśA, Tunia K, Nowak M, Rannamäe E, Saarma U,
Boeskorov G, Lōugas L, Kyselý R, Peške L, Bălăşescu A, Dumitraşcu V, Dobrescu R,
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 20/22
Gerber D, Kiss V, Szécsényi-Nagy A, Mende BG, Gallina Z, Somogyi K, Kulcsár G, Gál E,
Bendrey R, Allentoft ME, Sirbu G, Dergachev V, Shephard H, Tomadini N, Grouard S,
Kasparov A, Basilyan AE, Anisimov MA, Nikolskiy PA, Pavlova EY, Pitulko V, Brem G,
Wallner B, Schwall C, Keller M, Kitagawa K, Bessudnov AN, Bessudnov A, Taylor W,
Magail J, Gantulga J-O, Bayarsaikhan J, Erdenebaatar D, Tabaldiev K, Mijiddorj E,
Boldgiv B, Tsagaan T, Pruvost M, Olsen S, Makarewicz CA, Valenzuela Lamas S, Albizuri
Canadell S, Nieto Espinet A, Iborra MP, Lira Garrido J, Rodríguez González E, Celestino S,
Olària C, Arsuaga JL, Kotova N, Pryor A, Crabtree P, Zhumatayev R, Toleubaev A,
Morgunova NL, Kuznetsova T, Lordkipanize D, Marzullo M, Prato O, Bagnasco Gianni G,
Tecchiati U, Clavel B, Lepetz S, Davoudi H, Mashkour M, Berezina NY, Stockhammer PW,
Krause J, Haak W, Morales-Muñiz A, Benecke N, Hofreiter M, Ludwig A, Graphodatsky AS,
Peters J, Kiryushin KY, Iderkhangai T-O, Bokovenko NA, Vasiliev SK, Seregin NN,
Chugunov KV, Plasteeva NA, Baryshnikov GF, Petrova E, Sablin M, Ananyevskaya E,
Logvin A, Shevnina I, Logvin V, Kalieva S, Loman V, Kukushkin I, Merz I, Merz V,
Sakenov S, Varfolomeyev V, Usmanova E, Zaibert V, Arbuckle B, Belinskiy AB,
Kalmykov A, Reinhold S, Hansen S, Yudin AI, Vybornov AA, Epimakhov A, Berezina NS,
Roslyakova N, Kosintsev PA, Kuznetsov PF, Anthony D, Kroonen GJ, Kristiansen K,
Wincker P, Outram A, Orlando L. 2021. The origins and spread of domestic horses from the
Western Eurasian steppes. Nature 598:17DOI 10.1038/s41586-021-04018-9.
Librado P, Rozas J. 2009. DnaSP v5: a software for comprehensive analysis of DNA polymorphism
data. Bioinformatics 25(11):14511452 DOI 10.1093/bioinformatics/btp187.
Liu W. 2010. Comparative genomics of the Y chromosome and male fertility. In: Reproductive
Genomics in Domestic Animals. Ames, IA, USA: Blackwell Publishing, 129155.
Lovasz L, Fages A, Amrhein V. 2021. Konik, Tarpan, European wild horse: an origin story with
conservation implications. Global Ecology and Conservation 32(3):e01911
DOI 10.1016/j.gecco.2021.e01911.
Mackowski M, Mucha S, Cholewinski G, Cieslak J. 2015. Genetic diversity in Hucul and Polish
primitive horse breeds. Archives Animal Breeding 58(1):2331 DOI 10.5194/aab-58-23-2015.
Myćka G, Klecel W, Stefaniuk-Szmukier M, Jaworska J, MusiałAD, Ropka-Molik K. 2022.
Mitochondrial whole D-loop variability in polish draft horses of sztumski subtype. Animals
12(15):1870 DOI 10.3390/ani12151870.
Opora J. 2006. Die Wildbahngestüte Westfalens. Geschichte, Entwicklung und Zukunf. Berlin,
Bristol, Niederlassung Deutschland: TENEA Verlag Ltd.
Pasicka E. 2013. Polish konik horsecharacteristics and historical background of native
descendants of tarpan. Acta Scientiarum Polonorum. Medicina Veterinaria 12:2538.
Polish Horse Breeders Association. 2022. Breeding Statistics. (accessed on 19 September 2023).
Available at https://en.pzhk.pl/breeding/breeding-statistics/.
Radovic L, Remer V, Krcal C, Rigler D, Brem G, Rayane A, Driss K, Benamar M,
Machmoum M, Piro M, Krischke D, von Butler-Wemken I, Wallner B. 2022. Y chromosome
haplotypes enlighten origin, inuence, and breeding history of North African barb horses.
Animals 12(19):2579 DOI 10.3390/ani12192579.
Radovic L, Remer V, Reiter S, Bozlak E, Felkel S, Grilz-Seger G, Brem G, Wallner B. 2021. 38 Y
chromosome genetic variation and deep genealogies provide new insights on Lipizzan sire lines.
Journal of Equine Veterinary Science 100:103501 DOI 10.1016/j.jevs.2021.103501.
Reke A, Zarina A, Vinogradovs I. 2019. Management of semi-wild large herbivoresGrazing Sites
in Latvia. In: Proceedings of the 12th International Scientic and Practical Conference, Vol. 1,
Rezekene, Latvia, 2022 June 2019, 241244.
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 21/22
Remer V, Bozlak E, Felkel S, Radovic L, Rigler D, Grilz-Seger G, Stefaniuk-Szmukier M, Bugno-
Poniewierska M, Brooks S, Miller DC, Antczak DF, Sadeghi R, Cothran G, Juras R,
Khanshour AM, Rieder S, Penedo MC, Waiditschka G, Kalinkova L, Kalashnikov VV,
Zaitsev AM, Almarzook S, Reißmann M, Brockmann GA, Brem G, Wallner B. 2022. Y-
chromosomal insights into breeding history and sire line genealogies of Arabian horses. Genes
13(2):229 DOI 10.3390/genes13020229.
Saitou N, Nei M. 1987. The neighbor-joining method: a new method for reconstructing
phylogenetic trees. Molecular Biology and Evolution 4(4):406425
DOI 10.1093/oxfordjournals.molbev.a040454.
Tamura K, Stecher G, Kumar S. 2021. MEGA11: molecular evolutionary genetics analysis version
11. Molecular Biology and Evolution 38(7):30223027 DOI 10.1093/molbev/msab120.
Tomczyk-Wrona I. 2022. Program ochrony zasobów genetycznych koni rasy konik polski. Kraków,
Poland: Instytut ZootechnikiPIB w Krakowie, 115.
Wallner B, Palmieri N, Vogl C, Rigler D, Bozlak E, Druml T, Jagannathan V, Leeb T, Fries R,
Tetens J, Thaller G, Metzger J, Distl O, Lindgren G, Rubin C-J, Andersson L, Schaefer R,
McCue M, Neuditschko M, Rieder S, Schlötterer C, Brem G. 2017. Y chromosome uncovers
the recent oriental origin of modern stallions. Current Biology 27(13):20292035.e5
DOI 10.1016/j.cub.2017.05.086.
Wolc A, Bali
nska K. 2010. Inbreeding effects on exterior traits in Polish konik horses. Archiv
Tierzucht 53:18 0003-9438 DOI 10.5194/aab-53-1-2010.
Yang L, Kong X, Yang S, Dong X, Yang J, Gou X, Zhang H. 2018. Haplotype diversity in
mitochondrial DNA reveals the multiple origins of Tibetan horse. PLOS ONE 13(7):e0201564
DOI 10.1371/journal.pone.0201564.
Yoon SH, Lee W, Ahn H, Caetano-Anolles K, Park KD, Kim H. 2018. Origin and spread of
Thoroughbred racehorses inferred from complete mitochondrial genome sequences:
phylogenomic and Bayesian coalescent perspectives. PLOS ONE 13(9):e0203917
DOI 10.1371/journal.pone.0203917.
Musiałet al. (2024), PeerJ, DOI 10.7717/peerj.17549 22/22
... The evaluation of all existing lineages from both maternal and paternal sides is a precondition for breeding strategy and, thus, for the protection and balanced development of breeds. Due to its high mutation rate and its strictly maternal inheritance, mtDNA sequencing has become a widely used tool for tracing genetic diversity among horse breeds and pedigree populations in Europe [6,18,[34][35][36][37][38] and especially in local breeds from Italy [25,31,33,39]. ...
... A similar Oriental origin was also found in Lipizzan stallions from Monterotondo, where the high frequency of HT02 (80%, the same frequency as previously reported [24]) suggests a Neapolitan/Oriental origin. Similar distributions were found in Maremmano (80%), an Italian local breed, descending from the Etruscan native horses [24], and Giara (75%), a feral native breed from Sardinia, probably brought to the island by the Phoenicians [38]. Identical haplotypes were also recorded in other Lipizzan stallions, but at different frequencies, namely HT01 = 33% and HT02 = 67% [29,44]. ...
Article
Full-text available
Lipizzan is a famous horse breed dating back to 1580 when the original stud of Lipica was established by the Hasburg Archduke Charles II. Currently, the Italian State Stud of Lipizzan Horses (ASCAL) is a conservation nucleus managed through strict mating rules where mitochondrial DNA sequences are used to verify the correct assignment of mares to a historical pedigree maternal lineage. Here, we analyzed the D-loop sequences of Lipizzan horses from the ASCAL in Monterotondo (Rome, Italy) in order to confirm their pedigree assignment to known female founder families. The concurrent investigation of the paternal counterpart based on variation in the male-specific region of the Y chromosome (MSY) showed a prevalence of HT02 (80%), typical of the Neapolitan/Oriental wave. The mtDNA polymorphisms identified shaped nine haplotypes that were unequivocally assigned to each of nine classical mare families of the stud (Africa, Almerina, Argentina, Deflorata, Djebrin, Fistula, Ivanka, Sardinia, Spadiglia), while the Europa and Theodorosta families shared a tenth haplotype. New polymorphisms were identified in a not previously studied region (np 16100–16350). The mtDNA phylogenetic analysis revealed that the Lipizzans of the Monterotondo stud belong to six haplogroups (B, C, G, L, M, Q), out of the 18 recorded for the equine species. This work enabled us to identify and preserve ten haplotypes from the historical maternal lines in a small stud kept in genetic segregation for over 100 years.
... However, Ma et al. [80] showed that a haplogroup identified from the complete mitochondrial genome may correspond to one or more of its HVR1 (Hypervariable region 1) sequences. This pattern was further studied by Musial et al. [65], who reported twice as many detected haplotypes when analysing the whole D-loop compared to using only HVR1. ...
Article
Full-text available
This Systematic Literature Review (SLR) consolidates current research on mitochondrial DNA (mtDNA) analysis in horses, focusing on genetic variation, maternal lineage tracing, and haplogroup identification. The article selection process screened 1380 articles, with 76 fulfilling the inclusion criteria. Data extraction covered sampling techniques, studied mtDNA regions, sequencing methods, and haplogroup identification. Following the methodology of the PRISMA guidelines, this review encompasses studies published since 2012, obtained from Scopus, PubMed, Research4Life, Web of Science, and ScienceDirect. The major findings emphasise the use of mtDNA for tracing ancestry, validating maternal lineages, and identifying haplogroups along with their geographic distributions. This review identifies challenges, including the need to update the haplogroup classification system and potential information loss due to sequence trimming. Additionally, it examines promising avenues for future research, such as the implementation of next-generation sequencing and the merging of haplogroup data with performance traits, which could influence conservation initiatives and breeding programs. This review emphasises the necessity for standardised classification systems and further research on underrepresented breeds and regions to improve our understanding of equine genetic diversity.
... This is the result of numerous errors introduced into the official pedigrees of Hucul horses (Figure 1), likely before mandatory genetic markers inspections for breeding horses. Such errors have also been reported for other horse breeds, such as Arabians, Thoroughbreds, Polish Koniks, and Hucul horses outside Poland [25][26][27][28][29]. It has been suspected that the inconsistencies found in the official pedigrees of Polish Koniks may support the hypothesis that one or more maternal lines, which are commonly believed to no longer exist, are, in fact, still active in the breeding population but are "hidden" due to errors in pedigrees made several generations earlier [26]. ...
Article
Full-text available
Backround: The Hucul horse breed formed in the region of the Eastern Carpathians, likely through the natural crossbreeding of oriental horses. After World War II, their population significantly decreased, leading to the breeding being based on only 14 female lines, whose founders often had unknown origins. To preserve the breed’s unique characteristics, it is now part of a Genetic Resources Conservation Program, which prioritizes the maintenance of genetic diversity. This study aims to clarify the maternal relatedness of founder mares and assess genetic diversity using mitochondrial DNA (mtDNA). Methods: The hyper-variable region of the mitochondrial genome was analyzed in 57 horses. Pedigree records were used to trace genealogical lines, and molecular analysis focused on identifying maternal relationships between founder mares. Results: The analysis revealed close maternal kinships between the lines of Jagoda and Bajkałka, as well as Sekunda and Sroczka. In the Hucul population, seventeen mitochondrial haplotypes were identified, with three that did not match any established lines. The findings reveal discrepancies between pedigree records and mitochondrial DNA data, suggesting potential inaccuracies in the Hucul horse studbook. Conclusions: The findings highlight the importance of combining pedigree and molecular data to refine strategies to preserving genetic diversity, minimizing inbreeding, and improving the management the Genetic Resources Conservation Program.
... Robust MSY HT analysis using biallelic markers has been established recently [3,4]. Most modern horse breeds cluster into a haplogroup (HG) that emerged only around 1,500 years ago (ya) [3][4][5][6][7][8][9][10], with the worldwide dominance of the so-called 'Crown HG' ('daC') attributed to HTs of influential foundation sires, such as the English Thoroughbred [4] and Arabian [9]. Their HTs within the Crown have been delineated, and their over-regional impact demonstrated [4,11,12]. ...
... Maintaining the Dülmen wild horse population under natural harsh conditions without stabling or human interventions and with exclusively grassland pastures has contributed to the uniqueness of this wild horse herd and a careful selection of stallions that can withstand these conditions. Continuous monitoring using dense SNP-arrays [44] and mtDNA markers [45,46] should be helpful to control breeding decisions and the fitness of stallions through the rate of male foals per day of employment in the breeding season. The expected inbreeding load of the future progeny through the potential candidates can be derived prior to their use in breeding [47] and can be an additional tool, alongside phenotypic criteria, such as physical development, temperament, and character, to keep genetic diversity as high as possible. ...
Article
Full-text available
Simple Summary Dülmen wild horses have been managed by the Dukes of Croÿ for about 180 years in the Merfelder Bruch, a region near Dülmen in Westphalia, Germany. The Dülmen wild horses are freely roaming in this area of about 1400 acres all year round without human intervention and have to cope with the harsh natural conditions. Each breeding season 2–3 stallions are employed to sire foals. Herdbook records for this herd are not available due to the wildlife living conditions, free from any human disturbances. In this present study, we analyzed the distribution of the male progeny by their sires and whether sire effects on the genetic diversity and structure in the male progeny can be found. The genetic substructure in female progeny could not be evaluated as the fillies could not be captured for sampling. The paternal half-sib groups showed a very similar genetic diversity, but pairwise genetic distances and neighbor-joining dendrograms indicated a clustering by sires and a marker variance of 9% between paternal progeny groups. Bayesian cluster analysis supported subdivision into paternal progeny groups. Cluster memberships to paternal progeny groups, which were sired in the same year but by different sires, may also be related to mares, while clusters of paternal progeny from the same sire, but in intervals of more than one year, could not be distinguished. The employment of sires which are able to withstand the harsh natural conditions may also ensure a high genetic diversity in their progeny. Abstract The objectives of the present study were to analyze the influence of the stallions employed in the Dülmen wild horses on the genetic diversity and population substructure using Bayesian cluster analysis. The Dülmen wild horse is maintained as a unique horse population exposed to the natural conditions all year round in the Merfelder Bruch near Dülmen in Westphalia, Germany. Stallions selected for breeding have to prove their abilities to survive under this harsh environment. We used multilocus genotypic information from a set of 29 autosomal microsatellites to determine the paternity of 185 male foals sired by nine stallions. As females could not be sampled, we could not make inferences on all yearlings and test whether there are differences in the genetic population parameters between both sexes. The mean number of progeny was 19.92 with a range of 2–32, caused by the length of the service period per stallion. The average observed and unbiased expected heterozygosity was 0.688 and 0.631, the mean number of alleles was 4.448, and Wright’s FIS was −0.173. Pairwise genetic distances (FST and Nei’s unbiased genetic distances) were significant and varied between 0.038 to 0.091 and 0.085 to 0.290, respectively. Neighbor-joining dendrogram plots clustered a large proportion of the paternal progeny groups in different branches. Posterior Bayesian analyses using seven paternal half-sib groups with 10–74 members supported a maximum of six clusters, with two paternal progeny groups not differing, and a median of five clusters, with two groups of two sires each falling into the same clusters. When sires were employed in non-consecutive years, progeny from these different years of the same sires were grouped in the same cluster, whereas the progeny of one sire from two consecutive years were in different clusters. We were able to distinguish male progeny from Dülmen wild horse stallions and to show the effects of stallion use on the genetic substructure in the Dülmen wild horse herd. In conclusion, the analyses showed the genetic potential of the Dülmen wild horse stallions to maintain a high genetic diversity and also the effects in which breeding seasons and for how long stallions are used to sire foals. The selection of stallions may be sensitive for the further development of genetic diversity and preserve this closed population as a valuable resource for further studies on the evolution of the horse.
... This is the first report on this horse population. Previous studies included domesticated Dülmen horses or ponies [6,16,44], and thus the uniqueness and the potential value of the wild herd for phylogenetic studies was obscured. ...
Article
Full-text available
Simple Summary Wild horses in the region around Dülmen in Westphalia, Germany, date back to the Middle Ages. These horses ranged the wooden and marsh area in small herds. The Dukes of Croÿ fenced these wild horses in 1856 in an area called Merfelder Bruch, near Dülmen, and managed this herd as a self-contained population. Stallions employed were from the Dülmen wild horse population and, in some breeding seasons, from other primitive horse populations. The Dülmen wild horse population has to cope with harsh natural conditions all year round. The Dülmen wild horse population consists of about 530 horses with around 380 mares. Each year, 2 to 3 stallions sire 50 to 60 foals. In the present study, we characterize, for the first time, the Dülmen wild horse population in comparison to warmblood, coldblood, and primitive horse populations using a highly informative microsatellite marker set. We found a high degree of genetic diversity in the Dülmen wild horse population. Genetic distance measures, principal component analysis, and Bayesian cluster analyses clearly demarcated the Dülmen wild horse population from the other 17 horse populations studied here. In addition, data supported the hypothesis of an early genetic divergence of Dülmen wild horses. The Dülmen wild horse population appeared as an invaluable resource to study the evolution of domesticated horses. Abstract The objective of the present study was to analyze the genetic diversity, individual-based assessment of population structure, and admixture in the Dülmen wild horse population in comparison to warmblood, coldblood, and primitive horse populations. The Dülmen wild horse is kept as a unique horse population in the Merfelder Bruch near Dülmen in Westphalia, Germany, and since 1856 has been managed by the Dukes of Croÿ. The Dülmen wild horse population is exposed to the natural conditions of the Merfelder Bruch all year round without human interventions for feeding and veterinary care. In the present study, genetic diversity was estimated for 101 Dülmen wild horses using multilocus genotypic information from a set of 29 autosomal microsatellites and compared with 587 horses from 17 different horse populations. Dülmen wild horses maintained a high degree of genetic diversity, with an average observed heterozygosity of 0.68, a mean number of 6.17 alleles, and heterozygote deficit of −0.035. Pairwise genetic distances (FST, Nei’s standard, and Cavalli-Sforza distances) were closest to German coldblood breeds, Polish Konik, and Icelandic horses and most divergent from Sorraia and Przewalski’s horses. Neighbor joining dendrogram and PCA plots showed a clear distinction of Dülmen wild horses from other populations, particularly from Przewalski horses. Posterior Bayesian analysis confirmed clear differentiation from other horse populations without an admixture pattern and a high membership index (0.92). It was possible to distinguish Dülmen wild horses from Dülmen and Polish Konik horses. In conclusion, Dülmen wild horses show a notable separation from other German horse breeds and primitive horse populations and may serve as a resource to study evolution of equine domestication.
Article
Full-text available
The Y chromosome carries information about the demography of paternal lineages, and thus, can prove invaluable for retracing both the evolutionary trajectory of wild animals and the breeding history of domesticates. In horses, the Y chromosome shows a limited, but highly informative, sequence diversity, supporting the increasing breeding influence of Oriental lineages during the last 1500 years. Here, we augment the primary horse Y-phylogeny, which is currently mainly based on modern horse breeds of economic interest, with haplotypes (HT) segregating in remote horse populations around the world. We analyze target enriched sequencing data of 5 Mb of the Y chromosome from 76 domestic males, together with 89 whole genome sequenced domestic males and five Przewalski’s horses from previous studies. The resulting phylogeny comprises 153 HTs defined by 2966 variants and offers unprecedented resolution into the history of horse paternal lineages. It reveals the presence of a remarkable number of previously unknown haplogroups in Mongolian horses and insular populations. Phylogenetic placement of HTs retrieved from 163 archaeological specimens further indicates that most of the present-day Y-chromosomal variation evolved after the domestication process that started around 4200 years ago in the Western Eurasian steppes. Our comprehensive phylogeny significantly reduces ascertainment bias and constitutes a robust evolutionary framework for analyzing horse population dynamics and diversity.
Article
Full-text available
Simple Summary Bred over centuries in the Maghreb region, on a corridor between the Arab and the Western world, the North African Barb horse has been touched by many influences in the course of history. The present study investigated the paternally inherited Y chromosome in today´s Barbs and Arab-Barbs collected from North Africa and Europe, with the aim to link genetic patterns and narrative history. A broad Y chromosomal spectrum was observed, as well as regional disparities among populations. Y chromosomal patterns illustrated a tight connection of Barb horses with Arabians and several other breeds, including Thoroughbreds. Besides, results depict footprints of past migrations between North Africa and the Iberian Peninsula. Abstract In horses, demographic patterns are complex due to historical migrations and eventful breeding histories. Particularly puzzling is the ancestry of the North African horse, a founding horse breed, shaped by numerous influences throughout history. A genetic marker particularly suitable to investigate the paternal demographic history of populations is the non-recombining male-specific region of the Y chromosome (MSY). Using a recently established horse MSY haplotype (HT) topology and KASP™ genotyping, we illustrate MSY HT spectra of 119 Barb and Arab-Barb males, collected from the Maghreb region and European subpopulations. All detected HTs belonged to the Crown haplogroup, and the broad MSY spectrum reflects the wide variety of influential stallions throughout the breed’s history. Distinct HTs and regional disparities were characterized and a remarkable number of early introduced lineages were observed. The data indicate recent refinement with Thoroughbred and Arabian patrilines, while 57% of the dataset supports historical migrations between North Africa and the Iberian Peninsula. In the Barb horse, we detected the HT linked to Godolphin Arabian, one of the Thoroughbred founders. Hence, we shed new light on the question of the ancestry of one Thoroughbred patriline. We show the strength of the horse Y chromosome as a genealogical tool, enlighten recent paternal history of North African horses, and set the foundation for future studies on the breed and the formation of conservation breeding programs.
Article
Full-text available
Simple Summary The Polish draft horse (PDH) breed is a result of crossing local mares with imported cold-blooded stallions. In this study, we investigate the genetic composition of the PDH by analyzing whole mitochondrial d-loop variabilities and comparing these variabilities to previously demonstrated whole d-loop sequences of other cold-blooded breeds: Ardennais, Belgian, Breton, Clydesdale, Noriker, Norwegian Fjord, Percheron, and Suffolk. Our results show high nucleotide diversity among the PDH population and the existence of two main haplogroups with strong kinship to the Belgian breed. Abstract The Polish draft horse (PDH) breed is a result of crossing local mares with imported cold-blooded stallions, such as Belgians, Ardennes, Fjords, and others. A part of the broodmare stock investigated in this study was also imported from various countries, such as Denmark. In this study, we investigate the genetic composition of the PDH by analyzing the whole mitochondrial d-loop variability and comparing it to previously demonstrated whole d-loop sequences of other cold-blooded breeds: Ardennais, Belgian, Breton, Clydesdale, Noriker, Norwegian Fjord, Percheron, and Suffolk. Our results show high nucleotide diversity within the PDH population (π = 0.011), and the existence of two main haplogroups: one of relatively concise origin, with strong kinship to the Belgian breed, and the second showing close relation to the majority of other analyzed cold-blooded breeds. Some of the PDH maternal strains clustered separately, which can be a result of the influence of other unidentified breeds that served as a foundation stock for the present population. This present study explains the genetic relationship of the PDH to other cold-blooded breeds and indicates the high genetic diversity of the breed.
Article
Full-text available
The Y chromosome is a valuable genetic marker for studying the origin and influence of paternal lineages in populations. In this study, we conducted Y-chromosomal lineage-tracing in Arabian horses. First, we resolved a Y haplotype phylogeny based on the next generation sequencing data of 157 males from several breeds. Y-chromosomal haplotypes specific for Arabian horses were inferred by genotyping a collection of 145 males representing most Arabian sire lines that are active around the globe. These lines formed three discrete haplogroups, and the same haplogroups were detected in Arabian populations native to the Middle East. The Arabian haplotypes were clearly distinct from the ones detected in Akhal Tekes, Turkoman horses, and the progeny of two Thoroughbred foundation sires. However, a haplotype introduced into the English Thoroughbred by the stallion Byerley Turk (1680), was shared among Arabians, Turkomans, and Akhal Tekes, which opens a discussion about the historic connections between Oriental horse types. Furthermore, we genetically traced Arabian sire line breeding in the Western World over the past 200 years. This confirmed a strong selection for relatively few male lineages and uncovered incongruences to written pedigree records. Overall, we demonstrate how fine-scaled Y-analysis contributes to a better understanding of the historical development of horse breeds.
Article
Full-text available
Horses are gaining importance in European nature conservation management, for which usually so-called primitive breeds are favored due to their claimed robustness. An increasingly popular breed, the Konik horse, is often said to be the direct descendent of the alleged European wild horse, the Tarpan. However, both the direct descent of the Konik from European wild horses and the existence of the Tarpan as a wild species are highly debated. In this review, we scrutinized both contemporary research and historical sources and suggest that the Tarpan and the Konik as its direct descendent are manmade myths that hinder effective conservation management. We did not find evidence that the Tarpan was a wild horse rather than a feral horse. We did not find any evidence either for a closer connection between the Konik and any extinct wild horse than between other domestic breeds and wild horses. We discuss three perspectives on why the myth has become widely accepted and survived to this day: a historical-political, a biological-ecological, and an emotional perspective. It seems that the origin story of the Konik and its connection to the Tarpan was shaped by personal and political interests, including nationalistic ideas. These as well as general human emotions towards horses have influenced researchers and laypeople to keep the myth alive, which has been possibly negatively impacting contemporary nature conservation. Indeed, today’s Koniks originated from a small founder population of only six male lines that were selected according to their phenotypic traits, with the aim to rebreed the ‘wild Tarpan’. Strict breeding practices have led to high inbreeding levels in recent Konik populations, which may undermine nature conservation purposes. Therefore, we suggest that mythologized origin stories should not be an argument for selecting breeds of grazers for nature conservation.
Article
Full-text available
Domestication of horses fundamentally transformed long-range mobility and warfare¹. However, modern domesticated breeds do not descend from the earliest domestic horse lineage associated with archaeological evidence of bridling, milking and corralling2–4 at Botai, Central Asia around 3500 bc³. Other longstanding candidate regions for horse domestication, such as Iberia⁵ and Anatolia⁶, have also recently been challenged. Thus, the genetic, geographic and temporal origins of modern domestic horses have remained unknown. Here we pinpoint the Western Eurasian steppes, especially the lower Volga-Don region, as the homeland of modern domestic horses. Furthermore, we map the population changes accompanying domestication from 273 ancient horse genomes. This reveals that modern domestic horses ultimately replaced almost all other local populations as they expanded rapidly across Eurasia from about 2000 bc, synchronously with equestrian material culture, including Sintashta spoke-wheeled chariots. We find that equestrianism involved strong selection for critical locomotor and behavioural adaptations at the GSDMC and ZFPM1 genes. Our results reject the commonly held association⁷ between horseback riding and the massive expansion of Yamnaya steppe pastoralists into Europe around 3000 bc8,9 driving the spread of Indo-European languages¹⁰. This contrasts with the scenario in Asia where Indo-Iranian languages, chariots and horses spread together, following the early second millennium bc Sintashta culture11,12.
Article
Full-text available
Maternal lineages are important for the breeding decision in the Holstein horse breed. To investigate the genetic diversity of the maternal lineages and the relationships between founder mares, the maternal inherited mitochondrial genome (except the repetitive part of the non-coding region) of 271 mares representing 75 lineages was sequenced. The sequencing predominantly revealed complete homology in the nucleotide sequences between mares from one lineage with exceptions in 13 lineages, where differences in one to three positions are probably caused by de novo mutations or alternate fixation of heteroplasmy. We found 78 distinct haplotypes that have not yet been described in other breeds. Six of these occurred in two or three different lineages indicating a common ancestry. Haplotypes can be divided into eight clusters with all mares from one lineage belonging to the same cluster. Within a cluster, the average number of pairwise differences ranged from zero to 16.49 suggesting close maternal relationships between these mares. The results showed that the current breeding population originated from at least eight ancestral founder mares.
Conference Paper
Full-text available
The male-specific region of the Y chromosome (MSY) is paternally inherited and enables to trace patrilines through genetic analysis. MSY markers and haplotypes in combination with pedigree information provide precise insights into genealogical systems. While the use of the Y chromosomal genetic genealogy is well established in humans, we recently could develop an equine MSY marker system from NGS data capable to perform a genetic fine-scale analysis of sire lines in horses. This approach has been successfully applied to solve genealogical questions in the English Thoroughbred. In this work, we study sire line genealogies in Lipizzan horses by combining MSY and pedigree information. The Lipizzan breed is characterized by deep pedigree records that trace back to founder sires born in the 18th and early 19th century. We inferred Lipizzan specific MSY haplotypes based on NGS data from 15 Lipizzan males mapped to a 5.8 Mb horse MSY draft reference (LipY764). Haplotype distribution was then studied by screening 52 selected haplotype determining variants in 132 stallions representing the 8 existing Lipizzan sire lines. Samples were derived from 7 European Lipizzan state stud farms. Genomic DNA was isolated from blood and genotyping was performed with the KASP genotyping method. MSY haplotype spectra in Lipizzans were compared with other breeds, and the results confirmed a presumed Arabian and Spanish origin in 2 Lipizzan sire lines. Surprisingly, horses from the remaining 6 sire lines grouped into haplogroup Tb. This haplogroup was recently characterized as a signature of the Turkoman horse, and the allocation of Lipizzans into Tb points to a possible Turkish influence in the Lipizzans. In addition, we detected 6 haplotypes that arose via de novo mutation in the timeframe of pedigree documentation. In 2 sire lines the MSY pattern in sub-lines did not accord with the paternal lineage documentation. In 5 horses belonging to a side-branch of the Favory line, a unique haplotype was detected. This finding indicates another foundation sire in the Lipizzan horse. In conclusion, MSY haplotyping confirmed historic breed documentation and offered new insights on the male breed ancestry. Furthermore, we demonstrated that our approach is suitable to study sire lines on a genealogical scale. MSY analysis can be used for studying the paternal ancestry and as a forensic tool, to solve open questions in the paternal lineages, even if they occurred multiple generations back in time.
Article
Full-text available
The Molecular Evolutionary Genetics Analysis (MEGA) software has matured to contain a large collection of methods and tools of computational molecular evolution. Here, we describe new additions that make MEGA a more comprehensive tool for building timetrees of species, pathogens, and gene families using rapid relaxed-clock methods. Methods for estimating divergence times and confidence intervals are implemented to use probability densities for calibration constraints for node-dating and sequence sampling dates for tip-dating analyses, which will be supported by new options for tagging sequences with spatiotemporal sampling information, an expanded interactive Node Calibrations Editor, and an extended Tree Explorer to display timetrees. We have now added a Bayesian method for estimating neutral evolutionary probabilities of alleles in a species using multispecies sequence alignments and a machine learning method to test for the autocorrelation of evolutionary rates in phylogenies. The computer memory requirements for the maximum likelihood analysis are reduced significantly through reprogramming, and the graphical user interface (GUI) has been made more responsive and interactive for very big datasets. These enhancements will improve the user experience, quality of results, and the pace of biological discovery. Natively compiled GUI and command-line versions of MEGA11 are available for Microsoft Windows, Linux, and macOS from www.megasoftware.net.
Article
Full-text available
The Interactive Tree Of Life (https://itol.embl.de) is an online tool for the display, manipulation and annotation of phylogenetic and other trees. It is freely available and open to everyone. iTOL version 5 introduces a completely new tree display engine, together with numerous new features. For example, a new dataset type has been added (MEME motifs), while annotation options have been expanded for several existing ones. Node metadata display options have been extended and now also support non-numerical categorical values, as well as multiple values per node. Direct manual annotation is now available, providing a set of basic drawing and labeling tools, allowing users to draw shapes, labels and other features by hand directly onto the trees. Support for tree and dataset scales has been extended, providing fine control over line and label styles. Unrooted tree displays can now use the equal-daylight algorithm, proving a much greater display clarity. The user account system has been streamlined and expanded with new navigation options and currently handles >1 million trees from >70 000 individual users.