ArticlePDF Available

Macro- and microcolinearity between the genomic region of wheat chromosome 5B containing the Tsn1 gene and the rice genome

Authors:

Abstract and Figures

The Tsn1 gene in wheat confers sensitivity to a proteinaceous host-selective toxin (Ptr ToxA) produced by the tan spot fungus (Pyrenophora tritici-repentis) and lies within a gene-rich region of chromosome 5B. To use the rice genome sequence information for the map-based cloning of Tsn1, colinearity between the wheat genomic region containing Tsn1 and the rice genome was determined at the macro- and microlevels. Macrocolinearity was determined by testing 28 expressed sequence markers (ESMs) spanning a 25.5-cM segment and encompassing Tsn1 for similarity to rice sequences. Twelve ESMs had no similarity to rice sequences, and 16 had similarity to sequences on seven different rice chromosomes. Segments of colinearity with rice chromosomes 3 and 9 were identified, but frequent rearrangements and disruptions occurred. Microcolinearity was determined by testing the sequences of 26 putative genes identified from BAC contigs of 205 and 548 kb in length and flanking Tsn1 for similarity to rice genomic sequences. Fourteen of the predicted genes detected orthologous sequences on six different rice chromosomes, whereas the remaining 12 had no similarity with rice sequences. Four genes were colinear on rice chromosome 9, but multiple disruptions, rearrangements, and duplications were observed in wheat relative to rice. The data reported provide a detailed analysis of a region of wheat chromosome 5B that is highly rearranged relative to rice.
Content may be subject to copyright.
Funct Integr Genomics (2006) 6: 90103
DOI 10.1007/s10142-005-0020-1
ORIGINAL PAPER
Huangjun Lu
.
Justin D. Faris
Macro- and microcolinearity between the genomic region
of wheat chromosome 5B containing the
Tsn1
gene
and the rice genome
Received: 13 September 2005 / Revised: 21 November 2005 / Accepted: 22 November 2005 / Published online: 22 December 2005
# Springer-Verlag 2005
Abstract The Tsn1 gene in wheat confers sensitivity to a
proteinaceous host-selective toxin (Ptr ToxA) produced
by the tan spot fungus (Pyrenophora tritici-repentis) and
lies within a gene-rich region of chromosome 5B. To use
the rice genome sequence information for the map-based
cloning of Tsn1, colinearity between the wheat genomic
region containing Tsn1 and the rice genome was de-
termined at the macro- and microlevels. Macrocolinearity
was determined by testing 28 expressed sequence mark-
ers (ESMs) spanning a 25.5-cM segment and encom-
passing Tsn1 for similarity to rice sequences. Twelve
ESMs had no similarity to rice sequences, and 16 had
similarity to sequences on seven different rice chromo-
somes. Segments of colinearity with rice chromosomes 3
and 9 were identified, but frequent rearrangements and
disruptions occurred. Microcolinearity was determined by
testing the sequences of 26 putative genes identified from
BAC contigs of 205 and 548 kb in length and flanking
Tsn1 for similarity to rice genomic sequences. Fourteen
of the predicted genes detected orthologous sequences on
six different rice chromosomes, whereas the remaining
12 had no similarity with rice sequences. Four genes
were colinear on rice chromosome 9, but multiple dis-
ruptions, rearrangements, and duplications were observed
in wheat relative to rice. The data reported provide a
detailed analysis of a region of wheat chromosome 5B
that is highly rearranged relative to rice.
Keywords Wheat
.
Comparative genomics
.
Colinearity
.
Rice
Introduction
Bread wheat (Triticum aestivum L., 2n=6x=42, AABBDD
genomes) is one of the most important food crops in the
world. Genomics and gene discovery in hexaploid wheat is
confounded by a genome size of approximately 17,300 Mb
(Bennett and Leitch 1995) and an abundance (80%) of re-
petitive sequences (Wicker et al. 2001; SanMiguel et al.
2002). Despite these obstacles, considerable progress toward
understanding genome structure and organization has been
made. The polyploid nature of wheat allows it to tolerate
aneuploidy, including the loss of large chromosome seg-
ments, chromosome arms, and even whole chromosomes.
This feature has led to the development of a large collection
of aneuploid stocks for cytogenetic and genomic analyses
(Sears 1954; 1966; Sears and Sears 1978; Endo and Gill
1996). Endo and Gill (1996) isolated more than 400 terminal
chromosome deletion lines, which have been used to
develop physical maps of wheat chromosomes (Werner et
al. 1992;Gilletal.1993; Hohmann et al. 1994; Delaney et al.
1995a,b; Mickelson-Young et al. 1995). Comparisons of the
physical maps with recombination-based maps led to the
discovery that recombination frequencies were highest near
the distal regions of the chromosomes and that genes existed
in clusters with the highest gene density occurring near the
telomeric regions (Gill et al. 1996a,b; Akhunov et al. 2003;
Erayman et al. 2004).
More than 600,000 expressed sequence tags (ESTs) have
been generated from wheat and closely related species by
the National Science Foundation (NSF)-funded wheat EST
project and other public and private entities (http://www.
ncbi.nlm.nih.gov/dbEST/dbEST_summary.html). A panel
of the chromosome deletion lines was used to locate more
than 16,000 EST loci to specific chromosome deletion bins
by the wheat NSF EST project (Qi et al. 2004). The EST
sequence and mapping data provide a valuable resource for
genome analysis, identification of candidate genes for traits
of interest, predicting biological function of genes, and
comparative genomics.
Comparative analysis of plant genomes has increased
our knowledge of evolutionary relationships among spe-
H. Lu
Department of Plant Sciences,
North Dakota State University,
Fargo, ND 58105, USA
J. D. Faris (*)
USDA-ARS Cereal Crops Research Unit,
Red River Valley Agricultural Research Center,
Fargo, ND 58105, USA
e-mail: farisj@fargo.ars.usda.gov
cies, and considerable effort has been put forth in com-
paring the genomic relationships among grasses (Devos
and Gale 2000). Comparative mapping experiments among
wheat and other members of the Poaceae including rice,
barley, rye, oats, and maize have revealed remarkable
similarities in gene content and marker colinearity at the
chromosome (macro) level. The genomes of distantly re-
lated cereals like oat, rice, and maize can be divided into
linkage blocks that have homology to corresponding seg-
ments of the wheat genome (Ahn et al. 1993; Van Deynze
et al. 1995a,b).
The degree of genomic similarity observed at the macro-
level among grass genomes led to the notion that infor-
mation from the small genome of rice could be directly
applied to the much larger genome of wheat, and the recent
completion of rice genome sequencing (Goff et al. 2002;
Yu et al. 2002; International Rice Genome Sequencing
Project 2005) makes it a valuable reference for comparative
genomics of the grass family. However, while some studies
of colinearity between wheat and rice at the sequence
(micro) level indicated good levels of conservation (Yan et
al. 2003; Chantret et al. 2004; Distelfeld et al. 2004), many
have reported the occurrence of multiple rearrangements in
gene order and content (Bennetzen 2000; Feuillet and
Keller 2002; Li and Gill 2002; Sorrells et al. 2003; Francki
et al. 2004). Nevertheless, the rice genome sequence is a
potentially valuable tool for map-based cloning of genes in
wheat. The positional cloning of the wheat vernalization
gene VRN1 (Yan et al. 2003) is a good example of using
information from the colinear regions in rice and sorghum
to facilitate the cloning of a wheat gene. Others have shown
that colinear regions of rice can be a useful source of
markers for saturation and high-resolution mapping of
target genes in wheat (Liu and Anderson 2003; Distelfeld
et al. 2004).
Our goal is to isolate the Tsn1 gene in wheat using a
positional cloning approach. Tsn1 confers sensitivity to the
host-selective proteinaceous toxin Ptr ToxA, which is pro-
duced by the tan spot fungus (Pyrenophora tritici-repentis )
(Faris et al. 1996). In previous work, we located Tsn1 to the
long arm of chromosome 5B by genetic linkage mapping
(Faris et al. 1996), and later, we located the gene on the
physical map of 5B within the deletion bin 5BL 0.750.76
(Faris et al. 2000). Haen et al. (2004) conducted saturation
and high-resolution mapping of the Tsn1 locus and iden-
tified markers delineating the gene to a 0.8-cM interval. In
addition, we conducted genetic linkage mapping of 23
markers derived from bin-mapped ESTs to further saturate
the Tsn1 locus. In this study, we explored the level of
macrocolinearity between a 25.5-cM segment containing
Tsn1 and rice using EST and cDNA marker sequences. We
also evaluated the level of microcolinearity of the Tsn1
region and rice using putative gene sequences derived from
more than 750 kb of BAC sequence at the Tsn1 locus.
Materials and methods
Genetic mapping of expressed sequence markers
A mapping population consisting of 117 recombinant sub-
stitution lines (RSLs) derived from a cross between Chinese
Spring (CS) and a CSTriticum dicoccoides chromosome
5B disomic substitution line (CS-DIC 5B, i.e., the CS 5B
chromosome pair was replaced by a pair of T. dicoccoides
5B chromosomes) was used for mapping expressed se-
quence markers (ESMs), which consisted of EST-derived
markers and cDNA-derived RFLP markers, within and near
the 5BL 0.750.76 interval (Faris et al. 2000; Lu et al.,
unpublished data). The genetic mapping of cDNA-derived
RFLP markers XksuQ63, XksuQ11, Xcdo400, Xbcd183,
Xbcd1030, Xksu919(Lpx), Xrz575, and Xcdo465 was
described in Faris et al. (2000). Marker Xfcc1 was described
in Haen et al. (2004), and details of EST-derived markers
and mapping are described elsewhere (Lu et al., unpub-
lished data). The resulting map, which spanned 25.5 cM
encompassing the Tsn1 locus, was used to determine
macrocolinearity with rice.
Chromosome walking and high-resolution mapping
The Langdon (LDN) BAC library (Cenci et al. 2003) was
used to assemble BAC contigs of 205 and 548 kb (hereafter
referred to as ctg205 and ctg548, respectively) flanking the
Tsn1 gene. The library was initially screened with probes
FCG17 (CC249715) and FCG9 (CC249705), which are
AFLP-derived RFLP probes that flank Tsn1 (Haen et al.
2004). Positive clones identified with FCG17 were used to
assemble ctg205 on the proximal side of Tsn1, which con-
sists of the overlapping BACs 378P21 (AY914085) and
1154L7 (AY914086) (Lu et al., unpublished data). BACs
404J6 (DQ157837) and 533E21 (DQ157838) were iden-
tified with probe FCG9 and used to assemble a 228-kb
contig on the distal side of Tsn1. Three subsequent chro-
mosome walking steps, which included BACs 50F12 (DQ
157835), 239E17 (DQ157836), and 1089I4 (DQ157839),
were performed from the proximal end of the 228-kb contig
resulting in ctg548. For each walking step, BACs were
completely sequenced (Lu and Faris, unpublished data),
and putative genic and low-copy regions were exploited for
the development of RFLP markers as described in Faris et
al. (2003). The markers were anchored to the genetic map
by conducting high-resolution mapping in a durum wheat
(Triticum turgidum L., 2n=4x=28, AABB genomes) pop-
ulation derived from crossing LDN with LDN-DIC 5B as
described (Haen et al. 2004). This mapping population
consists of 2,719 Ptr ToxA-insensitive F
2
plants, which are
all homozygous for the tsn1 allele.
91
BLAST similarity searches
Sequences of ESMs were tested for similarity to sequences
in The Institute for Genomic Research (TIGR) wheat gene
index database TaGI release 10.0 (http://tigrblast.tigr.org/
tgi/) using BLASTn (Altschul et al. 1997) to identify the
tentative consensus (TC) sequences containing the corre-
sponding ESM. TC sequences were downloaded and sub-
jected to BLASTx searches against the NCBI nonredundant
(nr) database. A significant match was declared on the basis
of a minimum 40% amino acid identity for at least half the
length of the TC or EST sequence and an e value of less than
e11. The hit with the lowest e value was considered the
corresponding protein of the ESM when several matches
were found.
TC sequences (or ESM sequences when no TC was
available) were subjected to BLASTn and tBLASTx
searches against rice genomic sequences using Gramene
(Ware et al. 2002), genomic sequences of genes in TIGR
rice pseudomolecules (http://tigrblast.tigr . org/euk-blast/index.
cgi?project=osa1), and all rice BAC and PAC sequences in
GenBank(http://tigrblast.tigr.org/euk-blast/index.cgi?pro
ject=osa1). For BLASTn searches, the threshold limits for
significant matches were at least 65% nucleotide identity
for at least half of the ESM or TC sequence, but not less
than 150 bases, and an e value of less than e20. For
tBLASTx searches, significance was declared when there
was at least 40% amino acid identity over at least half of the
ESM or TC sequence, but no less than 200 amino acids, and
an e value of less than e11. When there were several
Rice chromosomeCS x CS-DIC 5B genetic map
5BL-14 breakpoint
5BL-9 breakpoint
0.4
1.8
0.4
0.4
0.0
0.0
1.7
0.0
2.6
1.7
0.0
0.0
0.0
3.1
2.1
0.0
0.0
0.0
5.2
0.9
0.0
2.6
2.6
0.0
XBE590499
XBE488792
XBE426161
XBE403217
XksuQ63
XBF484437
XBE442978
XBM140357
XksuQ11
XBE500658
XBE591798
XBE445619
Xcdo400
Xbcd183
XBE403702
XBE423505
XBF483506
XBM138151.1
XBE425878
Xfcc1
XBE443610
XBM138151.2
Xksu919(Lpx)
Xrz575
XBG608197
Xcdo465
Tsn1
XBF483510
XBE403968
XBE203136
cM Marker
0.81 5BS-6
0.71 5BS-5
0.56 5BS-8
0.13 5BS-7
0.29 5BL-6
0.59 5BL-11
0.75 5BL-14
0.76 5BL-9
0.79 5BL-16
5B Physical Map
(11.194)
(31.064)
(20.944)
(25.396)
(14.011)
(23.009)
(14.533)
(26.404)
(30.271)
(11.294)
(22.174)
(21.578)
(30.809)
(21.099)
(30.374)
3
3
3
3
3
9
9
9
9
12
7
2
2
2
5/11
4
(23.401/27.245)
Mb
Fig. 1 Macrocolinearity between a 25.5-cM region of wheat chro-
mosome 5BL encompassing Tsn1 and the rice genome based on
expressed sequence markers (ESMs). A physical map of wheat
chromosome 5B is shown to the left for reference to the genomic
region evaluated in this study, and the 5BL-14 and 5BL-9 deletion
breakpoints, which define the 5BL 0.750.76 deletion interval, are
indicated on the wheat genetic map (center). Rice chromosomes
harboring sequences with significant similarity to ESMs are in-
dicated to the right, and the megabase positions of the significant
hits along rice chromosomes are indicated in parentheses
92
significant matches for a single wheat ESM or TC
sequence, only the best match is reported.
BAC sequence analysis and gene prediction
Sequences of BACs were first subjected to BLASTn
searches of the Triticeae Repeat Sequence (TREP) database
(http://wheat.pw.usda.gov/ITMI/Repeats/index.shtml), the
TIGR Triticum Repeat Database (http://www.tigr.org/tdb/
e2k1/plant.repeats/), and BLASTn and BLASTx searches
of the NCBI nr database (http://www.ncbi.nlm.nih.gov/
BLAST/) to identify repetitive elements. Sequences not
containing putative repetitive elements were then subjected
to BLASTn and BLASTx searches of the NCBI nr database
and BLASTn searches of the wheat EST database (http://
wheat.pw.usda.gov/wEST/blast/), the NCBI EST database
(http://www.ncbi.nlm.nih.gov/dbEST/), and the TIGR wheat
gene indices database (http://tigrblast.tigr.org/tgi/) to identify
putative genes. Whole BAC sequences were also submitted
to the Rice Genome Automated Annotation System (Rice
GAAS; http://ricegaas.dna.affrc.go.jp/), where sequences
were analyzed with the various integrated coding pre dic-
tion programs (Autopredgenset, GENSCAN, RiceHMM,
FGENESH, MZEF), homology search analysis programs
(BLAST, HMMER, ProfileScan, MOTIF), and repetitive
DNA analysis programs (RepeatMasker, Printrepeats, Auto
PredLTR, BLASTn against NCBI LTRdb, and BLASTx
against NCBI transposon subjects). To validate or correct
the predicted structure of putative genes, cDNA sequences
with greater than 80% identity were aligned with the ge-
nomic sequence of the corresponding predicted gene, and
the predicted coding sequence was then manually edited if
necessary. Predicted coding sequences of putative genes
were analyzed against rice genomic sequences using the
same criteria as for ESM and TC sequences described
above.
Results
Analysis of ESMs and corresponding sequences
The map for the genomic region under investigation
consisted of 29 ESM loci including seven previously
anonymous cDNA-RFLP markers (Faris et al. 2000), one
cDNA-derived AFLP marker (Xfcc1) (Haen et al. 2004),
and 21 EST-derived markers (Lu et al., unpublished data).
One EST-derived marker (XBM138151) detected two loci
(Fig. 1). Faris et al. (2000) showed that Tsn1 mapped near
the distal end of the 5BL 0.750.76 deletion interval.
Therefore, the 25.5-cM genomic region considered for this
study includes all of the 5BL 0.750.76 deletion interval,
which accounts for approximately 16.8 cM, and extends
about 8.7 cM into the 5BL 0.760.79 deletion interval
(Fig. 1).
The wheat NSF-EST project conducted BLASTn/x
searches of the public databases for all mapped ESTs, and
the best hits were reported at http://wheat.pw.usda.gov/cgi-
bin/westsql/map_locus.cgi. For this study, ESM sequences
were subjected to BLASTn searches of the TIGR gene
indices, which led to the identification of longer TC se-
quences for all but six of the ESM sequences. BLASTx
searches of the NCBI nr database revealed that 17 (61%) of
the 28 sequences had significant similarities to known or
putative proteins, whereas the remaining 11 sequences had
no similarities to any sequences in the databases (Table 1).
Macrocolinearity of the genomic region
containing Tsn1 and rice
To evaluate the level of colinearity between the 25.5 cM of
wheat chromosome 5B spanning Tsn1 and the rice genome,
we subjected ESM sequences to BLAST searches of the
rice genomic sequences. Of the 28 ESM sequences, 12
(43%) had no significant similarity to rice genomic se-
quences (Table 1). Eleven of these 12 were the same ESM
sequences that had no similarities to known or putative
proteins. XBE590499 was the only ESM that detected no
similar sequences in the rice genome but was assigned a
putative protein function (Gamma-2 purothionin). The re-
maining 16 (57%) ESMs had significant similarity to se-
quences on seven different rice chromosomes (Table 1;
Fig. 1).
With the exception of Xfcc1, ESMs in the distal 13.9 cM
of the genetic map that detected similar rice sequences had
colinearity with rice chromosome 3 (Fig. 1). Five ESMs
within this region of wheat chromosome 5B were colinear
with a 7.8-Mb segment of the distal region of the long arm
of rice chromosome 3, but colinearity was interrupted by
Xfcc1 (rice chromosome 9). Five ESMs within the same
region had no similarity to any rice genomic sequences.
The proximal 11.6 cM of the wheat genetic map was a
mosaic of similarities to different rice chromosomes (Fig. 1).
However, three ESMs within this segment of the map had
significant similarity to sequences on rice chromosome 9.
These three ESMs together with Xfcc1 were colinear with a
1.2-Mb segment of rice chromosome 9.
Three ESM sequences had similarity to rice chromo-
some 2 sequences, but whereas two of them (XBE403968
and XBE500658) detected rice sequences near each other,
the third (Xcdo400) detected a rice sequence approxi-
mately 14 Mb away. Of the four remaining ESMs, three
detected sequences on different rice chromosomes (4, 7,
and 12), and the fourth (XBF484437) detected sequences
on rice chromosomes 5 and 11 with nearly equal signif-
icance (Table 1; Fig. 1).
At the macrolevel, the Tsn1 locus was flanked by one
ESM on the proximal side and two ESMs on the distal side
that detected no similar sequences in rice. Adjacent to these
ESMs, but farther from Tsn1, were ESMs with similarity to
different rice chromosomes (Fig. 1). Therefore, colinearity
between the wheat Tsn1 region and rice was not conserved
at the macrolevel.
93
Table 1 Predicted proteins of expressed sequence markers based on BLASTx searches of the NCBI database and the chromosome
assignments of corresponding rice orthologues based on the best BLASTn and tBLASTx hits to rice genomic sequences
GenBank Marker TC NCBI BLASTx Rice BLASTn Rice tBLASTx
Putative protein e
value
Rice
chromosome
a
e
value
Rice
chromosome
e
value
BF483510 XBF483510 TC262304 NS NS NS
BE403968 XBE403968 TC273054 NS NS NS
BF203136 XBF203136 TC248883 Unknown protein
[Oryza sativa]
0 2 0 2 6.9e232
BE590499 XBE590499 TC234385 Gamma-2 purothionin 1.0e22 NS NS
BE488792 XBE488792 TC236559 Putative phospholipase
D beta 1 [O. sativa]
0 9 0 9 2.3e292
BE426161 XBE426161 TC271278 Putative anthranilate
n-benzoyltransferase
[O. sativa]
2.0e60 9 9.3e70 9 5.0e56
BE403217 XBE403217 TC247678 Multidrug resistance
protein (MRP)-like ABC
transporter [O. sativa]
4.0e144 4 0 4 2.4e233
DT319102 XksuQ63 N/A NS NS NS
BF484437 XBF484437 N/A Hypothetical protein
[O. sativa]
2.0e73 5 2.3e92 11 3.6e69
BE442978 XBE442978 TC240270 NS NS NS
BM140357 XBM140357 TC262273 NS NS NS
DT319103 XksuQ11 TC271737 NS NS NS
BE500658 XBE500658 N/A Hypothetical protein
[O. sativa]
3.0e12 2 1.4e31 2 7.3e15
BE591798 XBE591798
TC255372 Putative nicotinate-
nucleotide
pyrophosphorylase
[Arabidopsis thaliana]
9.0e126 9 1.5e119 9 5.8e81
BE445619 XBE445619 TC242559 NS NS
BE439060 Xcdo400 TC251537 Putative ubiquitin-conjugating
enzyme E2 [O. sativa]
9.0e80 2 3.7e73 2 3.5e53
BE438984 Xbcd183 TC268348 Putative transducin WD-40
repeat protein [O. sativa]
2.0e78 7 4.1e93 7 3.7e63
BE403702 XBE403702 TC263127 PHD finger [O. sativa] 1.0e60 12 1.7e160 12 5.3e110
BE423505 XBE423505 TC250760 Serpin [Triticum aestivum] 0 3 5.0e171 3 6.1e219
BF483506 XBF483506 N/A NS NS NS
BM138151 XBM138151.1,
XBM138151.2
N/A NS NS NS
BE425878 XBE425878 TC254873 NS NS NS
CC249712 Xfcc1 TC235883 Ps16 protein 4.0e96 9 9.8e93 9 4.9e68
BE443610 XBE443610 TC238236 Mannosyltransferase (PIG-M)
domain containing
protein [O. sativa]
2.0e91 3 4.6e113 3 1.8e94
BE604920 Xksu919(Lpx) TC248004 Lipoxygenase 2
[Hordeum vulgare]
8.0e89 3 2.4e97 3 2.8e67
AI978442 Xrz575 TC275921 Putative somatic embryogenesis
related protein [O. sativa]
0 3 0 3 3.9e287
BG608197 XBG608197 N/A NS NS NS
BE439135 Xcdo465 TC250804 Cell wall beta-glucosidase
[Secale cereale]
2.0e95 3 2.4e119 3 1.6e74
TC Tentative consensus, NS No significant similarity, N/A TC sequences not available
a
Positions of hits to rice chromosomes are presented in Fig. 1
94
Wheat
genetic map
Tsn1
0.30
0.09
0.16
0.21
0.28
Xfcg9
Xfc
g
23
Xfcg25.2
Xfcg25.1
Xfcg26
Xfcg17
Xfcg22
Xfcg21
22.000
22.050
22.025
21.975
Cysteine proteinase
Unknown protein
Hypothetical protein
Potassium transporter
DHHC zinc finger
Protein kinase
Hypothetical protein
Hypothetical protein
Hypothetical protein
Hypothetical protein
Protein kinase
Protein kinase
Protein kinase
Mb Annotation
Rice
chromosome 9
cM
Marker
300
500
100
200
200
Tsn1
100
9
9
2
11
9
9
12
HG8
BM138151
FRIR
CS
HG9
HG10
DZF4
RNP
HG4
PT
DZF1
HG5
HG6
HG7
DZF2
DZF3
SSP1,2
HG1
SRK1
HG2
WAK1
SRK2
WAK3
HG3
400
WAK2
Wheat
physical map
Rice
chromosome
4
7
4
7
(9.714)
(0.232/4.593)
(0.232/4.593)
(24.605)
(19.352)
centromeretelomere
(31.003)
6
Fig. 2 Microcolinearity of putative genes within wheat BAC
contigs flanking the Tsn1 locus and the rice genome. BAC contigs
(center) are indicated in black, and a kilobase scale is indicated to
the left of each contig. Gene designations are indicated to the left of
the BAC contigs and shown in their relative positions along the
contigs as yellow boxes. The LDN × LDN-DIC 5B genetic map is
shown to the left in gray. Positions of markers used to anchor the
contigs to the genetic map are shown along the contigs as red lines.
Markers used to initiate the chromosome walk are shown in red
along the genetic map. Rice chromosomes harboring sequences with
significant similarity to genes in the BAC contigs are shown to the
right of the BAC contigs , and the megabase positions of the sig-
nificant hits along rice chromosomes (except for rice chromosome
9) are indicated in parentheses. The 75-kb region of rice chromo-
some 9 having colinearity with the Tsn1 region of wheat is shown to
the right, and the relative positions of genes are indicated as yellow
boxes. The transcriptional orientations of genes within the wheat
BAC contigs and putative orthologues on rice chromosome 9 are
indicated with purple arrows
95
Anchoring the BAC contig to the genetic map
We sequenced contigs of 205 and 548 kb flanking Tsn1
(Lu and Faris, unpublished data). The correct orientation
of ctg205 is unknown at this time due to the fact that no
recombinants were identified among 5,438 gametes using
markers Xfcg21 and Xfcg22 near the nonoverlapping ends
of the contig (Lu et al., unpublished data) (Fig. 2).
Probe FCG23 was developed from a low-copy region
near the distal end of ctg548 and mapped 0.28 cM distal to
Xfcg9 (Fig. 2), allowing the orientation of the initial 228 kb
contig to be determined (Lu et al., unpublished data). Sub-
sequent chromosome walking steps led to the development
of probes FCG25 and FCG26. Probe FCG25, which is a
fragment of a DHHC-type zinc finger (DZF)-like gene
(see below), detected two loci (Xfcg25.1 and Xfcg25.2)
that mapped 0.09 and 0.25 cM from Tsn1, respectively
(Fig. 2). The probe FCG26, which is a fragment of a po-
tassium transporter (PT)-like gene, cosegregated with Tsn1
in the 5,438 gametes. Therefore, ctg548 spans 0.74 cM, and
the genetic distance between ctg205 and ctg548 is 0.3 cM
(Fig. 2). The cumulative physical to genetic distance ratio
across both contigs is approximately 1 Mb/cM.
Gene prediction
A total of 26 putative genes were identified in the BAC
contigs for an average gene density of one gene/29 kb. Ten
putative genes were identified on ctg205 (Table 2; Fig. 2).
Two predicted open reading frames (ORFs) in the same
orientation at positions 3362,312 and 4,3897,025 of
ctg205 had significant similarity to subtilisin-like serine
protease (SSP) genes from rice as indicated by BLASTx
alignments. SSP1 and SSP2 were 1,977 and 2,637 bp in
size, respectively, and were separated by a 2,077-bp down-
stream region of SSP1, which was also duplicated down-
stream of SSP2. The two genes shared 96% identity at the
nucleotide level including the duplicated downstream re-
gions, and SSP2 had seven exons whereas SSP1 had six, but
the 660-bp difference in size between the two genes was
primarily due to a much larger intron 3 in SSP2. A wheat
cDNA (CB307830) was identified that had 96% similarity
to both SSP1 (less than e155) and SSP2 (less than e0),
indicating that at least one of the genes is functional. The
cDNA sequence was used to validate and/or correct pre-
dicted exons 35ofSSP1 and exons 46ofSSP2. SSP1 and
SSP2 coded for proteins of 441 and 458 amino acids,
respectively, and they shared 76% identity at that level.
Two putative S-receptor kinase (SRK)-like genes in the
same orientation were identified on ctg205 (Table 2; Fig.
2).
SRK1 occupied 2,556 bp at position 25,04727,602. This
gene was predicted as having two exons, but alignments
with multiple wheat cDNAs with significant similarity
(less than e88) and a rice homolog from chromosome 7
with greater than 80% identity at the nucleotide level
indicated that SRK1 contained a single exon. The pre-
dicted protein of SRK1 contained 851 amino acids. The
SRK2 gene was 1,077 bp long and resided at position
107,511108,587 of ctg205. The gene had one predicted
exon, and it had significant similarity (e47) with a wheat
cDNA clone (CD877820). However, the degree of sim-
ilarity and alignment length between SRK2 and homolo-
gous cDNA sequences from wheat and other species were
not sufficient to validate the structure of SRK2 . SRK1 and
SRK2 were nearly 80 kb apart, and the predicted ORFs
differed in size by 1,479 bp, but when we aligned the full-
length SRK1 gene to the predicted ORF of SRK2 plus the
1,479 bp 3 flanking region, we found that the two se-
quences were 94% identical, and the predicted ORF of
SRK2 was truncated due to an apparent mutation at position
1,075 of the gene that induced a premature stop codon.
Therefore, SRK1 and SRK2 are likely paralogs, but it is
unkown whether SRK2 is functional in its putatively trun-
cated form. Comparisons of the 5 upstream regions of the
two genes indicated a conserved duplication residing at
position 316 to 744 from the start codon of SRK1 and at
position 475 to 888 from the start codon of SRK2, which
retained 93% identity. Residing at position 1,390 to
1,478 from the start codon of SRK2 was a microsatellite
consisting of GT(44) repeats, which was used to develop
the marker Xfcp1 for marker-assisted selection as pre-
viously described (Lu et al., unpublished data). No sim-
ilarity was retained in the 3 flanking regions of the two
genes. The predicted protein of SRK2 was 385 amino acids
long, and it shared 65% identity with SRK1 at the amino
acid level.
Three wall-associated kinase (WAK)-like genes were
identified on ctg205 based on BLASTx alignments and
gene prediction (Table 2; Fig. 2). WAK1, WAK2, and WAK3
were all in the same orientation and resided at positions
71,40773,322, 99,317100,291, and 113,862114,725,
respectively. The predicted ORF of WAK1 contained four
exons and was 1,916 bp in size. WAK2, which contains a
fragment of the AFLP-derived RFLP marker Xfcg17 (Haen
et al. 2004) used for initial screening of the BAC library,
contained a single exon and was 975 bp. The predicted
ORF of WAK3 included 864 bp of sequence with sig-
nificant similarity to other WAK genes as indicated by
BLASTx alignments, but the 3 end of the predicted ORF
extended 2,691 bp into the adjacent non-LTR LINE ele-
ment Karin. Therefore, WAK3 may be a pseudogene that
resulted from interruption by the retrotransposon. The ge-
nomic sequences of the three WAK genes had significant
similarities to multiple cDNAs from various grass species,
but alignments did not allow confirmation of structure or
function because significance values were not sufficiently
high to make inferences regarding orthology. The predicted
proteins for WAK1 and WAK2 were 463 and 324 amino
acids, respectively. No significant similarity was among the
three WAK genes at the nucleotide level, but WAK2 and the
pseudogene WAK3 shared 53% amino acid identity.
Three ORFs were predicted in regions of ctg205 that did
not appear to harbor repetitive elements, and were therefore
considered putative genes. The hypothetical genes desig-
nated HG1, HG2, and HG3 resided at positions 10,088
11,067, 47,43748,063, and 125,413126,204 of ctg205
(Table 2; Fig. 2). HG2 and HG3 were in the same orien-
96
Table 2 Characteristics of the 26 predicted genes in Langdon BAC contigs flanking Tsn1
Name Description BAC ctg Strand Position Gene length (bp) No. of exons Protein
size
Best BLASTn hit to
expressed sequences
SSP1 Subtilisin-like serine protease 378P21 205 + 3362,312 1,977 6 441 Wheat cDNA clone CB307830 (e155)
SSP2 Subtilisin-like serine protease 378P21 205 + 4,3897,025 2,637 6 458 Wheat cDNA clone CB307830 (e0)
HG1 Hypothetical gene 378P21 205 10,08811,067 980 4 286 No significant hits
SRK1 S-receptor kinase 378P21 205 25,04727,602 2,556 1 851 Wheat cDNA clone DN829563 (e148)
HG2 Hypothetical gene 378P21 205 + 47,43748,063 627 2 192 No significant hits
WAK1 Wall-associated kinase 378P21, 1154L7 205 71,40773,322 1,916 4 463 Wheat cDNA clone CK207836 (e51)
WAK2 Wall-associated kinase 378P21, 1154L7 205 99,317100,291 975 1 324 Rice cDNA clone AK102702 (e26)
SRK2 S-receptor kinase 1154L7 205 107,511108,587 1,077 1 358 Wheat cDNA clone CD877820 (e47)
WAK3
a
Probable pseudogene;
wall-associated kinase
1154L7 205 113,862114,725 NA NA NA Rice cDNA clone AK102702 (e40)
HG3 Hypothetical gene 1154L7 205 + 125,413126,204 792 1 263 Wheat cDNA clone CD373786 (e103)
RNP U2 snRNP auxiliary factor 1089I4 548 8,02312,273 4,251 11 539 Durum wheat cDNA BE428242 (e0.0)
HG4 Hypothetical gene 1089I4 548 15,13816,997 1,860 5 259 Wheat cDNA clone CD900543 (e95)
PT Potassium transporter 1089I4 548 17,59222,619 5,028 9 785 Rice cDNA clone AK065464 (e0.0)
DZF1 DHHC-type zinc finger 1089I4 548 + 33,96738,702 4,736 6 547 Wheat cDNA clone CK199365 (e0.0)
HG5 Hypothetical gene 1089I4 548 40,54241,716 1,175 3 337 Wheat cDNA clone CK205827 (e121)
HG6 Hypothetical gene 1089I4 548 56,01256,722 711 2 236 Wheat cDNA clone BJ309122 (e0.0)
DZF2 DHHC-type zinc finger 1089I4 548 69,16269,830 669 1 222 Wheat cDNA clone CK199365 (e0.0)
DZF3
a
Probable pseudogene;
DHHC-type zinc finger
1089I4 548 + 83,04283,973 NA NA NA Rice cDNA clone AU093319 (e105)
HG7 Hypothetical gene 1089I4 548 + 96,32398,134 1,812 6 132 Wheat cDNA clone BE401011 (e154)
DZF4 DHHC-type zinc finger 239E17 548 + 145,662149,420 3,759 7 471 Wheat cDNA clone BE500355 (e116)
HG8 Hypothetical gene 50F12, 533E21 548 311,888312,388 501 1 166 No significant hits
BM138151
b
Possible pseudogene:
unknown gene
50F12, 533E21 548 + 320,025320,777 NA NA NA Wheat cDNA clone BM138151 (e0)
FRIR
a
Probable pseudogene; far-red
impaired response protein
50F12, 533E21 548 328,073328,672 NA NA NA No significant hits
CS
a
Probable pseudogene;
callose synthase
533E21 548 + 369,803371,154 NA NA NA Wheat cDNA clone BJ220565 (e0.0)
HG9 Hypothetical gene 533E21 548 + 377,414382,379 4,966 7 432 Barley cDNA clone BJ467479 (e102)
HG10 Hypothetical gene 533E21, 404J6 548 401,442404,054 2,613 6 553 Barley cDNA clone AV834198 (e102)
a
Probable pseudogenes identified based on significant BLASTx alignments to known proteins. No corresponding ORFs were predicted
b
Possible pseuodogene identified based on alignment of expressed sequence tag (EST) sequence with BAC sequence, but no corresponding ORF was predicted. BM138151 detects a second
locus on chromosome 5BL (Fig. 1), which may represent the functional copy
97
tation (plus strand), whereas HG1 was the opposite. HG1
had four predicted exons and was 980 bp, with a predicted
protein consisting of 286 amino acids. BLASTx and
BLASTn alignments indicated that HG1 shared 37 (e17)
and 81% (e 36) identity at the amino acid and nucleotide
levels, respectively, with a putative protein (AY491681)
detected at the hardness locus in Triticum monococcum
(Chantret et al. 2004). However, no homologous cDNA
sequences from wheat or other grasses were identified,
suggesting that either HG1 is not a functional gene or it
is expressed at very low levels. HG2 had two predicted
exons and was 627 bp long, with a predicted protein se-
quence consisting of 192 amino acids. No significant
BLASTn or BLASTx hits were obtained, suggesting that
either it is a novel gene that is transcribed at low levels or it
is not a real gene. HG3 had a single predicted exon, was
792 bp, and had 92% identity (e103) with a wheat cDNA
(CD373786), indicating that it is a functional gene. It is
interesting to note that all three HG genes on ctg205 had GC
contents greater than 65%.
Seventeen putative genes were identified on ctg548.
Analysis of BLASTx alignments indicated a gene with
significant similarity (less than e100) to U2 snRNP aux-
iliary factor (RNP)-like genes. The RNP resided at position
8,02312,273 and was 4,251 bp long (Table 2; Fig. 2). A
barley cDNA consensus sequence (TC135043) with 91%
identity (e185) and a durum wheat cDNA (BE428242)
with 100% identity (e173) were used to correct the pre-
dicted intron/exon structure of the gene and validate that it
was functional. RNP contained 11 exons and coded for a
protein consisting of 539 amino acids. The gene contained
two RNA recognition motifs (RRMs), which are diagnostic
for RNA-binding proteins.
About 5.3 kb from the RNP gene at position 17,592
22,619 was gene with significant similarity to PT-like genes
(Table 2; Fig. 2). The PT gene was 5,208 bp long and had
nine exons based on the predicted structure. The structure
and function of the PT gene was validated by alignments
with a full-length rice cDNA (AK065464) with 85% iden-
tity (e0) and a durum wheat cDNA (CK206884) with 96%
identity (e0), respectively. The PT gene coded for a pro-
tein consisting of 785 amino acids.
Four putative DZF-like genes within 115 kb were iden-
tified on ctg548 (Table 2; Fig. 2) based on BLASTx align-
ments with e values ranging from e73 to e
168. The
predicted ORF of DZF1 occupied position 33,96738,702
and was 4,736 bp long with six exons. Attempts to identify
corresponding full-length cDNAs were not successful, but
the existence of the last exon was validated by alignment
to the wheat cDNA CK199365 (99% identity; e0). The
predicted protein of DZF1 contained 547 amino acids.
Approximately 30.5 kb downstream from DZF1 and in
the opposite orientation was DZF2. The predicted ORF of
DZF2 contained a single exon of 669 bp and resided at
position 69,16269,830. This gene shared 95% identity
(e0) with wheat cDNA CK199365 across 428 bp.
Therefore, the wheat cDNA CK199365 most likely cor-
responds to the DZF1 gene because they shared a higher
percent identity over a longer segment. This suggests that
DZF1 is functional, but whether or not DZF2 is functional
is inconclusive.
DZF3 was identified based on BLASTx alignments at
position 83,04283,973, but no corresponding ORF was
predicted with any of the gene prediction programs. There-
fore, DZF3 may be a pseudogene lacking a start codon.
DZF4 occupied position 145,662149,420 of ctg548
and had a predicted ORF consisting of seven exons. This
gene was 3,759 bp long and in the same orientation as
DZF1. Alignments with the wheat cDNA BE500355,
which had 92% identity (e0), allowed the determination
of function and the validation of exons 46, but cDNAs
corresponding to the rest of the gene could not be iden-
tified. The predicted protein of DZF4 consisted of 471
amino acids.
Comparisons of the genomics sequences of the DZF
genes with each other indicated that DZF1, DZF2, and the
pseudogene DZF3 were more similar to each other, where-
as DZF4 was slightly more divergent. Similarity among the
genes was mainly confined to the 3 ends of DZF1 and
DZF4, which corresponded to the entire DZF3 genomic
region and DZF2 ORF. Within this segment, DZF1, DZF2,
and DZF3 had identities ranging from 93 to 96% with
each other, whereas comparisons of these three genes
with DZF4 yielded identities ranging from 86 to 91%.
Comparisons at the amino acid level yielded similar results.
DZF1, DZF2, and DZF3 shared greater than 77% identity
with each other, but
DZF4 had less than 57% identity with
the other three. No similarity was identified in 1.5 kb of the
5 regions of the four DZF genes, but conservation was
observed in the 3 regions where similarities ranged from
38% between the 3 regions of DZF2 and DZF3 to 65%
between the same regions of DZF1 and DZF3.
Seven hypothetical genes were predicted on ctg548 and
designated HG4HG10. HG4 was predicted as a 1,860-bp
gene containing five exons at position 15,13816,997.
Several wheat cDNAs with 89% identity (e95) to HG4
were identified, but aligned within the second predicted
intron. Therefore, the predicted structure of HG4 could not
be validated, and its functionality, including production of
its 259-amino-acid protein, is questionable.
HG5 was predicted as being 1,175 bp long with three
exons at position 40,54241,716. A wheat cDNA clone
(CK205827) with 87% identity was identified, but align-
ment with HG5 extended from the first predicted exon
through the first intron and into the second exon. Due to the
relatively low identity and poor structural alignment, this
cDNA may not represent the corresponding coding se-
quence for HG5, and whether HG5 is a real functional gene
is left to be determined.
About 14 kb downstream from HG5 at position 56,012
56,722 was HG6, which was 711 bp long. A wheat cDNA
clone (BJ309122) was 91% identical (e0) and used to
validate the structure of the ORF as having a single exon.
The high degree of similarity between HG6 and the wheat
cDNA suggests that the gene, which encodes a protein
consisting of 237 amino acids, is functional.
HG7 was located at position 96,32398,134 and pre-
dicted as being 1,812 bp long with six exons. The wheat
98
cDNA clone BE401011 was 98% identical (e154) to
HG7, suggesting the gene is functional. However, only the
fourth and fifth exons could be confirmed by alignments to
the cDNA. HG7 was predicted to encode a protein con-
sisting of 132 amino acids.
A 501-bp gene (HG8) was predicted as having a single
exon and coding for a 166-amino-acid protein at position
311,888312,388. No cDNA clones with significant sim-
ilarity to HG8 were identified, indicating that either it is not
a real gene or it is expressed at very low levels.
HG9 was predicted as a 4,966-bp gene with seven exons
at position 377,414382,379. BLASTn alignments with
sequences in the EST databases indicated a large number of
barley and wheat ESTs with significant similarity (greater
than 90% identity) to HG9, but alignments were confined
to the last predicted exon, which was 256 bp in size. This
suggests that at least the last exon probably represents a
repetitive sequence, and because the gene had no signif-
icant similarity to known repetitive elements, it may rep-
resent an unclassified element.
Occupying position 401,442404,054 on ctg548 was
HG10. The predicted ORF of this gene was 2,613 bp
long; it had six exons and a predicted protein consisting
of 553 amino acids. A barley cDNA (AV834198) shared
90% identity (e102) over a 305-bp segment, which cor-
responded to the last exon. This suggests that HG10 may
be functional, but we were unable to validate its structure.
The wheat cDNA clone BM138151 was mapped as an
RFLP marker in the CS × CS-DIC 5B population (Lu et al.,
unpublished data) and used to assess macrocolinearity as
described above. BLASTn alignments of BM138151 against
ctg548 revealed its position on the contig at 320,025
320,777. No corresponding ORF for BM138151 was pre-
dicted, but it should be noted that in addition to this locus,
BM138151 detects a second more distal locus on 5BL
(Fig. 1). Therefore, it is possible that the more distal locus
harbors a functional gene, while the locus detected in
ctg548 is actually a pseudogene.
In addition to WAK3, DZF3, and BM138151, we iden-
tified two additional putative pseudogenes. Genes with
similarity to a far-red impaired response (FRIR)-like pro-
tein (e21) (position 328,073328,672) and a callose
sythase (CS)-like protein (e26) (369,803371,154) were
identified based on BLASTx alignments, but no corre-
sponding ORFs were predicted. No cDNAs correspond-
ing to the FRIR gene were identified, but a wheat cDNA
(BJ220565) with 93% identity (e0) to the CS gene was
identified, suggesting that this gene is functional at least in
some genotypes. However, it is interesting to note that the
Table 3 Predicted genes within
wheat BAC contigs and their
best BLASTn and tBLASTx hits
to rice genomic sequences
NS No significant similarity
a
Positions of hits to rice
chromosomes are presented
in Fig. 2
Predicted gene Rice BLASTn Rice tBLASTx
Rice chromosome
a
e value Rice chromosome e value
SSP1 2 5.6e125 2 8.8e65
SSP2 2 8.3e120 2 9.0e120
HG1 NS
b
NS
SRK1 4 3.0e191 4 4.7e251
7 2.6e251 7 3.7e253
HG2 NS NS
WAK1 9 4.4e85 9 1.1e112
WAK2 NS NS
SRK2 4 4.5e91 4 6.9e82
7 1.7e95 7 3.8e81
WAK3 NS 9 8.2e20
HG3 NS NS
RNP 11 4.0e179 11 1.8e138
HG4 NS NS
PT 9 4.2e261 9 5.0e298
DZF1 9 2.0e195 9 9.1e192
HG5 NS NS
HG6 NS NS
DZF2 9 2.0e74 9 2.3e72
DZF3 9 3.5e140 9 5.7e94
HG7 NS NS
DZF4 9 1.5e68 9 1.4e93
HG8 NS NS
BM138151 (EST) NS NS
FRIR NS 12 4.4e27
CS NS 6 5.4e28
HG9 NS NS
HG10 NS NS
99
CS gene in LDN is flanked by gagpol polyprotein-like
sequences, which may have disrupted its function.
Microcolinearity of the Tsn1 locus with rice
BLAST analysis of the 26 putative genes within the BAC
contigs showed that 14 (54%) had homologous sequences
distributed among six different rice chromosomes. These
14 gene sequences detected only nine unique loci in rice
because the two wheat SSP genes had significant similarity
to a single SSP gene on rice chromosome 2, the four DZF
genes had similarity to a single DZF gene on rice chro-
mosome 9, and the two SRK genes detected single loci on
rice chromosomes 4 and 7 with nearly equal significance
(Table 3; Fig. 2).
Four of the ten putative genes on ctg205 had no sim-
ilarity to any sequences in the rice genome, whereas the
remaining six detected orthologous loci on at least three
different rice chromosomes (Fig. 2). Only five of the 13
unique gene sequences (DZF2, DZF3, and DZF4 excluded)
on ctg548 detected orthologous loci in rice, and they were
distributed on four different rice chromosomes.
The WAK1, WAK3, PT,andDZF genes all had significant
similarity to sequences spanning a segment of about 65 kb
on rice chromosome 9 (Fig. 2). These four genes appear to
be colinear between wheat and rice; however, as mentioned
above, the correct orientation of ctg205 containing WAK1
and WAK3 is unknown. If the correct orientation is as shown
in Fig. 2,thenWAK1 and WAK3 are colinear, but the genes
are in opposite transcriptional orientation compared to the
rice orthologues. If the correct orientation is for the contig to
be inverted, then WAK1 and WAK3 wouldbeinthesame
transcriptional orientation as the rice orthologues, but their
order would be inverted and they would not be colinear.
WAK1 and WAK3 are separated by about 40 kb in wheat,
whereas the orthologous rice sequences are separated by
about 30 kb. Between the WAK1 and WAK3 orthologues in
rice are four hypothetical proteins and a putative retro-
element (not shown), whereas WAK2 and SRK2 separate the
two genes in wheat. On ctg548, the PT and DZF genes in
wheat are in the same order and orientation as the rice
orthologues. The two genes are separated by about 10 kb in
wheat, which included a DNA transposon (not shown),
whereas they are separated by only 3.5 kb in rice. None of
the rice hypothetical proteins between the WAK1 and WAK3
orthologues on rice chromosome 9 had similarity to any
sequences on the wheat BAC contigs.
In addition to breaks in colinearity between WAK1 and
WAK3, breaks in colinearity with rice chromosome 9 oc-
curred between WAK1 (or WAK3, depending on the correct
orientation of ctg205) and the
PT gene. This included the
RNP gene, which was highly significant in similarity to an
RNP orthologue on rice chromosome 11 (Table 3).
Discussion
Macrocolinearity of a 25-cM segment spanning Tsn1
and the rice genome
Comparative analysis of the wheat and rice genomes has
been conducted by investigating syntenic relationships of
bin-mapped ESTs with rice genomic sequences without
prior knowledge of the genetic order of EST loci within
bins (Sorrells et al. 2003; Conley et al. 2004; Francki et al.
2004; Hossain et al. 2004; La Rota and Sorrells 2004;
Linkiewicz et al. 2004; Munkvold et al. 2004; Peng et al.
2004). However, use of genetically resolved EST loci
within chromosome bins can reveal more regarding the
length and organization of syntenic units between wheat
and rice. In this study, genetic resolution of 29 ESM loci
spanning 25.5 cM and encompassing the Tsn1 gene al-
lowed us to determine colinearity between wheat and rice
at the macrolevel.
The wheat homoeologous group 5 chromosomes have
been shown to be the least conserved of the homoeologous
groups compared to rice chromosomes (Sorrells et al. 2003;
La Rota and Sorrells 2004). Our results agree with this
notion because while detectable levels of macrocolinearity
were observed with rice chromosomes 3 and 9, eight ESMs
detected sequences on other rice chromosomes. Of these,
XBE403968 and XBE500658 detected loci approximately
100 kb apart on rice chromosome 2 and could have been
the result of a single evolutionary event. The other six
ESMs were not adjacent in the rice genome and probably
do not represent single translocation events.
Genes in BAC contigs
Fourteen of the 26 putative genes identified on the BAC
contigs had significant similarity to known proteins, al-
lowing us to infer putative functions. However, whereas
corresponding ORFs were predicted for ten of these, the
gene prediction programs used in this study did not indicate
the presence of ORFs for WAK3, DZF3, FRIR, and CS.
Analysis of flanking sequences provided evidence that
WAK3 and CS were possibly disrupted by transposons, but
no such events were evident for DZF3 and FRIR.Itis
possible that these two genes, especially DZF3 given the
presence of three other DZF genes with potential function,
were superfluous and degenerated over time.
Of the ten predicted genes in the BAC contigs with no
similarity to known proteins, seven were strongly sup-
ported by ESTs and possibly represent real genes. The
other three hypothetical genes with no similarity to ex-
pressed sequences could be degenerated genes from un-
known repetitive elements. Therefore, further analysis of
these putative genes is necessary to determine if they are
biologically valid.
100
Inclusion of all 26 putative genes in gene density cal-
culations results in an average density of one gene/29 kb.
Exclusion of the hypothetical genes and the putative pseu-
dogenes leads to a more conservative estimate of one
gene/68 kb. However, it should also be noted that only six of
the 11 genes used in the latter calculation could be con-
fidently validated.
The discovery of four DZF genes within a 115-kb seg-
ment of ctg548 is a good example of gene duplication in
wheat that did not occur in rice. Higher frequencies of gene
duplication in wheat compared to rice have been reported in
other macrocolinearity (Anderson et al. 1992; Dubcovsky
et al. 1996; Akhunov et al. 2003) and microcolinearity
(Yan et al. 2003; Chantret et al. 2004) studies. It has also
been suggested that a higher frequency of duplications
occur in distal regions of wheat chromosomes (Akhunov
et al. 2003). The duplications of DZF, SSP, and SRK genes
observed in this study support that trend because Tsn1 and
the region under study lie in the distal 25% of chromosome
5BL. It is interesting to note that hybridization experiments
of DZF gene sequences indicated that only one copy resides
on wheat chromosome 5A (data not shown). Therefore,
multiple duplications of this gene likely occurred after the
divergence of a common Triticeae ancestor.
Microcolinearity between the Tsn1 locus and rice
Variable results have been reported in studies of colinearity
between wheat and rice at the microlevel. Most wheatrice
microcolinearity studies have shown better conservation in
proximal regions of wheat chromosomes (Roberts et al.
1999; SanMiguel et al. 2002; Yan et al. 2003; Distelfeld et
al. 2004) compared to distal regions (Feuillet and Keller
1999; Li and Gill 2002; Yan et al. 2004; Guyot et al. 2004).
For example, Yan et al. (2003) found a high level of micro-
colinearity between a proximal region of wheat chromo-
some 5A
m
L and rice chromosome 3 with the exception of
two tandem gene duplications in wheat, and this informa-
tion was successfully used to clone Vrn1. On the other hand,
a region containing Sh2/X1/X2/A1 genes in rice chromo-
some 1 was conserved in sorghum and maize (Bennetzen
and Ramakrishna 2002), but only X2/A1 were detected in
the colinear region of the distal region of homoeologous
group 3 in wheat whereas Sh2/X1 were found in a non-
colinear region of the long arm of homoeologous group 1
(Li and Gill 2002).
An exception to the idea that distal regions are less con-
served than proximal regions was reported by Chantret
et al. (2004). They sequenced a 101-kb BAC from the
distal region of chromosome 5A
m
S, which included three
genes associated with grain texture, and found that all three
were in the same order and orientation as in rice chro-
mosome 12. In addition, the genes were separated by sim-
ilar physical distances.
Our study supports the trend of less conservation in
distal regions. Within the BAC contigs at the Tsn1 locus, 14
genes detected orthologues on six different rice chromo-
somes, and multiple instances of adjacent wheat genes de-
tecting orthologues on different rice chromosomes were
observed. For example, along ctg205, WAK1 and WAK3
were apparently colinear with rice chromosome 9, but this
colinearity was interrupted by SRK2, which is a likely
paralog of SRK1. Also, little physical distance separated PT
from RNP, which detected orthologues on rice chromo-
somes 9 and 11, respectively. Most of these observations
probably represent multiple translocation or transposition
events that led to the shuffling and rearranging of genetic
material in this region. Alternatively, it may reflect multiple
instances of single gene transposition events.
Previous research has indicated that Brachypodium is
more closely related to wheat than rice (Draper et al. 2001).
We screened a Brachypodium sylvaticum BAC library
(Foote et al. 2004) with a highly conserved portion of the
wheat PT gene but found no positive clones (Lu and Faris,
unpublished data). We then screened the library with a
fragment of the DZF gene and identified a single positive
clone. This BAC was hybridized with various fragments of
WAK1, WAK3, RNP, and PT genes, but none of them
hybridized, indicating that, relative to wheat, this region in
Brachypodium may be even less conserved than rice.
The RNP and PT genesonctg548kbcosegregatedwith
Tsn1 in 5,438 gametes, but they do not appear to be can-
didate genes (Lu and Faris, unpublished data). Therefore, it
is likely that we have not yet reached Tsn1 by chromosome
walking, and it most likely lies within the gap between the
contigs. If markers derived from the three putative rice genes
between the WAK3 and PT orthologues on chromosome 9
detect loci within the gap between the contigs, then it may be
concluded that rice genomic information is useful for the
map-based cloning of Tsn1. However, it is important to note
that the colinearity observed with rice chromosome 9 was
undetectable until we had conducted multiple chromosome
walking steps that traversed nearly 400 kb in wheat.
Acknowledgement The project was supported by the National
Research Initiative of the USDA Cooperative State Research, Education
and Extension Service, grant number 2003-35300-13109 to J.D.F.
References
Ahn S, Anderson JA, Sorrells ME, Tanksley SD (1993) Homoeo-
logous relationships of rice, wheat and maize chromosomes.
Mol Gen Genet 241:483490
Akhunov ED, Goodyear AW, Geng S, Qi LL, Echalier B, Gill BS,
Miftahudin, Gustafson JP, Lazo G, Chao S, Anderson OD,
Linkiewicz AM, Dubcovsky J, La Rota M, Sorrells ME, Zhang
D, Nguyen HT, Kalavacharla V, Hossain K, Kianian SF, Peng J,
Lapitan NL, Gonzalez-Hernandez JL, Anderson JA, Choi DW,
Close TJ, Dilbirligi M, Gill KS, Walker-Simmons MK, Steber C,
McGuire PE, Qualset CO, Dvorak J (2003) The organization and
rate of evolution of wheat genomes are correlated with re-
combination rates along chromosome arms. Genome Res 13:
753763
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W,
Lipman DJ (1997) A new generation of protein database search
programs. Nucleic Acids Res 25:33893402
Anderson JA, Ogihara Y, Sorrells ME, Tanksley SD (1992) Devel-
opment of a chromosomal arm map for wheat based on RFLP
markers. Theor Appl Genet 83:10351043
101
Bennett MD, Leitch IJ (1995) Nuclear DNA amounts in angio-
sperms. Ann Bot 76:113176
Bennetzen JL (2000) Comparative sequence analysis of plant nu-
clear genomes: microcolinearity and its many exceptions. Plant
Cell 12:10211029
Bennetzen JL, Ramakrishna W (2002) Numerous small rearrange-
ments of gene content, order and orientation differentiate grass
genomes. Plant Mol Biol 48:821827
Chantret N, Cenci A, Sabot F, Anderson O, Dubcovsky J (2004)
Sequencing of the Triticum monococcum hardness locus re-
veals good microcolinearity with rice. Mol Genet Genomics
271:377386
Cenci A, Chantret N, Kong X, Gu Y, Anderson OD, Fahima T,
Distelfeld A, Dubcovsky J (2003) Construction and character-
ization of a half million clone BAC library of durum wheat
(Triticum turgidum ssp. durum). Theor Appl Genet 107:931939
Conley EJ, Nduati V, Gonzalez-Hernandez JL, Mesfin A, Trudeau-
Spanjers M, Chao S, Lazo GR, Hummel DD, Anderson OD, Qi
LL, Gill BS, Echalier B, Linkiewicz AM, Dubcovsky J, Akhunov
ED, Dvorak J, Peng JH, Lapitan NL, Pathan MS, Nguyen HT,
Ma XF, Miftahudin, Gustafson JP, Greene RA, Sorrells ME,
Hossain KG, Kalavacharla V, Kianian SF, Sidhu D, Dilbirligi M,
Gill KS, Choi DW, Fenton RD, Close TJ, McGuire PE, Qualset
CO, Anderson JA (2004) A 2600-locus chromosome bin map of
wheat homoeologous group 2 reveals interstitial gene-rich islands
and colinearity with rice. Genetics 168:625637
Delaney D, Nasuda S, Endo TR, Gill BS, Hulbert SH (1995a)
Cytogenetically based physical maps of the group-2 chromo-
somes of wheat. Theor Appl Genet 91:568573
Delaney D, Nasuda S, Endo TR, Gill BS, Hulbert SH (1995b)
Cytogenetically based physical maps of the group-3 chromo-
somes of wheat. Theor Appl Genet 91:780782
Devos KM, Gale MD (2000) Genome relationships: the grass model
in current research. Plant Cell 12:637646
Distelfeld A, Uauy C, Olmos S, Schlatter AR, Dubcovsky J, Fahima
T (2004) Microcolinearity between a 2-cM region encompass-
ing the grain protein content locus Gpc-6B1 on wheat chro-
mosome 6B and a 350-kb region on rice chromosome 2. Funct
Integr Genomics 4:5966
Draper J, Mur LAJ, Jenkins G, Ghosh-Biswas GC, Bablak P,
Hasterok R, Routledge APM (2001) Brachypodium dys-
tachyon: a new model system for functional genomics in
grasses. Plant Physiol 127:15391555
Dubcovsky J, Luo M-C, Zhong G-Y, Bransteiter R, Desai A, Kilian
A, Kleinhofs A, Dvorak J (1996) Genetic map of diploid wheat,
Triticum monococcum L., and its comparison with maps of
Hordeum vulgare L. Genetics 143:983999
Endo TR, Gill BS (1996) The deletion stocks of common wheat.
J Hered 87:295307
Erayman M, Sandhu D, Sidhu D, Dilbirligi M, Baenziger PS, Gill
KS (2004) Demarcating the gene-rich regions of the wheat ge-
nome. Nucleic Acids Res 32:35463565
Faris JD, Anderson JA, Francl LJ, Jordahl JG (1996) Chromosomal
location of a gene conditioning insensitivity in wheat to a necrosis-
inducing culture filtrate from Pyrenophora tritici-r epentis. Phyto-
pathology 86:459463
Faris JD, Haen KM, Gill BS (2000) Saturation mapping of a gene-rich
recombination hot spot region in wheat. Genetics 154:823835
Faris JD, Fellers JP, Brooks SA and Gill BS (2003) A bacterial
artificial chromosome contig spanning the major domestication
locus Q in wheat and identification of a candidate gene.
Genetics 164:311321
Feuillet C, Keller B (1999) High gene density is conserved at
syntenic loci of small and large grass genomes. Proc Natl Acad
Sci U S A 96:82658270
Feuillet C, Keller B (2002) Comparative genomics in the grass
family: molecular characterization of grass genome structure
and evolution. Ann Bot 89:310
Foote T, Griffiths S, Allouis S, Moore G (2004) Construction and
analysis of a BAC library in the grass Brachypodium sylvat-
icum: its use as a tool to bridge the gap between rice and wheat
in elucidating gene content. Funct Integr Genomics 4:2633
Francki M, Carter M, Byan K, Hunter A, Bellgard M, Appels R
(2004) Comparative organization of wheat homoeologous group
3S and 7L using wheatrice synteny and identification of po-
tential markers for genes controlling xanthophylls content in
wheat. Funct Integr Genomics 4:1 18130
Gill KS, Gill BS, Endo TR (1993) A chromosome region-specific
mapping strategy reveals gene-rich telomeric ends in wheat.
Chromosoma 102:374381
Gill KS, Gill BS, Endo T, Boyko EV (1996a) Identification and
high-density mapping of gene-rich regions in chromosome
group 5 of wheat. Genetics 143:10011012
Gill KS, Gill BS, Endo TR, Taylor T (1996b) Identification and
high-density mapping of gene-rich regions in chromosome
group 1 of wheat. Genetics 144:18831891
Goff SA, Ricke D, Lan TH et al (2002) A draft sequence of the rice
genome (Oryza sativa L. ssp. japonica). Science 296:92100
Guyot R, Yahiaoui N, Feuillet C, Keller B (2004) In silico com-
parative analysis reveals mosaic conservation of genes within a
novel collinear region in wheat chromosome 1AS and rice
chromosome 5S. Funct Integr Genomics 4:4758
Haen KM, Lu H, Friesen TL, Faris JD (2004) Genomic targeting
and high-resolution mapping of the Tsn1 gene in wheat. Crop
Sci 44:951962
Hohmann U, Endo TR, Gill KS, Gill BS (1994) Comparison of
genetic and physical maps of group 7 chromosomes from
Triticum aestivum L. Mol Gen Genet 245:644653
Hossain KG, Kalavacharla V, Lazo GR, Hegstad J, Wentz MJ,
Kianian PM, Simons K, Gehlhar S, Rust JL, Syamala RR,
Obeori K, Bhamidimarri S, Karunadharma P, Chao S, Anderson
OD, Qi LL, Echalier B, Gill BS, Linkiewicz AM, Ratnasiri A,
Dubcovsky J, Akhunov ED, Dvorak J, Miftahudin, Ross K,
Gustafson JP, Radhawa HS, Dilbirligi M, Gill KS, Peng JH,
Lapitan NL, Greene RA, Bermudez-Kandianis CE, Sorrells
ME, Feril O, Pathan MS, Nguyen HT, Gonzalez-Hernandez JL,
Conley EJ, Anderson JA, Choi DW, Fenton D, Close TJ,
McGuire PE, Qualset CO, Kianian SF (2004) A chromosome
bin map of 2148 expressed sequence tag loci of wheat homoeo-
logous group 7. Genetics 168:687699
International Rice Genome Sequencing Project (2005) The map-
based sequence of the rice genome. Nature 436:793800
La Rota M, Sorrells ME (2004) Comparative DNA sequence
analysis of mapped wheat ESTs reveals the complexity of
genome relationships between rice and wheat. Funct Integr
Genomics 4:3436
Li W, Gill BS (2002) The colinearity of the Sh2/A1 orthologous
region in rice, sorghum and maize is interrupted and accompanied
by genome expansion in the Triticeae. Genetics 160:11531 162
Linkiewicz AM, Qi LL, Gill BS, Ratnasiri A, Echalier B, Chao S,
Lazo GR, Hummel DD, Anderson OD, Akhunov ED, Dvorak
J, Pathan MS, Nguyen HT, Peng JH, Lapitan NL, Miftahudin,
Gustafson JP, La Rota CM, Sorrells ME, Hossain KG,
Kalavacharla V, Kianian SF, Sandhu D, Bondareva SN, Gill
KS, Conley EJ, Anderson JA, Fenton RD, Close TJ, McGuire
PE, Qualset CO, Dubcovsky J (2004) A 2500-locus bin map of
wheat homoeologous group 5 provides insights on gene dis-
tribution and colinearity with rice. Genetics 168:665676
Liu S, Anderson JA (2003) Targeted molecular mapping of a major
wheat QTL for Fusarium head blight resistance using wheat
ESTs and synteny with rice. Genome 46:817823
Mickelson-Young L, Endo TR, Gill BS (1995) A cytogenetic ladder-
map of wheat homoeologous group-4 chromosomes. Theor
Appl Genet 90:10071011
102
Munkvold JD, Greene RA, Bermudez-Kandianis CE, La Rota CM,
Edwards H, Sorrells SF, Dake T, Benscher D, Kantety R,
Linkiewicz AM, Dubcovsky J, Akhunov ED, Dvorak J,
Miftahudin, Gustafson JP, Pathan MS, Nguyen HT, Matthews
DE, Chao S, Lazo GR, Hummel DD, Anderson OD, Anderson
JA, Gonzalez-Hernandez JL, Peng JH, Lapitan N, Qi LL,
Echalier B, Gill BS, Hossain KG, Kalavacharla V, Kianian SF,
Sandhu D, Erayman M, Gill KS, McGuire PE, Qualset CO,
Sorrells ME (2004) Group 3 chromosome bin maps of wheat
and their relationship to rice chromosome 1. Genetics 168:
639650
Peng JH, Zadeh H, Lazo GR, Gustafson JP, Chao S, Anderson OD,
Qi LL, Echalier B, Gill BS, Dilbirligi M, Sandhu D, Gill KS,
Greene RA, Sorrells ME, Akhunov ED, Dvorak J, Linkiewicz
AM, Dubcovsky J, Hossain KG, Kalavacharla V, Kianian SF,
Mahmoud AA, Miftahudin, Conley EJ, Anderson JA, Pathan
MS, Nguyen HT, McGuire PE, Qualset CO, Lapitan NL (2004)
Chromosome bin map of expressed sequence tags in homoeo-
logous group 1 of hexaploid wheat and homoeology with rice
and Arabidopsis. Genetics 168:609623
Qi LL, Echalier B, Chao S, Lazo GR, Butler GE et al (2004) A
chromosome bin map of 16,000 expressed sequence tag loci
and distribution of genes among the three genomes of polyploid
wheat. Genetics 168:701712
Roberts MA, Reader SM, Dalgliesh C, Miller TE, Foote TN, Fish
LJ, Snape JW, Moore G (1999) Induction and characterization
of Ph1 wheat mutants. Genetics 153:19091918
SanMiguel P, Ramakrishna W, Bennetzen JL, Buss CS, Dubcovsky
J (2002) Transposable elements, genes and recombination in a
215-kb contig from wheat chromosome 5A
m
. Funct Integr
Genomics 2:7080
Sears ER (1954) The aneuploids of common wheat. MO Agr Exp
Sta Res Bull 572:159
Sears ER (1966) Nullisomictetrasomic combinations in hexaploid
wheat. In: Riley R, Lewis KR (eds) Chromosome manipula-
tions and plant genetics. Oliver & Boyd, Edinburgh, pp 2945
Sears ER, Sears LMS (1978) The telocentric chromosomes of
common wheat. In: Ramanujan S (ed) Proceedings of the fifth
international wheat genetics symposium. Indian Society of
Genetics and Plant Breeding, New Delhi, India, pp 389407
Sorrells ME, La Rota M, Bermudez-Kandianis CE, Greene RA,
Kantety R, Munkvold JD, Miftahudin, Mahmoud A, Ma X,
Gustafson PJ, Qi LL, Echalier B, Gill BS, Matthews DE, Lazo
GR, Chao S, Anderson OD, Edwards H, Linkiewicz AM,
Dubcovsky J, Akhunov ED, Dvorak J, Zhang D, Nguyen HT,
Peng J, Lapitan NLV, Gonzalez-Hernandez JL, Anderson JA,
Hossain K, Kalavacharla V, Kianian SK, Choi DW, Close TJ,
Dilbirligi M, Gill KS, Steber C, Walker-Simmons MK, McGuire
PE, Qualset CO (2003) Comparative DNA sequence analysis of
wheat and rice genomes. Genome Res 13:18181827
Van Deynze AE, Dubcovsky J, Gill KS, Nelson JC, Sorrells ME,
Dvorak J, Gill BS, Lagudah ES, McCouch SR, Appels R
(1995a) Molecular-genetics maps for group 1 chromosomes of
Triticeae species and their relation to chromosomes in rice and
oat. Genome 38:4559
Van Deynze AE, Nelson JC, Yglesis ES, Harrington SE, Braga DP,
McCouch SR, Sorrells ME (1995b) Comparative mapping in
grasses. Wheat relationships. Mol Gen Genet 248:744754
Ware DH, Jaiswal P, Ni J, Yap IV, Pan X, Clark KY, Teytelman L,
Schmidt SC, Zhao W, Chang K, Cartinhour S, Stein LD,
McCouch SR (2002) Gramene, a tool for grass genomics. Plant
Physiol 130:16061613
Werner JE, Endo TR, Gill BS (1992) Towards a cytogenetically
based physical map of the wheat genome. Proc Natl Acad Sci U
S A 89:1130711311
Wicker T, Stein N, Albar L, Feuillet C, Schlagenhauf E, Keller B
(2001) Analysis of a contiguous 211 kb sequence in diploid
wheat (Triticum monococcum L.) reveals multiple mechanisms
of genome evolution. Plant J 26:307316
Yan L, Loukoianov A, Tranquilli G, Helguera M, Fahima T,
Dubcovsky J (2003) Positional cloning of wheat vernalization
gene VRN1. Proc Natl Acad Sci U S A 100:62636268
Yan L, Loukoianov A, Blechl A, Tranquilli G, Ramakrishna W,
SanMiguel P, Bennetzen JL, Echenique V, Dubcovsky J (2004)
The wheat VRN2 gene is a flowering repressor down regulated
by vernalization. Science 303:16401644
Yu J, Hu SN, Wang J et al (2002) A draft sequence of the rice
genome (Oryza sativa L. ssp. indica). Science 296:7991
103
... Cereal crops diverged from a common ancestor about 60 million years ago (Devos 2005) but some synteny was retained among them (Lu and Faris 2006). The long arm of chromosome 7H in barley was reported to be well aligned with chromosome 7 in the A, B, and D genomes of wheat (Danilova et al. 2019). ...
... Grass species diverged from a common ancestor about 60 million years ago 46 and have considerable variation in chromosome number, genome size, and sequence. However, most of the genes present in grass species are conserved, and the gene order among them is mainly collinear 47,48 . The long arm of barley chromosome 7H that harbors Rph3 is syntenic with the long arm of chromosome 7 in the wheat A, B, and D genomes 49 . ...
Article
Full-text available
Leaf rust, caused by Puccinia hordei, is an economically significant disease of barley, but only a few major resistance genes to P. hordei (Rph) have been cloned. In this study, gene Rph3 was isolated by positional cloning and confirmed by mutational analysis and transgenic complementation. The Rph3 gene, which originated from wild barley and was first introgressed into cultivated Egyptian germplasm, encodes a unique predicted transmembrane resistance protein that differs from all known plant disease resistance proteins at the amino acid sequence level. Genetic profiles of diverse accessions indicated limited genetic diversity in Rph3 in domesticated germplasm, and higher diversity in wild barley from the Eastern Mediterranean region. The Rph3 gene was expressed only in interactions with Rph3-avirulent P. hordei isolates, a phenomenon also observed for transcription activator-like effector-dependent genes known as executors conferring resistance to Xanthomonas spp. Like known transmembrane executors such as Bs3 and Xa7, heterologous expression of Rph3 in N. benthamiana induced a cell death response. The isolation of Rph3 highlights convergent evolutionary processes in diverse plant-pathogen interaction systems, where similar defence mechanisms evolved independently in monocots and dicots. Leaf rust is an economically significant disease of barley. Here the authors describe cloning of the barley Rph3 leaf rust resistance gene and reveal it encodes a predicted transmembrane protein that is expressed upon infection by Rph3-avirulent Puccinia hordei isolates.
... A study of Australian wheat genotypes showed that ToxA sensitivity was present in 45% of wheat genotypes (See et al. 2018). A similar result was obtained when screening Kazakh wheat germplasm: sensitivity to ToxA was observed in 37% of wheat genotypes and most of them were susceptible to Ptr Identification of diagnostic molecular markers for resistant genes for tan spot is in progress, and this will expedite and may reduce costs associated with breeding for resistance to tan spot (Lu and Faris 2006). From a more practical standpoint, the physical mapping, sequencing, and cloning of the Tsn1 locus allowed the development of a functional marker designated Xfcp623 (Faris et al. , 2013. ...
Article
Full-text available
Tan spot, caused by Pyrenophora tritici-repentis, is a major foliar disease of wheat in Kazakhstan. This study evaluated 103 winter wheat genotypes for tan spot resistance and high-productivity traits in the field in two years. Entries were also characterized using the molecular marker Xfcp623, indicative of Tsn1 gene conferring sensitivity to Ptr ToxA. The proportion of entries insensitive to Ptr ToxA was relatively high (71%). Forty-eight entries with the lowest tan spot severity in the field were confirmed to be insensitivity to Ptr ToxA in the molecular screening. Entries which were resistant under field conditions had similar level of seedling resistance. Principal component analysis biplots showed, that all spike productivity traits were highly correlated. Days to heading (DH) and tan spot severity, as well as the normalized difference vegetation index and DH were strongly correlated. Thousand kernels weight (TKW) and plant height were highly correlated and both were negatively correlated with DH and tan spot severity, indicating that taller cultivars tended to be more resistant. The Xfcp623 negative entries had lower disease severity (by about 40%), flowered up to 2 days later than entries with Tsn1 and had significantly more spikelets per spike, number of grains per spike, TKW and grain weight/spike. The results of the study will be important for increasing the efficiency of breeding based on the elimination of the genotypes with dominant allele Tsn1. Of the 103 entries evaluated, 28 can be directly used in breeding programs to improve tan spot resistance and productivity of winter wheat.
... To solve this problem, the construction of genetic maps using molecular markers based on RNA-seq-derived polymorphism information in the target species is still important. Nonetheless, this putative positional information is expected to be useful for further genetic studies because local synteny is well conserved in each restricted chromosomal region among closely related species (Lu and Faris 2006). ...
Article
Full-text available
A survey of genome-wide polymorphisms between closely related species is required to understand the molecular basis of the evolutionary differentiation of their genomes. Two wild diploid wheat species, namely Triticum monococcum ssp. aegilopoides and T. urartu, are closely related and harbour the Am and A genomes, respectively. The A-genome donor of tetraploid and common wheat is T. urartu, and T. monococcum ssp. monococcum is the cultivated form derived from the wild einkorn wheat subspecies aegilopoides. Although subspecies aegilopoides has been a useful genetic resource in wheat breeding, genome-wide molecular markers for this subspecies have not been sufficiently developed. Here, we describe the detection of genome-wide polymorphisms such as single-nucleotide polymorphisms (SNPs) and insertions/deletions (indels) from RNA sequencing (RNA-seq) data of leaf transcripts in 15 accessions of the two diploid wheat species. The SNPs and indels, detected using the A genome of common wheat as the reference genome, covered the entire chromosomes of these species. The polymorphism information facilitated a comparison of the genetic diversity of einkorn wheat with that of two related diploid Aegilops species, namely, Ae. tauschii and Ae. umbellulata. Cleaved amplified polymorphic sequence (CAPS) markers converted from the SNP data were efficiently developed to confirm the addition of aegilopoides subspecies chromosomes to tetraploid wheat in nascent allohexaploid lines with AABBAmAm genomes. In addition, the CAPS markers permitted linkage map construction in mapping populations of aegilopoides subspecies accessions. Therefore, these RNA-seq data provide information for further breeding of closely related species with no reference genome sequence data.
... Although the extent of microcolinearity (sequence level) varies in certain genomic regions, the extent of macrolinearity (chromosome level) is mostly conserved across cereals. For example, comparisons of genomic regions of wheat and rice have revealed rearrangements in gene order and content (Bennetzen, 2000;Feuillet and Keller, 2002;Li and Gill, 2002;Sorrells et al. 2003;Lu and Faris, 2006), as well as conservation of syntenic (conserved gene order) regions (Yan et al., 2003;Distelfeld et al., 2004;Valárik et al., 2006;Kuraparthy et al., 2008). Rice sequence data from collinear regions of rice and wheat have been used to identify candidate genes for traits of interest and develop markers for fine-mapping genes of interest in wheat (Distelfeld et al., 2004;Valárik et al., 2006). ...
... Although the extent of microcolinearity (sequence level) varies in certain genomic regions, the extent of macrolinearity (chromosome level) is mostly conserved across cereals. For example, comparisons of genomic regions of wheat and rice have revealed rearrangements in gene order and content (Bennetzen, 2000;Feuillet and Keller, 2002;Li and Gill, 2002;Sorrells et al. 2003;Lu and Faris, 2006), as well as conservation of syntenic (conserved gene order) regions (Yan et al., 2003;Distelfeld et al., 2004;Valárik et al., 2006;Kuraparthy et al., 2008). Rice sequence data from collinear regions of rice and wheat have been used to identify candidate genes for traits of interest and develop markers for fine-mapping genes of interest in wheat (Distelfeld et al., 2004;Valárik et al., 2006). ...
Article
Full-text available
Septoria nodorum blotch (SNB) is a foliar disease of wheat caused by the necrotrophic fungal pathogen Parastagonospora nodorum. Research over the last two decades has shown that the wheat-P. nodorum pathosystem mostly follows an inverse gene-for-gene model. The fungus produces necrotrophic effectors (NEs) that interact with specific host gene products encoded by dominant sensitivity (S) genes. When a compatible interaction occurs, a ‘defense response’ in the host leads to programmed cell death thereby provided dead/dying cells from which the pathogen, being a necrotroph, can acquire nutrients allowing it to grow and sporulate. To date, nine S gene-NE interactions have been characterized in this pathosystem. Five NE-encoding genes, SnTox1, SnTox3, SnToxA, SnTox5, and SnTox267, have been cloned along with three host S genes, Tsn1, Snn1, and Snn3-D1. Studies have shown that P. nodorum hijacks multiple and diverse host targets to cause disease. SNB resistance is often quantitative in nature because multiple compatible interactions usually occur concomitantly. NE gene expression plays a key role in disease severity, and the effect of each compatible interaction can vary depending on the other existing compatible interactions. Numerous SNB-resistance QTL have been identified in addition to the known S genes, and more research is needed to understand the nature of these resistance loci. Marker-assisted elimination of S genes through conventional breeding practices and disruption of S genes using gene editing techniques are both effective strategies for the development of SNB-resistant wheat cultivars, which will become necessary as the global demand for sustenance grows.
Article
Full-text available
Tan spot and Septoria nodorum blotch (SNB) are important diseases of wheat caused by the necrotrophic fungi Pyrenophora tritici-repentis and Parastagonospora nodorum, respectively. The P. tritici-repentis necrotrophic effector (NE) Ptr ToxB causes tan spot when recognized by the Tsc2 gene. The NE ToxA is produced by both pathogens and has been associated with the development of both tan spot and SNB when recognized by the wheat Tsn1 gene. Most work to study these interactions has been conducted in common wheat, but little has been done in durum wheat. Here, quantitative trait loci (QTL) analysis of a segregating biparental population indicated that the Tsc2-Ptr ToxB interaction plays a prominent role in the development of tan spot in durum. However, analysis of two biparental populations indicated the Tsn1-ToxA interaction was not associated with the development of tan spot, but was strongly associated with the development of SNB. Parastagonospora nodorum expressed ToxA at high levels in infected Tsn1 plants, whereas ToxA expression in P. tritici-repentis was barely detectable, suggesting that the differences in disease levels associated with the Tsn1-ToxA interaction were due to differences in pathogen expression of ToxA These and previous results together indicate that: 1) the effects of Tsn1-ToxA on tan spot in common wheat can range from nonsignificant to highly significant depending on the host genetic background; 2) Tsn1-ToxA is not a significant factor for tan spot development in durum wheat; and 3) Tsn1-ToxA plays a major role in SNB development in both common and durum wheat. Durum and common wheat breeders alike should strive to remove both Tsc2 and Tsn1 from their materials to achieve disease resistance.
Article
Full-text available
Rice, one of the world's most important food plants, has important syntenic relationships with the other cereal species and is a model plant for the grasses. Here we present a map-based, finished quality sequence that covers 95% of the 389 Mb genome, including virtually all of the euchromatin and two complete centromeres. A total of 37,544 non-transposable-element-related protein-coding genes were identified, of which 71% had a putative homologue in Arabidopsis. In a reciprocal analysis, 90% of the Arabidopsis proteins had a putative homologue in the predicted rice proteome. Twenty-nine per cent of the 37,544 predicted genes appear in clustered gene families. The number and classes of transposable elements found in the rice genome are consistent with the expansion of syntenic regions in the maize and sorghum genomes. We find evidence for widespread and recurrent gene transfer from the organelles to the nuclear chromosomes. The map-based sequence has proven useful for the identification of genes underlying agronomic traits. The additional single-nucleotide polymorphisms and simple sequence repeats identified in our study should accelerate improvements in rice production. Rice (Oryza sativa L.) is the most important food crop in the world and feeds over half of the global population. As the first step in a systematic and complete functional characterization of the rice genome, the International Rice Genome Sequencing Project (IRGSP) has generated and analysed a highly accurate finished sequence of the rice genome that is anchored to the genetic map. Our analysis has revealed several salient features of the rice genome: . We provide evidence for a genome size of 389 Mb. This size estimation is ,260 Mb larger than the fully sequenced dicot plant model Arabidopsis thaliana. We generated 370 Mb of finished sequence, representing 95% coverage of the genome and virtually all of the euchromatic regions. . A total of 37,544 non-transposable-element-related protein-cod-ing sequences were detected, compared with ,28,000–29,000 in Arabidopsis, with a lower gene density of one gene per 9.9 kb in rice. A total of 2,859 genes seem to be unique to rice and the other cereals, some of which might differentiate monocot and dicot lineages. . Gene knockouts are useful tools for determining gene function and relating genes to phenotypes. We identified 11,487 Tos17 retro-transposon insertion sites, of which 3,243 are in genes. . Between 0.38 and 0.43% of the nuclear genome contains orga-nellar DNA fragments, representing repeated and ongoing transfer of organellar DNA to the nuclear genome. . The transposon content of rice is at least 35% and is populated by representatives from all known transposon superfamilies. . We have identified 80,127 polymorphic sites that distinguish between two cultivated rice subspecies, japonica and indica, resulting in a high-resolution genetic map for rice. Single-nucleo-tide polymorphism (SNP) frequency varies from 0.53 to 0.78%, which is 20 times the frequency observed between the Columbia and Landsberg erecta ecotypes of Arabidopsis. . A comparison between the IRGSP genome sequence and the 6.3 £ indica and 6 £ japonica whole-genome shotgun sequence assemblies revealed that the draft sequences provided coverage of 69% by indica and 78% by japonica relative to the map-based sequence. Rice has played a central role in human nutrition and culture for the past 10,000 years. It has been estimated that world rice pro-duction must increase by 30% over the next 20 years to meet projected demands from population increase and economic devel-opment 1 . Rice grown on the most productive irrigated land has achieved nearly maximum production with current strains 1 . Environmental degradation, including pollution, increase in night time temperature due to global warming 2 , reductions in suitable arable land, water, labour and energy-dependent fertilizer provide additional constraints. These factors make steps to maximize rice productivity particularly important. Increasing yield potential and yield stability will come from a combination of biotechnology and improved conventional breeding. Both will be dependent on a high-quality rice genome sequence. Rice benefits from having the smallest genome of the major cereals, dense genetic maps and relative ease of genetic transformation 3 . The discovery of extensive genome colinearity among the Poaceae 4 has established rice as the model organism for the cereal grasses. These properties, along with the finished sequence and other tools under development, set the stage for a complete functional characterization of the rice genome.
Article
A new model for grass functional genomics is described based on Brachypodium distachyon, which in the evolution of the Pooideae diverged just prior to the clade of "core pooid" genera that contain the majority of important temperate cereals and forage grasses. Diploid ecotypes of B. distachyon (2n = 10) have five easily distinguishable chromosomes that display high levels of chiasma formation at meiosis. The B. distachyon nuclear genome was indistinguishable in size from that of Arabidopsis, making it the simplest genome described in grasses to date. B. distachyon is a self-fertile, inbreeding annual with a life cycle of less than 4 months. These features, coupled with its small size (approximately 20 cm at maturity), lack of seed-head shatter, and undemanding growth requirements should make it amenable to high-throughput genetics and mutant screens. Immature embryos exhibited a high capacity for plant regeneration via somatic embryogenesis. Regenerated plants display very low levels of albinism and have normal fertility. A simple transformation system has been developed based on microprojectile bombardment of embryogenic callus and hygromycin selection. Selected B. distachyon ecotypes were resistant to all tested cereal-adapted Blumeria graminis species and cereal brown rusts (Puccinia reconditia). In contrast, different ecotypes displayed resistance or disease symptoms following challenge with the rice blast pathogen (Magnaporthe grisea) and wheat/barley yellow stripe rusts (Puccinia striformis). Despite its small stature, B. distachyon has large seeds that should prove useful for studies on grain filling. Such biological characteristics represent important traits for study in temperate cereals.
Chapter
The discovery that the 21 different chromosomes of common wheat (Triticum aestivum L.) fall into seven homoeologous groups of three (Sears, 1952, 1954) was based primarily on the ability of each tetra-some to compensate for the nullisome of each of the other two chromosomes of the same group. Supporting evidence has come from the finding of Okamoto and Sears (1962) that the pairing in haploids is largely between chromosomes belonging to the same homoeologous group, and from the work of Riley and Kempanna (1963), who found only pairing of homoeologues when increased pairing was induced by the absence of chromosome 5B.
Article
Bennett and Smith (Philosophical Transactions of the Royal Society of London B274:227-274; B334: 309-345) and Bennett, Smith and Heslop-Harrison (Proceedings of the Royal Society of London, B216: 179-199) published lists of nuclear DNA amounts estimated for 1612 angiosperm species collected from 163 sources dated between 1951 and 1986. Subsequently, interest in genome size in angiosperms and its significance has continued, and many new DNA estimates were published during 1986-1994. Their inaccessibility, and the flow of enquiries for such information, shows that a further compilation is needed. This paper presents a supplementary list of nuclear DNA C-values for 105 sources for 899 angiosperm species not listed in the above-mentioned compilations, plus 284 additional estimates for 208 species already listed by them. The data are assembled primarily for reference purposes, with species listed in alphabetical order, rather than by any taxonomic scheme. Some advantages and limitations of flow cytometry, now increasingly used to quantify DNA C-values in plants, are reviewed. Recent reports regarding the occurrence and extent of intraspecific variation in genome size are also discussed. While some examples are real, others reflect technical shortcomings. Work has begun to combine the genome size data compiled in this and the above-mentioned papers into a unified data base, and to present the information in separate lists, with species in alphabetical and systematic orders, respectively. DNA C-values are now known for 1% of the world's angiosperm flora, but improved representation of taxonomic groups, geographical regions and plant life forms is urgently needed.Copyright 1995, 1999 Academic Press
Article
The genomes of grasses are very different in terms of size, ploidy level and chromosome number. Despite these significant differences, it was found by comparative mapping that the linear order (colinearity) of genetic markers and genes is very well conserved between different grass genomes. The potential of such conservation has been exploited in several directions, e.g. in defining rice as a model genome for grasses and in designing better strategies for positional cloning in large genomes. Recently, the development of large insert libraries in species such as maize, rice, barley and diploid wheat has allowed the study of large stretches of DNA sequence and has provided insight into gene organization in grasses. It was found that genes are not distributed randomly along the chromosomes and that there are clusters of high gene density in species with large genomes. Comparative analysis performed at the DNA sequence level has demonstrated that colinearity between the grass genomes is retained at the molecular level (microcolinearity) in most cases. However, detailed analysis has also revealed a number of exceptions to microcolinearity, which have given insight into mechanisms that are involved in grass-genome evolution. In some cases, the use of rice as a model to support gene isolation from other grass genomes will be complicated by local rearrangements. In this Botanical Briefing, we present recent progress and future prospects of comparative genomics in grasses.