ArticlePDF Available

Comparative analysis of medicinal plant Isodon rubescens and its common adulterants based on chloroplast genome sequencing

Frontiers
Frontiers in Plant Science
Authors:

Abstract and Figures

Isodon rubescens (Hemsley) H. Hara is the source of Donglingcao under the monograph Rabdosiae Rubescentis Herba in Chinese Pharmacopoeia. In the local marketplace, this medicine can be accidentally contaminated, deliberately substituted, or mixed with other related species. The contaminants of herbal products are a threat to consumer safety. Due to the scarcity of genetic information on Isodon plants, more molecular markers are needed to avoid misidentification. In the present study, the complete chloroplast (cp) genome of seven species of Isodon was sequenced, de novo assembled and characterized. The cp genomes of these species universally exhibited a conserved quadripartite structure, i.e., two inverted repeats (IRs) containing most of the ribosomal RNA genes and two unique regions (large single copy and small single copy). Moreover, the genome structure, codon usage, and repeat sequences were highly conserved and showed similarities among the seven species. Five highly variable regions (trnS-GCU-trnT-CGU, atpH-atpI, trnE-UUC-trnT-GGU, ndhC-trnM-CAU, and rps15-ycf1) might be potential molecular markers for identifying I. rubescens and its contaminants. These findings provide valuable information for further species identification, evolution, and phylogenetic research of Isodon.
This content is subject to copyright.
Comparative analysis of
medicinal plant Isodon
rubescens and its common
adulterants based on
chloroplast genome sequencing
Zhongyu Zhou
1
, Jing Wang
1,2
, Tingting Pu
1
, Jingjing Dong
1
,
Qin Guan
1
, Jun Qian
1
, Linchun Shi
3
*and Baozhong Duan
1
*
1
College of Pharmaceutical Science, Dali University, Dali, China,
2
College of Life Science, Northeast
Forestry University, Harbin, China,
3
Institute of Medicinal Plant Development, Chinese Academy of
Medical Sciences, Peking Union Medical College, Beijing, China
Isodon rubescens (Hemsley) H. Hara is the source of Donglingcao under the
monograph Rabdosiae Rubescentis Herba in Chinese Pharmacopoeia. In the
local marketplace, this medicine can be accidentally contaminated,
deliberately substituted, or mixed with other related species. The
contaminants of herbal products are a threat to consumer safety. Due to the
scarcity of genetic information on Isodon plants, more molecular markers are
needed to avoid misidentication. In the present study, the complete
chloroplast (cp) genome of seven species of Isodon was sequenced, de novo
assembled and characterized. The cp genomes of these species universally
exhibited a conserved quadripartite structure, i.e., two inverted repeats (IRs)
containing most of the ribosomal RNA genes and two unique regions (large
single copy and small single copy). Moreover, the genome structure, codon
usage, and repeat sequences were highly conserved and showed similarities
among the seven species. Five highly variable regions (trnS-GCU-trnT-CGU,
atpH-atpI,trnE-UUC-trnT-GGU, ndhC-trnM-CAU, and rps15-ycf1) might be
potential molecular markers for identifying I. rubescens and its contaminants.
These ndings provide valuable information for further species identication,
evolution, and phylogenetic research of Isodon.
KEYWORDS
Isodon rubescens, chloroplast genome, species identication, molecular
marker, phylogenetic
Frontiers in Plant Science frontiersin.org01
OPEN ACCESS
EDITED BY
Zemin Ning,
Wellcome Sanger Institute (WT),
United Kingdom
REVIEWED BY
Qingguo Ma,
Research Institute of Forestry, Chinese
Academy of Forestry, China
Gang Yao,
South China Agricultural University,
China
Cornelius Mulili Kyalo,
University of Chinese Academy of
Sciences, China
*CORRESPONDENCE
Baozhong Duan
bzduan@126.com
Linchun Shi
linchun_shi@163.com
SPECIALTY SECTION
This article was submitted to
Plant Bioinformatics,
a section of the journal
Frontiers in Plant Science
RECEIVED 04 September 2022
ACCEPTED 26 October 2022
PUBLISHED 21 November 2022
CITATION
Zhou Z, Wang J, Pu T, Dong J,
Guan Q, Qian J, Shi L and Duan B
(2022) Comparative analysis of
medicinal plant Isodon rubescens and
its common adulterants based on
chloroplast genome sequencing.
Front. Plant Sci. 13:1036277.
doi: 10.3389/fpls.2022.1036277
COPYRIGHT
© 2022 Zhou, Wang, Pu, Dong, Guan,
Qian, Shi and Duan. This is an open-
access article distributed under the
terms of the Creative Commons
Attribution License (CC BY). The use,
distribution or reproduction in other
forums is permitted, provided the
original author(s) and the copyright
owner(s) are credited and that the
original publication in this journal is
cited, in accordance with accepted
academic practice. No use,
distribution or reproduction is
permitted which does not comply with
these terms.
TYPE Original Research
PUBLISHED 21 November 2022
DOI 10.3389/fpls.2022.1036277
Introduction
Isodon rubescens (Hemsley) H. Hara belongs to the family
Lamiaceae (Liu et al., 2017), which is listed in the Chinese
Pharmacopoeia, and the Chinese name is Donglingcao
(Chinese Pharmacopoeia Commission, 2020). The Rabdosiae
Rubescentis Herba has crucial medicinal value in eliminating
inammation, reducing sore throats, and treating malignant
tumors (Xue et al., 2020;Guan et al., 2021). The previous
survey has revealed that I. rubescens is generally contaminated
with common adulterants, such as I. inexus (Thunb.) Kudo,I.
eriocalyx (Dunn) Kudo,I. excisus (Maxim.) Kudo,I.
lophanthoides (Buch.-Ham. ex D.Don) H.Hara, I. coetsa
(Buch.-Ham. ex D.Don) Kudo,andI. japonicus (Burm.f.)
H.Hara (Su et al., 2007;Xia et al., 2013;Ge et al., 2022). These
adulterants are usually of poor quality and some might even be
toxic (Xia et al., 2013;Duan et al., 2018). As the morphology of
these species is similar, interchangeable, and indistinguishable,
the identication of these species remains somewhat
controversial, which may affect their safety and effectiveness in
clinical use (Lazarski et al., 2014;Lin et al., 2019). Therefore, it is
imperative to develop a method for accurately identifying I.
rubescens and its common adulterants.
With the rapid development of molecular technology in
recent years, molecular identication has made signicant
progress in Chinese medicine, especially molecular markers,
which involve sequencing specic sections of the genome to
identify differences between individuals of different species or
populations (Roman and Houston, 2020). Recent studies have
shown high levels of genetic variability within species of Isodon
and an associated lack of phylogenetic resolution between
different species (Zhong et al., 2010;Chen et al., 2021).
Universal DNA markers, such as ITS, psbA-trnH, trnD-trnT,
rpl32-trnL, and ETS have been used to identify I. rubescens and
its related taxa (Xia et al., 2013;Yu et al., 2014;Chen et al., 2021).
Moreover, according to Harris and Klooster (2011), they found
that 11 microsatellite loci amplify reliably and are sufciently
variable for studying population genetics in I. rubescens.
However, some common adulterants were not included in
these studies. Therefore, more scienticandaccurate
identication methods must be developed. The chloroplast
(cp) is an essential organelle that plays a crucial role in plant
photosynthesis and biochemical processes (Bendich, 2004).
Compared with the gene fragments, the cp genome is
relatively conserved and slightly varied (Drouin et al., 2008;
Ferrarini et al., 2013), which has been widely used for identifying
Paris,Polygonatum, and its contaminants (Kawabe and
Miyashita, 2003;Jiang et al., 2022;Wang et al., 2022).
Recently, although the complete plastid genomes of I.
rubescens has reported by Lian et al. (2022) and Yue et al.,
(2021), the focus of these papers was to compare the intraspecic
variation or characterize one genome information. However, the
use of cp genomes for comparing Isodon species with their
common adulterants has not been reported.
Our study aims to: (i) contribute new fully-sequenced cp
genomes in Isodon and improve the understanding of the overall
structure of these genomes, (ii) perform comparative analyses
and elucidate the phylogenetic evolution of Isodon cp genomes,
and (iii) screen molecular markers to differentiate I. rubescens
and its adulterants. In the current work, the complete cp
genomes of seven Isodon species were sequenced, de novo
assembled, and annotated. These genomes were then used in a
comparative analysis of genome structure and evolution
relationships. The data acquired in this study increase the
genomic resources available for the Isodon genus and provide
valuable information support for the phylogenetic analysis and
identication of the Isodon genus, as well as safe medical
applications of I. rubescens. This study is the rst
comprehensive research on identifying I. rubescens and its
adulterants based on the cp genomes.
Materials and methods
Plant and DNA resources
The fresh, healthy leaves for seven species of I. rubescens
and its common adulterants, including I. inexus,I. eriocalyx,
I. excisus,I. lophanthoides,I. coetsa,andI. japonicus,were
collected from the Germplasm Resource Garden (Yunnan,
China, 24°4955N, 102°4858E), Kunming Zhifen
Biotechnology Co., Ltd. One individual sample of
approximately 1.0 g of fresh leaves per plant species was
gathered and stored in an ice-lled cooler or refrigerator (4°
C) until DNA extractions could be performed. Professor
Baozhong Duan authenticated the specimens, and the
detailed sample information is available in Supplementary
Table S1 and Figure S1. The voucher specimens were
deposited in the Dali University herbarium. Genomic DNA
was extracted from tissue samples using the Plant Genomic
DNA kit (Tiangen, Beijing, China) following the
manufacturers protocol. The extracted DNA was quantied
on high-sensitivity Qubit 4.0 uorometry (Life Technologies,
Inc.),andallPCRproductswereexaminedforthepresenceof
amplied products in agarose gels.
DNA sequencing, assembly,
and annotation
For sequencing library preparation, we used thirty
microlitres of high-quality (>100 ng/mL) DNA per individual.
All libraries were sequenced on the Illumina NovaSeq system
(Illumina, San Diego, CA). Paired-end sequence reads were
Zhou et al. 10.3389/fpls.2022.1036277
Frontiers in Plant Science frontiersin.org02
trimmed to remove low-quality bases and adapter sequences in
the Toolkit_v2.3.3 software. The cp genomes were assembled by
GetOrganelle v.1.6.4, exploiting Bowtie2 v.2.4.4, SPAdes v.3.13.0,
and Blast v.2.5.0 as dependencies (Jin et al., 2019). After
assembly, two online annotation tools, CpGAVAS2 and GeSeq
were used to annotate the circular cp genomes (Wick et al.,
2015), and the annotated cp genome sequences were submitted
to the GenBank database of the National Center for
Biotechnology Information (NCBI) (Table 1). Gene maps of
the cp genomes were produced with the online IRscope (https://
irscope.shinyapps.io/Chloroplot/).
Repeat analysis
The GC content was analyzed using the Geneious 9.0.2
software (Kearse et al., 2012). Four kinds of the dispersed
repeat sequence, including Forward (F), Reverse (R),
Palindromic (P), and Complementary (C), were detected using
the REPuter program (https://bibiserv.cebitec.uni-bielefeld.de/
reputer/)(Kurtz, 2001). The criteria for repeat determination
include a minimum repeat size of 20 bp with a similarity
between repeat pairs of 90% by putting edit value 3.
Furthermore, MISA software (http://pgrc.ipk-gatersleben.de/
misa/) was used to evaluate the simple sequence repeats (SSRs)
with the parameters of 10for mono, 5for di-, 4for tri-, and
3for tetra-, penta-, and hexanucleotide motifs (Beier
et al., 2017).
Comparative and phylogenetic analyses
Relative synonymous codon usage (RSCU) and codon usage
values were analyzed by CodonW v.1.4.2. Moreover, the RSCU
values were shown in a heatmap by Tbtools (Chen et al., 2020).
The contraction and expansion of IR regions at the junctions
were visualized using the online IRscope (https://irscope.
shinyapps.io/irapp/)(Amiryouseet al., 2018). The mVISTA
program in Shufe LAGAN mode was used to compare the cp
genomes of seven species of Isodon (Frazer et al., 2004), using the
annotation information of I. serra (GenBank NC064127) as a
reference. Additionally, the nucleotide variability (Pi) across the
cp genome sequences was assessed using DnaSP v.6.12.03, with a
window length of 600 sites and a step size of 200 sites (Rozas
et al., 2017). A value of Pi higher than 0.014 was recommended
as mutational hotspots (Cui et al., 2019a). In phylogenetic
analyses, 36 species, including 29 species downloaded from
NCBI (Table S2), were used to infer the phylogeny.
Simultaneously, two species, Scrophularia dentata (GenBank
NC036942) and S. henryi (GenBank NC036943) were used as
outgroups (Chase et al., 2016). All these sequences were aligned
using MAFFT, and alignments were checked manually. The
Maximum-likelihood (ML) tree was reconstructed with an
IQtree using default parameters of 1000 iterations, 1000
replications, and best-t model selection (Nguyen et al., 2015).
Results and discussion
Genome structure
The cp genome size and gene content of seven Isodon species
are listed in Table 1. All the genome sizes were similar to that of
the reference genome, i.e., around 150 kb. I. inexu had the
largest genome, with a size of 152,701 bp, and I. japonicus had
the smallest genome, with a size of 152,208 bp. Moreover, the
length of large single copy (LSC) regions ranged from 83,079 bp
(I. lophanthoides) to 83,577 bp (I. rubescens), small single copy
(SSC) regions ranged from 17,656 bp (I. excisus) to 17,729 bp (I.
lophanthoides), and IRa and IRb regions ranged from 25,700 bp
(I. lophanthoides and I. japonicus) to 25,728 bp (I. excisus). In
addition, the cp genomes of Isodon contained 132 genes,
including 88 protein-coding genes, 8 rRNA genes, and 36
tRNA genes, 18 of which are repeated as members of IR
regions (Figure 1), which were congruent and largely
concordant with recent studies of Isodon (Yue et al., 2021;
Lian et al., 2022). It is worth noting that the chIB,chIL, and
ycf68 were lost during evolution, typically in most angiosperms
(Millen et al., 2001). Meanwhile, the number and types of
introns were similar among the seven Isodon species (Table
TABLE 1 Information of Isodon cp genome features.
Genome
characterristics
Total length
(bp)
GC content
(%)
AT content
(%)
LSC length
(bp)
SSC length
(bp)
IR length
(bp)
GenBank
accession
I. inexus 152695 37.63 62.37 83558 17663 25722 OM808733
I. eriocalyx 152657 37.63 62.37 83546 17657 25727 OM808731
I. excisus 152643 37.64 62.36 83531 17656 25728 OM808732
I. lophanthoides 152208 37.61 62.39 83079 17729 25700 OM808735
I. japonicus 152238 37.60 62.40 83139 17699 25700 OM808734
I. coetsa 152441 37.63 62.37 83289 17670 25726 OM808730
I. rubescens 152690 37.62 62.38 83577 17661 25726 OM808736
Zhou et al. 10.3389/fpls.2022.1036277
Frontiers in Plant Science frontiersin.org03
S3). Each of the 18 genes contained one intron, including trnA-
UGC (×2), trnI-GAU (×2), rpl2 (×2), and ndhB (×2) were located
in the IR, and the genes (trnK-UUU,rps16,trnT-CGU,atpF,
rpoC1,trnL-UAA,petB,petD, and rpl16) were located in the LSC,
and the ndhA was the only present in the SSC region.
Furthermore, ycf3 and clpP have two introns, respectively,
consistent with previous genetic studies (Jiang et al., 2022).
The GC content ranged from 37.60% to 37.64% and varied
among the different regions. The ndings were identical to those
of other Isodon species (Yue et al., 2021), which is not
unexpected, given that the angiosperms possess the highly
conserved cp character at the genus level (Meng et al., 2018;
Cui et al., 2019b;Shahzadi et al., 2020).
Codon usage bias of cp genomes
The analyses of relative synonymous codon usage (RSCU)
provide information about the encoding frequency of codons for
an amino acid. As shown in Table S4, the results of RSCU
revealed that the cp protein sequences encoded 21 amino acids,
and 30 codons were used frequently in Isodon species, consistent
with recent codon usage studies (Lian et al., 2022). Moreover,
amino acid frequency analyses conrmed the highest frequency
of leucine and isoleucine, whereas cysteine was a rare amino acid
(Table S4), which was supported by other researchers based on
codon usage bias (Lian et al., 2022). Notably, the use of start
codon AUG for methionine and UGG for tryptophan in the
Isodon genus showed no codon usage bias, consistent with
previous reports of Isodon (Lian et al., 2022). In general, we
found high similarities in codon usage and amino acid frequency
among the seven species of Isodon. Furthermore, as illustrated in
Figure 2, higher RSCU values (1) were found for codons with A
or T at the 3position, which showed high encoding efcacy.
Similar ndings were reported for codon usage and amino acid
frequency in the cp genomes of other angiosperms, which may
be attributable to the high overall AT content in the cp genome
(Kawabe and Miyashita, 2003;Jiang et al., 2022;Wang et al.,
2022). In addition, the codon usage was similar in I. rubescens,I.
lophanthoides, and I. japonicaus, whereas the codon usage of I.
eriocalyx and I. excisus was relatively close to that of I. inexus
and I. coetsa (Figure 2).
It was shown that the GC content of synonymous third
codons positions (GC3s) was closely related to codon bias, which
provided the foundation for assessing the codon usage pattern
(Shang et al., 2011). In our study, the values of GC3s ranged
from 29.4% to 29.5%, demonstrating that the genus Isodon had a
greater preference for the A/U ending codons, which, along with
the highly conserved GC content in seven Isodon cp genomes,
suggest that natural selection had a profound impact on codon
usage patterns (Zhang et al., 2018). In addition, the values for the
effective number of codons ranged from 51.73 to 51.81, and both
FIGURE 1
Cp genome map of Isodon.
Zhou et al. 10.3389/fpls.2022.1036277
Frontiers in Plant Science frontiersin.org04
the codon adaptation index and frequency of optimal were less
than 0.5. These ndings indicated a slight bias of codon usage in
the seven Isodon species.
Repeat analysis
Repetitive sequences play a crucial role in the rearrangement
and stability of cp genomes (Rahemi et al., 2012;Kumar et al.,
2015). A total of 51, 50, 50, 51, 50, 46, and 53 SSRs were
identied in I. inexus,I. eriocalyx,I. excisus,I. lophanthoides,I.
japonicus,I. coetsa, and I. rubescens, respectively (Figure 3).
More than half of SSRs (62.00% 69.60%) were mononucleotide
A/T motifs, which is consistent with the previous works in the cp
genomes of angiosperms (Qian et al., 2013;Liang et al., 2019).
The second was dinucleotide (11.65% 18.00%) with a
predominant motif of AT/TA, followed by tetranucleotide
repeats (14.00% 19.61%) with a predominant motif of
AAAT/ATTT and AATC/ATTG, trinucleotide (1.89%
2.17%), pentanucleotide (1.89% 2.00%) with a predominant
motif of AATAT/ATATT, AATAG/ATTCT. Hexanucleotides
(1.96% 4.00%) were absent in the cp genomes of I. eriocalyx,I.
coetsa, and I. rubescens. This result was consistent with previous
ndings that most SSRs include mono- and dinucleotide repeats,
while tri-, tetra-, penta-, and hexanucleotide repeat sequences
exhibit lower frequencies (Dong et al., 2018).
Moreover, oligonucleotide repeats analysis of four types of
repeats in the cp genome, including F, R, P, and C, was
performed by REPuter. As illustrated in Figure 4, the number
of repeat types varied and presented random permutations
among the cp genomes of seven species. The number of
repeats varied among these species, but most repeat sequences
existed in 20 29 bp, which was supported by recent literature
(Lian et al., 2022). Meanwhile, the abundance of F and P repeats
was higher than that of R and C repeats. A total of 36 F repeats
and 47 P repeats were observed in I. rubescens, 38 and 48 in I.
inexus, 34 and 39 in I. eriocalyx, 39 and 48 in I. excisus, 38 and
43 in I. lophanthoides, 39 and 45 in I. japonicus, 42 and 47 in I.
coetsa, respectively. These repeats play a crucial role in the
generation of substitutions and indels, which makes them
important for detecting mutational hotspots (McDonald et al.,
2011;Ahmed et al., 2012)
FIGURE 2
Heat map of the RSCU values among Isodon cp genome.
Zhou et al. 10.3389/fpls.2022.1036277
Frontiers in Plant Science frontiersin.org05
Inverted repeats
The contraction and expansion of IRs are regarded as crucial
evolutionary phenomena that result in the pseudogenes, gene
duplication, or the reduction of duplicate genes to a single copy
(Abdullah et al., 2020). As illustrated in Figure 5, the rpl22 gene
was present in the LSC region, and rpl2 was entirely in the IRb
region, consistent with a previous study (Lian et al., 2022). A
truncated copy of rps19 gene was observed in all species at the
IRb/LSC junction except for I. ternifolious, which starts in LSC
and integrates into the IRb regions, while rp12 is exclusively
located in the IRb region. In monocotyledons, the rps19 gene is
in the IR region (Ahmed et al., 2012;Henriquez et al., 2020), but
our ndings show that things are different in Lamiaceae.
Additionally, the ndhF gene was also found at the junction of
IRb/SSC and integrated into the IRb with a size ranging from 42
to 46 kb. At the IRb/LSC junction, another truncated copy of
ycf1 gene was observed in all species except for I. ternifolious and
I. serra. Contrary to the ndings of Henriquez et al. (2020), the
de novo assembled genomes of seven Isodon species did not
include a ycf pseudogene at the IRa/SSC junction. Neither an
internal stop codon nor double peaks were observed in the
electropherograms, indicating that the seven Isodon species lack
amplied pseudogenes. In addition, psbA and trnH were in the
LSC, and rpl2 in the IRa.
Genome comparison and
nucleotide diversity
A comparison of overall sequence variation showed that the
cp genome of Isodon is highly conserved, and the coding region
is more conserved than non-coding regions. Except for ndhF,
ycf1, and ycf2 genes, all protein-coding genes showed a highly
conserved character (Figure 6); the intergenic spacers (IGS) with
the highest divergence were trnH-GUG-psbA,trnQ-UUG-psbK,
FIGURE 3
The number and type of SSRs in the cp genome of Isodon.
FIGURE 4
Repeat sequences detected in Isodon cp genome. P, F, C, and R indicate the repeat types: R (Reverse repeats), P (Palindromic repeats), F
(Forward repeats), C (Complement repeats).
Zhou et al. 10.3389/fpls.2022.1036277
Frontiers in Plant Science frontiersin.org06
trnS-GCU-trnT-CGU,atpH-atpI,trnE-UUC-trnT-GGU, psaA-
ycf3, ndhC-trnM-CAU, psbH-petB, and rps15-ycf1,as
predicted. In addition, the sliding window analysis revealed
that seven regions, including rps16-trnQ-UUG,ndhF,ndhB,
ccsA-ndhD,ndhA,ndhH,andycf1 genes, exhibited higher
nucleotide diversity values (> 0.014, Figure 7); the IR regions
exhibited lower sequence divergence than LSC and SSC regions,
consistent with the previous comparisons of cp genomes (Jiang
et al., 2022;Wang et al., 2022). Among these 16 high
polymorphic regions, 11 intergenic spacers were found in the
trnH-GUG-psbA,trnQ-UUG-psbK,trnS-GCU-trnT-CGU,atpH-
atpI,trnE-UUC-trnT-GGU, psaA-ycf3, ndhC-trnM-CAU, psbH-
petB, rps15-ycf1,rps16-trnQ-UUG,andccsA-ndhD.Itis
noteworthy that the atpH-atpI,rps16-trnQ-UUG, and ndhC-
trnM-CAU were also identied as mutational hotspots in a
previous study (Lian et al., 2022). IGS was considered one of
the evolutionary hotspots that exhibited more signicant rates of
nucleotide substitutions and indel mutations (Drummond,
2008). Therefore, these IGS might be undergoing more rapid
nucleotide substitution at the species level, which could be
served as a potential molecular marker for application in
phylogenetic analyses of the Isodon genus.
Species authentication analysis based
on IGS
Intergenic spacer regions are the most frequently used cp
markers for phylogenetic studies at lower taxonomic levels in
plants (Shaw et al., 2005), as they are regarded as more variable
and could provide more phylogenetically informative characters.
To nd candidate markers for identifying I. rubescens and its
adulterants, the 11 IGSs were extracted using the PhyloSuite
v1.2.2 from 14 Isodon species (Zhang et al., 2019). Each of the 11
IGSs was subject to maximum likelihood analyses in IQtree
(Nguyen et al., 2015). As illustrated in Figure S2.1-S2.11,ve
IGSs, including trnS-GCU-trnT-CGU,atpH-atpI,trnE-UUC-
trnT-GGU,ndhC-trnM-CAU,andrps15-ycf1 could be
FIGURE 5
Comparisons of the borders of LSC, SSC, and IRa/b regions among the 11 Isodon plastid genomes. The numbers represent the distance
between the gene ends and the border sites, and the numbers below represent the length of the LSC, SSC, and IRa/b regions. This Figure is not
to scale.
Zhou et al. 10.3389/fpls.2022.1036277
Frontiers in Plant Science frontiersin.org07
distinguished I. rubescens from I. japonicus and I. lophanthoides.
Whereas the rest of the Isodon species cannot be distinguished
based on these IGSs, bootstrap values for the relationship among
these major clades were weak (<70%). These results are partially
in line with the previous study, which found that the atpH-atpI
and ndhC-trnM-CAU could be potential molecular markers for
distinguishing Isodon species (Lian et al., 2022). Moreover, the
remaining fragment, including trnS-GCU-trnT-CGU, trnE-
UUC-trnT-GGU,andrps15-ycf1 also reported as potential
markers for other species identication (Kim et al., 2020;Li
et al., 2020;Alzahrani, 2021). Although the previous study has
revealed that universal DNA barcodes (e.g., psbA-trnH) could
differentiate I. rubescens from their related species (Xia et al.,
2013), some common adulterants were not included in this
FIGURE 6
Global comparison of complete genomes of Isodon.
FIGURE 7
Sliding window analysis of Isodon cp genome. The X-axis represents the midpoint of the window; The Y-axis represents nucleotide diversity
values. Window length: 600 bp; step size: 200 bp.
Zhou et al. 10.3389/fpls.2022.1036277
Frontiers in Plant Science frontiersin.org08
study. Furthermore, the comparative analysis showed that the
screened IGSs exhibit higher variability than psbA-trnH. These
IGSs could theoretically distinguish the selected 7 species,
whereas a much more detailed investigation of identication
accuracy and amplication efciency needs to be accomplished,
and more experimental evidence is needed. Moreover, the ML
phylogenetic tree was also inferred using a combination of these
ve IGSs. The results (Figure S3) showed that I. rubescens
(GenBank NW018469) and I. rubescens (GenBank NC053708)
clustered together as sister group groups and closely related to I.
excisus. Simultaneously, I. rubescens (GenBank OM808736) was
located in independent branches in the phylogeny, and well-
supported sister relationship between I. rubescens and I.
japonicus +I. lophanthoides (100% B/S). These results
indicated that the combination of ve IGSs could effectively
discriminate I. rubescens from its common adulterants.
Species identication and
phylogenetic analysis
The ML phylogenetic tree was inferred using 36 species,
with Scrophularia as the outgroup. As illustrated in Figure 8,on
the consensus trees, most nodes were supported with
maximum support (100% bootstrap support). Ocimoideae
and Lamioideae were sister taxa within the three subfamilies,
and Ajugoideae was sister to the clade containing Ocimoideae
+ Lamioideae. The phylogenetic trees crown was occupied by
the subfamily Ocimoideae, which included the genera Isodon
and Ocimum.Thesendings conrm the position of Isodon
within the Lamiaceae and are consistent with previous
phylogenomic studies (Chen et al., 2021;Yue et al., 2021).
The genus Isodon is further divided into three clades: (i) clade
A, including I. rubescens (GenBank NC053708, GenBank
NW018469), I. excise,I. serra,I. nervosus,I. amethystoides,I.
coetsa,I. inexus,I. eriocalyx,andI. rubescens (GenBank
NW376483); (ii) clade B, only I. ternifolious; (iii) clade C,
including I. japonicas,I. lophanthoides,andI. rubescens
(GenBank OM808736). It is worth noting that the I.
rubescens (GenBank OM808736) clustered differently from
the other three samples (GenBank NC053708, GenBank
NW018469, GenBank NW376483) of the same species in the
phylogeny, which is consistent with the nding of other
researcher based on rps16,trnL-trnF, and ITS sequence
(Harris et al., 2012). Moreover, Lian et al. (2022) also found
that the samples of I. rubescens from different geographical
FIGURE 8
ML phylogenetic tree reconstruction containing the cp genomes of 36 plants. The Sceophularia species were set as the outgroup.
Zhou et al. 10.3389/fpls.2022.1036277
Frontiers in Plant Science frontiersin.org09
areas were not recovered as monophyletic and were placed in
different branches in the previous report, which is well
distinguished by sampling locations, suggesting that the
intraspecicdiversitywaspresentinI. rubescens. This
phenomenon might be explained by the fact that the
geographical area of origin might inuence the variation of I.
rubescens. Another previous study supported the same
conclusion; the Artemisia argyi collected from different
geographical areas display high intraspecic diversity in the
cp genome (Chen et al., 2022). In addition, clade B only
includes a species of I. ternifolious, which was supported by
the ndings of other researchers based on trnD-trnT,psbA-
trnH,rpl32-trnL,trnL-trnF,rps16,andnrITS (Zhong et al.,
2010;Yu et al., 2014;Chen et al., 2021). Moreover, maximum
likelihood analysis demonstrated that I. rubescens (GenBank
OM808736) was located in independent branches in the
phylogeny and deeply nested within clade C, and the sister
relationship between I. rubescens and I. japonicus +I.
lophanthoides was highly supported (100% B/S), indicating
that the cp genome could discriminate I. rubescens from its
common adulterants. The ndings supported the results of
morphological classication reported by Ge et al. (2022) and
Xia et al. (2013).
Conclusion
In the current study, the complete cp genome of seven
species of Isodon was de novo assembled from Illumina high
throughput sequencing reads, and cp genome sequences of I.
inexus,I. eriocalyx,I. excisus, and I. coetsa were reported for the
rst time. These cp genomes were generally conserved and
exhibited similar gene content and genomic structure. Five
highly variable cp loci, including trnS-GCU-trnT-CGU,atpH-
atpI,trnE-UUC-trnT-GGU, ndhC-trnM-CAU, and rps15-ycf1,
were identied, which could serve as potential markers for
identifying I. rubescens and its common adulterants. In
conclusion, our study provides a powerful tool and valuable
scientic reference for the safety and effectiveness of clinical drug
use, and it also contributes to the bioprospecting and
conservation of Isodon species.
Data availability statement
Thedatapresentedinthestudyaredepositedinthe
GenBank repository, accession numbers were from OM808730
to OM808736.
Author contributions
ZZ, JW, TP, and BD participated in the conception and
design of the research. JW, QG, and BD collected the species. JQ,
LS, and JD are responsible for analyzing and processing data. ZZ
wrote the manuscript. The paper was revised by JQ, LS, and BD.
All authors contributed to the article and approved the
submitted version.
Funding
This work was supported by the Yunnan academician expert
workstation (202205AF150026, 202105AF150053), the key
technology projects in the Yunnan province of China
(202002AA100007), and the Yunnan Xingdian talent support
plan (YNWR-QNBJ-2020251).
Acknowledgments
We would like to thank Ms. Qingshu Yang for her assistance
in obtaining specimens for this study. We also thank Northeast
Forestry University and the China Academy of Chinese Medical
Sciences for technical assistance.
Conict of interest
The authors declare that the research was conducted in the
absence of any commercial or nancial relationships that could
be construed as a potential conict of interest.
Publishers note
All claims expressed in this article are solely those of the
authors and do not necessarily represent those of their afliated
organizations, or those of the publisher, the editors and the
reviewers. Any product that may be evaluated in this article, or
claim that may be made by its manufacturer, is not guaranteed
or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found
online at: https://www.frontiersin.org/articles/10.3389/
fpls.2022.1036277/full#supplementary-material
Zhou et al. 10.3389/fpls.2022.1036277
Frontiers in Plant Science frontiersin.org10
References
Abdullah,, Mehmood, F., Shahzadi, I., Waseem, S., Mirza, B., Ahmed, I., et al.
(2020). Chloroplast genome of Hibiscus rosa-sinensis (Malvaceae): comparative
analyses and identication of mutational hotspots. Genomics (San Diego Calif.)
112, 581591. doi: 10.1016/j.ygeno.2019.04.010
Ahmed, I., Biggs, P. J., Matthews, P. J., Collins, L. J., Hendy, M. D., and Lockhart,
P. J. (2012). Mutational dynamics of aroid chloroplast genomes. Genome Biol. Evol.
4, 13161323. doi: 10.1093/gbe/evs110
Alzahrani, D. A. (2021). Complete chloroplast genome of Abutilon fruticosum:
genome structure, comparative and phylogenetic analysis. Plants 10, 270. doi:
10.3390/plants10020270
Amiryouse, A., Hyvönen, J., and Poczai, P. (2018). IRscope: an online program
to visualize the junction sites of chloroplast genomes. Bioinformatics 34, 3030
3031. doi: 10.1093/bioinformatics/bty220
Beier, S., Thiel, T., Münch, T., Scholz, U., and Mascher, M. (2017). MISA-web: a
web server for microsatellite prediction. Bioinformatics 33, 25832585.
doi: 10.1093/bioinformatics/btx198
Bendich, A. J. (2004). Circular chloroplast chromosomes: the grand illusion.
Plant Cell 16, 16611666. doi: 10.1105/tpc.160771
Chase, M. W., Christenhusz, M. J. M., Fay, M. F., Byng, J. W., Judd, W. S., Soltis,
D. E., et al. (2016). An update of the angiosperm phylogeny group classication for
the orders and families of owering plants: APG IV. Bot. J. Linn. Soc 181, 120.
doi: 10.1111/boj.12385
Chen, C., Chen, H., Zhang, Y., Thomas, H. R., Frank, M. H., He, Y., et al. (2020).
TBtools: An integrative toolkit developed for interactive analyses of big biological
data. Mol. Plant 13, 11941202. doi: 10.1016/j.molp.2020.06.009
Chen, Y. P., Huang, C. Z., Zhao, Y., and Xiang, C. L. (2021). Molecular and
morphological evidence for a new species of Isodon (Lamiaceae) from southern
China. Plant Diversity 43, 5462. doi: 10.1016/j.pld.2020.06.004
Chen, C. J., Miao, Y. H., Luo, D. D., Li, J. X., Wang, Z. X., Luo, M., et al. (2022).
Sequence characteristics and phylogenetic analysis of the Artemisia argyi
chloroplast genome. Front. Plant Sci. 13. doi: 10.3389/fpls.2022.906725
Chinese Pharmacopoeia Commission (2020). Pharmacopoeia of the peoples
republic of China (Beijing: China Med. Sci. Press).
Cui, Y. X., Chen, X. L., Nie, L. P., Sun, W., Hu, H. Y., Lin, Y. L., et al. (2019a).
Comparison and phylogenetic analysis of chloroplast genomes of three medicinal
and edible Amomum species. Int. J. Mol. Sci. 20, 4040. doi: 10.3390/ijms20164040
Cui, Y. X., Nie, L. P., Sun, W., Xu, Z. C., Wang, Y., Yu, J., et al. (2019b).
Comparative and phylogenetic analyses of ginger (Zingiber ofcinale) in the family
zingiberaceae based on the complete chloroplast genome. Plants (Basel) 8, 283295.
doi: 10.3390/plants8080283
Dong, W. L., Wang, R. N., Zhang, N. Y., Fan, W. B., Fang, M. F., and Li, Z. H.
(2018). Molecular evolution of chloroplast genomes of orchid species: insights into
phylogenetic relationship and adaptive evolution. Int. J. Mol. Sci. 19, 716736.
doi: 10.3390/ijms19030716
Drouin, G., Daoud, H., and Xia, J. N. (2008). Relative rates of synonymous
substitutions in the mitochondrial, chloroplast and nuclear genomes of seed plants.
Mol. Phylogenet. Evol. 49, 827831. doi: 10.1016/j.ympev.2008.09.009
Drummond, C. S. (2008). Diversication of Lupinus (Leguminosae) in the
western new world: derived evolution of perennial life history and colonization of
montane habitats. Mol. Phylogenet. Evol. 48, 408421. doi: 10.1016/
j.ympev.2008.03.009
Duan, B. Z., Wang, Y. P., Fang, H. L., Xiong, C., Li, X. W., Wang, P., et al. (2018).
Authenticity analyses of rhizoma par idis using barcoding coupled with high
resolution melting (Bar-HRM) analysis to control its quality for medicinal plant
product. Chin. Med-Uk 13, 818. doi: 10.1186/s13020-018-0162-4
Ferrarini, M., Moretto, M., Ward, J. A., S
urbanovski, N., Stevanovic, V., Giongo,
L., et al. (2013). An evaluation of the PacBio RS platform for sequencing and de
novo assembly of a chloroplast genome. BMC Genomics 14, 670. doi: 10.1186/1471-
2164-14-670
Frazer, K. A., Pachter, L., Poliakov, A., Rubin, E. M., and Dubchak, I. (2004).
VISTA: computational tools for comparative genomics. Nucleic Acids Res. 32,
W273W279. doi: 10.1093/nar/gkh458
Ge, J. W., Li, P., Zhang, T. T., Gao, F., Wang, W. Y., Wang, J. N., et al. (2022).
Molecular identication of a newly discovered medicinal plant, Isodon rubescens,in
Shandong using DNA barcoding. Shandong Sci. 35, 1520. doi: 10.3976/j.issn.1002-
4026.2022.04.003
Guan, Y., Liu, X., Pang, X., Liu, W., Yu, G. X., Li, Y. R., et al. (2021). Recent
progress of oridonin and its derivatives for cancer therapy and drug resistance.
Med. Chem. Res. 30, 17951821. doi: 10.1007/s00044-021-02779-6
Harris, E. S. J., Cao, S. G., Schoville, S. D., Dong, C. M., Wang, W. Q., Jian, Z. Y.,
et al. (2012). Selection for high oridonin yield in the chinese medicinal plant Isodon
(Lamiaceae) using a combined phylogenetics and population genetics approach.
PLoS One 7, e50753. doi: 10.1371/journal.pone.0050753
Harris, E. S. J., and Klooster, M. R. (2011). Development of microsatellite
markers for the medicinal plant Isodon rubescens (Lamiaceae) and related species.
Am. J. Bot. 98, e293e295. doi: 10.3732/ajb.1100190
Henriquez, C. L., Abdullah Ahmed, I., Carlsen, M. M., Zuluaga, A., Croat, T. B.,
et al. (2020). Evolutionary dynamics of chloroplast genomes in subfamily Aroideae
(Araceae). Genomics 112, 23492360. doi: 10.1016/j.ygeno.2020.01.006
Jiang, Y., Miao, Y. J., Qian, J., Zheng, Y., Xia, C. L., Yang, Q. S., et al. (2022).
Comparative analysis of complete chloroplast genome sequences of ve
endangered species and new insights into phylogenetic relationships of Paris.
Gene 833, 146572. doi: 10.1016/j.gene.2022.146572
Jin, J. J., Yu, W. B., Yang, J. B., Song, Y., DePamphilis, C. W., Yi, T. S., et al.
(2019). GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of
organelle genomes. Genome Biol. 21, 241272. doi: 10.1101/256479
Kawabe, A., and Miyashita, N. T. (2003). Patterns of codon usage bias in three
dicot and four monocot plant species. Genes Genet. Syst. 78, 343352. doi: 10.1266/
ggs.78.343
Kearse, M., Moir, R., Wilson, A., Stones-Havas, S., Cheung, M., Sturrock, S., et al.
(2012). Geneious basic: an integrated and extendable desktop software platform for
the organization and analysis of sequence data. Bioinformatics 28, 16471649.
doi: 10.1093/bioinformatics/bts199
Kim, G., Lim, C. E., Kim, J., Kim, K., Lee, J. H., Yu, H., et al. (2020). Comparative
chloroplast genome analysis of Artemisia (Asteraceae) in East Asia: Insights into
evolutionary divergence and phylogenomic implications. BMC Genomics 21, 415
432. doi: 10.1186/s12864-020-06812-7
Kumar, M., Choi, J., Kumari, N., Pareek, A., and Kim, S. (2015). Molecular
breeding in Brassica for salt tolerance: importance of microsatellite (SSR) markers
for molecular breeding in Brassica.Front. Plant Sci. 6. doi: 10.3389/fpls.2015.00688
Kurtz, S. (2001). REPuter: the manifold applications of repeat analysis on a
genomic scale. Nucleic Acids Res. 29, 46334642. doi: 10.1093/nar/29.22.4633
Lazarski, K. E., Moritz, B. J., and Thomson, R. J. (2014). The total synthesis of
Isodon diterpenes. Angewandte Chemie Int. Edition 53, 1058810599. doi: 10.1002/
anie.201404482
Liang, C. L., Wang, L., Lei, J., Duan, B. Z., Ma, W. S., Xiao, S. M., et al. (2019). A
comparative analysis of the chloroplast genomes of four Salvia medicinal plants.
Engineering-Prc 5, 907915. doi: 10.1016/j.eng.2019.01.017
Lian, C. L., Yang, H., Lan, J. X., Zhang, X. Y., Zhang, F., Yang, J. F., et al. (2022).
Comparative analysis of chloroplast genomes reveals phylogenetic relationships
and intraspecic variation in the medicinal plant Isodon rubescens.PLoS One 17,
e266546. doi: 10.1371/journal.pone.0266546
Lin, C. Z., Liu, F. L., Zhang, R. J., Liu, M. T., Zhu, C. C., Zhao, J., et al. (2019).
High-performance thin-layer chromatographic ngerprints of triterpenoids for
distinguishing between Isodon lophanthoides and Isodon lophanthoides var.
gerardianus.J. Aoac Int. 102, 714719. doi: 10.5740/jaoacint.18-0305
Liu, M., Wang, W. G., Sun, H. D., and Pu, J. X. (2017). Diterpenoids from Isodon
species: an update. Nat. Prod. Rep. 34, 10901140. doi: 10.1039/C7NP00027H
Li, D. M., Ye, Y. J., Xu, Y. C., Liu, J. M., and Zhu, G. F. (2020). Complete
chloroplast genomes of Zingiber montanum and Zingiber zerumbet: Genome
structure, comparative and phylogenetic analyses. PLoS One 15, e236590.
doi: 10.1371/journal.pone.0236590
McDonald, M. J., Wang, W., Huang, H., and Leu, J. (2011). Clusters of
nucleotide substitutions and insertion/deletion mutations are associated with
repeat sequences. PLoS Biol. 9, e1000622. doi: 10.1371/journal.pbio.1000622
Meng, J., Li, X. P., Li, H. T., Yang, J. B., Wang, H. D., and He, J. (2018).
Comparative analysis of the complete chloroplast genomes of four Aconitum
medicinal species. Molecules 23, 1015. doi: 10.3390/molecules23051015
Millen, R. S., Olmstead, R. G., Adams, K. L., Palmer, J. D., Lao, N. T., Heggie, L.,
et al. (2001). Many parallel losses of infA from chloroplast DNA during angiosperm
evolution with multiple independent transfers to the nucleus. Plant Cell 13, 645
658. doi: 10.1105/tpc.13.3.645
Nguyen, L. T., Schmidt, H. A., Haeseler, A. V., and Minh, B. Q. (2015). IQ-
TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood
phylogenies. Mol. Biol. Evol. 32, 268274. doi: 10.1093/molbev/msu300
Qian, J., Song, J. Y., Gao, H. H., Zhu, Y. J., Xu, J., Pang, X. H., et al. (2013). The
complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza.
PLoS One 8, e57607. doi: 10.1371/journal.pone.0057607
Rahemi, A., Fatahi, R., Ebadi, A., Taghavi, T., Hassani, D., Gradziel, T., et al.
(2012). Genetic diversity of some wild almonds and related Prunus species revealed
by SSR and EST-SSR molecular markers. Plant Syst. Evol. 298, 173192.
doi: 10.1007/s00606-011-0536-x
Zhou et al. 10.3389/fpls.2022.1036277
Frontiers in Plant Science frontiersin.org11
Roman, M. G., and Houston, R. (2020). Investigation of chloroplast regions
rps16 and clpP for determination of Cannabis sativa crop type and biogeographical
origin. Legal Med-Tokyo 47, 101759. doi: 10.1016/j.legalmed.2020.101759
Rozas, J., Ferrer-Mata, A., Sanchez-DelBarrio, J. C., Guirao-Rico, S., Librado, P.,
Ramos-Onsins, S. E., et al. (2017). DnaSP 6: DNA sequence polymorphism analysis
of large data sets. Mol. Biol. Evol. 34, 32993302. doi: 10.1093/molbev/msx248
Shahzadi, I., Abdullah,, Mehmood, F., Ali, Z., Ahmed, I., and Mirza, B. (2020).
Chloroplast genome sequences of Artemisia maritima and Artemisia absinthium:
Comparative analyses, mutational hotspots in genus Artemisia and phylogeny in
family asteraceae. Genomics 112, 14541463. doi: 10.1016/j.ygeno.2019.08.016
Shang, M. Z., Liu, F., Hua, J. P., and Wang, K. B. (2011). Analysis on codon usage
of chloroplast genome of Gossypium hirsutum.Sci. Agr. Sin. 44, 245253.
doi: 10.3864/j.issn.0578-1752.2011.02.003
Shaw, J., Lickey, E. B., Beck, J. T., Farmer, S. B., Liu, W., Miller, J., et al. (2005). The
tortoise and the hare II: relative utility of 21 noncoding chloroplast DNA sequences
for phylogenetic analysis. Am. J. Bot. 92, 142166. doi: 10.3732/ajb.92.1.142
Su, X. H., Dong, C. M., Feng, W. S., Chen, S. Q., Wang, L., Wang, W. L., et al.
(2007). Study on the diversity of morphological characteristics of leaves of Rabdosia
rubescens(Hemls.) hara. Lishizhen Med. Mat. Med. 18, 23512353. doi: 10.3969/
j.issn.1008-0805.2007.10.010
Wang, J., Qian, J., Jiang, Y., Chen, X. C., Zheng, B. J., Chen, S. L., et al. (2022).
Comparative analysis of chloroplast genome and new insights into phylogenetic
relationships of Polygonatum and tribe polygonatea. Front. Plant Sci. 13.
doi: 10.3389/fpls.2022.882189
Wick, R. R., Schultz, M. B., Zobel, J., and Holt, K. E. (2015). Bandage: interactive
visualization ofde novo genome assemblies. Bioinformatics 31, 33503352.
doi: 10.1093/bioinformatics/btv383
Xia,Z.,Zhang,H.R.,Li,H.M.,andGao,Z.M.(2013).Molecular
authentication and phylogenetic relationship of Isodon rubescens and its
relatives. Chin. Trad. Herb. Drugs 44, 29042910. doi: 10.7501/j.issn.0253-
2670.2013.20.021
Xue, J. T., Yu, S. S., Wang, J. C., Li, C. Y., Yang, M. Y., Shi, Y. L., et al. (2020).
Network pharmacological screening of the active ingredients and hypoglycemic
effect of Isodon rubescens in the treatment of diabetes. Planta Med. 86, 556564.
doi: 10.1055/a-1147-9196
Yue, Z. Y., Wang, Y., Zhou, B. Z., and Wang, H. P. (2021). The complete
chloroplast genome of Isodon rubescens, a traditional Chinese herb. Mitochondrial.
DNA B Resour. 6, 337338. doi: 10.1080/23802359.2020.1860704
Yu, X., Maki, M., Drew, B. T., Paton, A. J., Li, H., Zhao, J., et al. (2014).
Phylogeny and historical biogeography of Isodon (Lamiaceae): rapid radiation in
south-west China and Miocene overland dispersal into Africa. Mol. Phylogenet.
Evol. 77, 183194. doi: 10.1016/j.ympev.2014.04.017
Zhang, Z., Chuang, Y., Huang, N., and Mitch, W. A. (2019). Predicting the
contribution of chloramines to contaminant decay during Ultraviolet/Hydrogen
peroxide advanced oxidation process treatment for potable reuse. Environ. Sci.
Technol. 53, 44164425. doi: 10.1021/acs.est.8b06894
Zhang,D.S.,Hu,P.,Liu,T.G.,Wang,J.,Jiang,S.W.,Xu,Q.H.,etal.(2018).
GC bias lead to increased small amino acids and random coils of proteins in
cold-water shes. BMC Genomics 19, 315325. doi: 10.1186/s12864-018-
4684-z
Zhong, J. S., Li, J., Li, L., Conran, J. G., and Li, H. W. (2010). Phylogeny of Isodon
(Schrad. ex benth.) spach (Lamiaceae) and related genera inferred from nuclear
ribosomal ITS, trnLtrnF region, and rps16 intron sequences and morphology. Syst.
Bot. 35, 207219. doi: 10.1600/036364410790862614
Zhou et al. 10.3389/fpls.2022.1036277
Frontiers in Plant Science frontiersin.org12
... The chloroplast (cp) is involved in plant photosynthesis and numerous biochemical processes [10,11]. Unlike the nuclear genome, the cp genome is comparatively stable, exhibiting only minor variations [12,13], which has been successfully employed for discriminating Amomum, Isodon, and their contaminants [14,15]. Although the cp genomes of some Scutellaria species have been described [16,17], these studies mainly characterized a single genome or compared the intraspecific variation. ...
... Scutellaria exhibited a high coding efficiency and a strong preference for A/U termination codons, possibly owing to the overall high AT content in the cp genome. This trend was also evident in other angiosperms [13,15]. ...
... Several molecular studies have demonstrated the high identification capability of cp genetic markers in distinguishing Phyllanthus and Isodon species [15,28]. Four regions (matK-rps16, ndhC-trnV-UAC , psbE-petL, and rps16-trnQ-UUG ) have been filtered as candidate molecular markers for discriminating S. baicalensis from its common adulterants in this study. ...
Article
Full-text available
Background Scutellaria baicalensis Georgi has been extensively used as a medicinal herb in China for over 2000 years. They may be intentionally or inadvertently substituted or blended with comparable species in the local market, threatening clinical medication safety. Molecular markers are effective tools to prevent misidentification and eliminate doping and falsification among Scutellaria plants. This study screened four highly variable regions to identify Scutellaria and its adulterants. In addition, a phylogenetic analysis was performed using the complete cp genome combined with published Scutellaria species samples. Moreover, a comparative analysis of the cp genomes was conducted to investigate the cp genome evolution of S. baicalensis. Results The complete cp genome of five species of Scutellaria was sequenced for the first time, and four previously published Scutellaria species were re-sequenced. They all exhibited a conserved quadripartite structure in their cp genomes, including two distinct regions, namely a small and large single copy region, respectively, and two inverted repeats encompassing the majority of ribosomal RNA genes. Furthermore, the nine species exhibited high conservation from aspects of the genome structure, codon usage, repeat sequences, and gene content. Four highly variable regions (matK-rps16, ndhC-trnV-UAC, psbE-petL, and rps16-trnQ-UUG) may function as potential molecular markers for differentiating S. baicalensis from its adulterants. Additionally, the monophyly of Scutellaria was ascertained and could be reclassified into two subgenera, subgenus Anaspis and subgenus Scutellaria, as evidenced by the phylogenetic analyses on sequences of cp genome and shared protein-coding sequences. According to the molecular clock analysis, it has been inferred that the divergence of Scutellaria occurred at approximately 4.0 Mya during the Pliocene Epoch. Conclusion Our study provides an invaluable theoretical basis for further Scutellaria species identification, phylogenetics, and evolution analysis.
... It is widely used in the study of plant molecular evolution and phylogeny, as well as in genetic transformation, genetic engineering, and molecular breeding [13][14][15]. Recently, with the gradual popularization and low cost of high-throughput sequencing technology, some scientists have proposed that cp genome data can be used for species identification and interspecific differentiation research [16][17][18][19][20]. There is no significant difference in some appearance indicators among the three H. forbesii. ...
... Intergenic spacer regions are the most frequently used cp markers for phylogenetic studies at lower taxonomic levels in plants [41], as they are regarded as more variable and could provide more phylogenetically informative characters. In a previous study, rps15-ycf1 was identified as a mutational hotspot for identifying Isodon rubescens [18]. The PetD located in the LSC region, as a protein-coding gene, contains an intron, which is consistent with previous genetic studies [40] and can also be used as a highly variable region for species identification. ...
Article
Full-text available
To find the gene hypervariable regions of three varieties of Hansenia forbesii H. Boissieu and determine their phylogenetic relationship, the chloroplast (cp) genome of these three varieties were firstly sequencing by the Illumina hiseq platform. In this study, we assembled the complete cp genome sequences of Hansenia forbesii LQ (156,954 bp), H. forbesii QX (157,181 bp), H. forbesii WQ (156,975 bp). They all contained 84 protein-coding genes, 37 tRNAs, and 8 rRNAs. The hypervariable regions between three cp genomes were atpF-atpH, petD, and rps15-ycf1. Phylogenetic analysis showed that H. forbesii LQ and H. forbesii WQ were closely related, followed by H. forbesii QX. This study showed that the three varieties of H. forbesii could be identified by the complete cp genome and specific DNA barcode (trnC-GCA-petN) and provided a new idea for germplasm identification of similar cultivated varieties.
... Cynanchum exhibited a high coding efficiency and a strong preference for A/U termination codons, possibly owing to the overall high AT content in the cp genome. This trend was also evident in other angiosperms [45]. Our investigation of Cynanchum genomes revealed that non-coding regions displayed more substantial variation than coding regions. ...
Article
Full-text available
Cynanchum belongs to the Apocynaceae family and is a morphologically diverse genus that includes around 200 shrub or perennial herb species. Despite the utilization of CPGs, few molecular phylogenetic studies have endeavored to elucidate infrafamilial relationships within Cynanchum through extensive taxon sampling. In this research, we constructed a phylogeny and estimated divergence time based on the chloroplast genomes (CPGs) of nine Cynanchum species. We sequenced and annotated nine chloroplast (CP) genomes in this study. The comparative analysis of these genomes from these Cynanchum species revealed a typical quadripartite structure, with a total sequence length ranging from 158,283 to 161,241 base pairs (bp). The CP genome (CPG) was highly conserved and moderately differentiated. Through annotation, we identified a total of 129–132 genes. Analysis of the boundaries of inverted repeat (IR) regions showed consistent positioning: the rps19 gene was located in the IRb region, varying from 46 to 50 bp. IRb/SSC junctions were located between the trnN and ndhF genes. We did not detect major expansions or contractions in the IR region or rearrangements or insertions in the CPGs of the nine Cynanchum species. The results of SSR analysis revealed a variation in the number of SSRs, ranging from 112 to 150. In five types of SSRs, the largest number was mononucleotide repeats, and the smallest number was hexanucleotide repeats. The number of long repeats in the cp genomes of nine Cynanchum species was from 35 to 80. In nine species of Cynanchum, the GC3s values ranged from 26.80% to 27.00%, indicating a strong bias towards A/U-ending codons. Comparative analyses revealed four hotspot regions in the CPG, ndhA-ndhH, trnI-GAU-rrn16, psbI-trnS-GCU, and rps7-ndhB, which could potentially serve as molecular markers. In addition, phylogenetic tree construction based on the CPG indicated that the nine Cynanchum species formed a monophyletic group. Molecular dating suggested that Cynanchum diverged from its sister genus approximately 18.87 million years ago (Mya) and species diversification within the Cynanchum species primarily occurred during the recent Miocene epoch. The divergence time estimation presented in this study will facilitate future research on Cynanchum, aid in species differentiation, and facilitate diverse investigations into this economically and ecologically important genus.
... Previous studies demonstrated that the identification efficiency of DNA barcode markers in specific regions for closely related species was only about 80% . Several highly informative DNA barcode markers for specific taxa have been developed using comparative analyses of c hloroplast genomes (Zhou et al., 2022). After a comparative analysis of the Rheum palmatum, R. tanguticum, and R. officinale chloroplast genomes, five hypervariable regions (rps16-trnQ, psaA-ycf3, psbE-petL, ndhF-rpl32, and trnT-trnL) were identified and used as specific DNA barcodes for the identification of 42 samples among R. tanguticum, R. officinale, and R. palmatum (Li et al., 2022). ...
Article
Full-text available
Pseudostellaria heterophylla (Miq.) Pax is a well-known medicinal and ecologically important plant. Effectively distinguishing its different genetic resources is essential for its breeding. Plant chloroplast genomes can provide much more information than traditional molecular markers and provide higher-resolution genetic analyses to distinguish closely related planting materials. Here, seventeen P. heterophylla samples from Anhui, Fujian, Guizhou, Hebei, Hunan, Jiangsu, and Shandong provinces were collected, and a genome skimming strategy was employed to obtain their chloroplast genomes. The P. heterophylla chloroplast genomes ranged from 149,356 bp to 149,592 bp in length, and a total of 111 unique genes were annotated, including 77 protein-coding genes, 30 tRNA genes, and four rRNA genes. Codon usage analysis showed that leucine had the highest frequency, while UUU (encoding phenylalanine) and UGC (encoding cysteine) were identified as the most and least frequently used codons, respectively. A total of 75–84 SSRs, 16–21 short tandem repeats, and 27–32 long repeat structures were identified in these chloroplast genomes. Then, four primer pairs were revealed for identifying SSR polymorphisms. Palindromes are the dominant type, accounting for an average of 47.86% of all long repeat sequences. Gene orders were highly collinear, and IR regions were highly conserved. Genome alignment indicated that there were four intergenic regions (psaI-ycf4, ycf3-trnS, ndhC-trnV, and ndhI-ndhG) and three coding genes (ndhJ, ycf1, and rpl20) that were highly variable among different P. heterophylla samples. Moreover, 10 SNP/MNP sites with high polymorphism were selected for further study. Phylogenetic analysis showed that populations of Chinese were clustered into a monophyletic group, in which the non-flowering variety formed a separate subclade with high statistical support. In this study, the comparative analysis of complete chloroplast genomes revealed intraspecific variations in P. heterophylla and further supported the idea that chloroplast genomes could elucidate relatedness among closely related cultivation materials.
Article
i>Dendrobium porphyrochilum Lindl. is a valued traditional herb in China for over 2000 years. However, due to overexploitation and habitat destruction, this species now faces the threat of extinction. This study presents the first sequencing and assembly of the complete chloroplast (cp) genome of D. porphyrochilum and investigates its phylogenetic relationships. The result showed that the cp genome of D. porphyrochilum is 154,604 bp in length, comprising a large single copy (LSC) region of 84,816 bp, a small single copy (SSC) region of 17,660 bp, and two inverted repeats (IR) regions of 52,128 bp each. The overall GC content of the plastome was determined to be 37.3%, with GC contents of 35.0% for the LSC, 30.4% for the SSC, and 43.5% for IRa and IRb. In addition, 118 genes were annotated, including 72 protein-coding genes, 8 ribosomal RNA genes, and 38 transfer RNA genes. A preliminary phylogenetic analysis confirmed that D. porphyrochilum belongs to the genus Dendrobium based on the cp genome and coding sequences. These findings provide reliable genetic information for future investigations into the taxonomy and evolutionary relationships of D. porphyrochilum and related species.
Article
The chloroplast genomes of five Fritillaria ussuriensis materials from different production areas were comparatively analyzed, atpF and petB were screened as specific DNA barcodes, and the population identification and genetic diversity of F. ussuriensis were analyzed based on them. The F. ussuriensis chloroplast genome showed a total length of 151 515–151 548 bp with a typical tetrad structure and encoded 130 genes. atpF and petB were used to amplify 183 samples from 13 populations, and they could identify 6 and 9 haplotypes, respectively. Joint analysis of the two sequences revealed 18 haplotypes, named H1–H18, with the most widely distributed and most abundant being H4. Ten haplotypes were unique for 7 populations that they could be used to distinguish from others. Haplotype diversity and nucleotide diversity were 0.99 and 2.09 × 10−3, respectively, indicating the genetic diversity was relatively rich. The results of the intermediary adjacency network showed that H5 was the oldest haplotype, and stellate radiation was centered around it, indicating that population expansion occurred in genuine production areas. This study lays a theoretical foundation for the population identification, genetic evolution, and breed selection of F. ussuriensis.
Article
i>Polygonatum hirtellum is a perennial herb within the family Asparagaceae. In the present study, the chloroplast (cp) genome of P. hirtellum is reported for the first time, and its phylogenetic relationships are also investigated. The findings revealed that the cp genome was 155,697 bp and exhibited a typical quadripartite structure, consisting of a large single copy (84,468 bp), a small single copy (18,491 bp), and a pair of invert repeat sequences (26,369 bp). The GC content of the cp genomes amounted to 37.7%; 130 genes were identified, consisting of 84 protein-coding genes, 38 transfer RNA genes, and eight ribosomal RNA genes. A phylogenetic analysis based on the cp genome and coding sequence showed that P. hirtellum was a member of section Verticillata and could be distinguished from other species of the genus Polygonatum used in the analysis.
Article
Full-text available
Introduction The potential contamination of herbal medicinal products poses a significant concern for consumer health. Given the limited availability of genetic information concerning Ajuga species, it becomes imperative to incorporate supplementary molecular markers to enhance and ensure accurate species identification. Methods In this study, the chloroplast (cp) genomes of seven species of the genus Ajuag were sequenced, de novo assembled and characterized. Results exhibiting lengths ranging from 150,342 bp to 150,472 bp, encompassing 86 - 88 protein-coding genes (PCGs), 35 - 37 transfer RNA, and eight ribosomal RNA. The repetitive sequences, codon uses, and cp genomes of seven species were highly conserved, and PCGs were the reliable molecular markers for investigating the phylogenetic relationship within the Ajuga genus. Moreover, four mutation hotspot regions (accD-psaI, atpH-atpI, ndhC-trnV(UAC), and ndhF-rpl23) were identified within cp genomes of Ajuga, which could help distinguish A. bracteosa and its contaminants. Based on cp genomes and PCGs, the phylogenetic tree preliminary confirmed the position of Ajuga within the Lamiaceae family. It strongly supported a sister relationship between Subsect. Genevense and Subsect. Biflorae, suggesting the merger of Subsect. Biflorae and Subsect. Genevenses into one group rather than maintaining separate categorizations. Additionally, molecular clock analysis estimated the divergence time of Ajuga to be around 7.78 million years ago. Discussion The species authentication, phylogeny, and evolution analyses of the Ajuga species may benefit from the above findings.
Preprint
Full-text available
We analyzed and compared the three Leonurus species from appearance characteristics, intrinsic gene contents and functions, gene transcribed spacer regions (ITS2) in the nuclear genome, and the process of evolutionary development. The chloroplast genomes of them were found to be common circular structures with the four regions, the length range from 151236bp to 151689bp including the identical number of 37 tRNA and 8 rRNA. The special trans-spliced rps 12 genes with two introns and three exons can be caught sight of L. sibiricus. The isoleucine (AUU) and cysteine (UGC) was the most and least abundant amino acid with the codons ending with most bases of A/U. The eleven hotspot divergent regions and four specific CDS genes were systematically counted with the highest nucleotide diversity. We cloned the DNA sequences of the two chosen IGS regions ( atp H- atp I and rps1 5- ycf 1) to develop the DNA barcodes, thus found that sixteen specific variable SNP sites and two Indel sites within the three Leonurus species. Furthermore, the ITS2 DNA sequences of 7 Leonurus species were varied after the 57th base in a total of 221 bases. In the aftermath of evolutional analysis, the three Leonurus species were significantly clustered into one great clade, while they were located at the different sub-branch with similar topology and close relationship of Phlomoides rotata based on the sequences of 64 shared nucleotide and ITS2 DNA. Thus above results can directly offer various evidence to better clarify the specific distinction among the three Leonurus species and their evolutionary history.
Article
Full-text available
Members of Polygonatum are perennial herbs that have been widely used in traditional Chinese medicine to invigorate Qi, moisten the lung, and benefit the kidney and spleen among patients. However, the phylogenetic relationships and intrageneric taxonomy within Polygonatum have long been controversial because of the complexity of their morphological variations and lack of high-resolution molecular markers. The chloroplast (cp) genome is an optimal model for deciphering phylogenetic relationships in related families. In the present study, the complete cp genome of 26 species of Trib. Polygonateae were de novo assembled and characterized; all species exhibited a conserved quadripartite structure, that is, two inverted repeats (IR) containing most of the ribosomal RNA genes, and two unique regions, large single sequence (LSC) and small single sequence (SSC). A total of 8 highly variable regions (rps16-trnQ-UUG, trnS-GCU-trnG-UCC, rpl32-trnL-UAG, matK-rps16, petA-psbJ, trnT-UGU-trnL-UAA, accD-psaI, and trnC-GCA-petN) that might be useful as potential molecular markers for identifying Polygonatum species were identified. The molecular clock analysis results showed that the divergence time of Polygonatum might occur at ∼14.71 Ma, and the verticillate leaf might be the ancestral state of this genus. Moreover, phylogenetic analysis based on 88 cp genomes strongly supported the monophyly of Polygonatum. The phylogenetic analysis also suggested that Heteropolygonatum may be the sister group of the Polygonatum, but the Disporopsis, Maianthemum, and Disporum may have diverged earlier. This study provides valuable information for further species identification, evolution, and phylogenetic research of Polygonatum.
Article
Full-text available
Artemisia argyi Levl. et Van is an important Asteraceae species with a high medicinal value. There are abundant A. argyi germplasm resources in Asia, especially in China, but the evolutionary relationships of these varieties and the systematic localization of A. argyi in the family Asteraceae are still unclear. In this study, the chloroplast (cp) genomes of 72 A. argyi varieties were systematically analyzed. The 72 varieties originated from 47 regions in China at different longitudes, latitudes and altitudes, and included both wild and cultivated varieties. The A. argyi cp genome was found to be ∼151 kb in size and to contain 114 genes, including 82 protein-coding, 28 tRNA, and 4 rRNA genes. The number of short sequence repeats (SSRs) in A. argyi cp genomes ranged from 35 to 42, and most of them were mononucleotide A/T repeats. A total of 196 polymorphic sites were detected in the cp genomes of the 72 varieties. Phylogenetic analysis demonstrated that the genetic relationship between A. argyi varieties had a weak relationship with their geographical distribution. Furthermore, inverted repeat (IR) boundaries of 10 Artemisia species were found to be significantly different. A sequence divergence analysis of Asteraceae cp genomes showed that the variable regions were mostly located in single-copy (SC) regions and that the coding regions were more conserved than the non-coding regions. A phylogenetic tree was constructed using 43 protein-coding genes common to 67 Asteraceae species. The resulting tree was consistent with the traditional classification system; Artemisia species were clustered into one group, and A. argyi was shown to be closely related to Artemisia lactiflora and Artemisia montana. In summary, this study systematically analyzed the cp genome characteristics of A. argyi and compared cp genomes of Asteraceae species. The results provide valuable information for the definitive identification of A. argyi varieties and for the understanding of the evolutionary relationships between Asteraceae species.
Article
Full-text available
Isodon rubescens (Hemsley) H. Hara (Lamiaceae) is a traditional Chinese medicine plant that has been used to treat various human diseases and conditions such as inflammation, respiratory and gastrointestinal bacterial infections, and malignant tumors. However, the contents of the main active components of I. rubescens from different origins differ significantly, which greatly affected its quality. Therefore, a molecular method to identify and classify I. rubescens is needed. Here, we report the DNA sequence of the chloroplast genome of I. rubescens collected from Lushan, Henan province. The genome is 152,642 bp in length and has a conserved structure that includes a pair of IR regions (25,726 bp), a LSC region (83,527 bp) and a SSC region (17,663 bp). The chloroplast genome contains 113 unique genes, four rRNA genes, 30 tRNA genes, and 79 protein-coding genes, 23 of which contain introns. The protein-coding genes account for a total of 24,412 codons, and most of them are A/T biased usage. We identified 32 simple sequence repeats (SSRs) and 48 long repeats. Furthermore, we developed valuable chloroplast molecular resources by comparing chloroplast genomes from three Isodon species, and both mVISTA and DnaSP analyses showed that rps16-trnQ, trnS-trnG, and ndhC-trnM are candidate regions that will allow the identification of intraspecific differences within I. rubescens. Also 14 candidate fragments can be used to identify interspecific differences between species in Isodon. A phylogenetic analysis of the complete chloroplast genomes of 24 species in subfamily Nepetoideae was performed using the maximum likelihood method, and shows that I. rubescens clustered closer to I. serra than I. lophanthoides. Interestingly, our analysis showed that I. rubescens (MW018469.1) from Xianyang, Shaanxi Province (IR-X), is closer to I. serra than to the other two I. rubescens accessions. These results strongly indicate that intraspecific diversity is present in I. rubescens. Therefore, our results provide further insight into the phylogenetic relationships and interspecific diversity of species in the genus Isodon.
Article
Full-text available
Numerous antitumor drugs exhibited excellent anti-proliferation activities, discovered from natural products. Oridonin is isolated from a traditional Chinese medicine Isodon rubescens, and it exhibit various biological functions including anticancer, anti-inflammation, antibacterial, and so on. The anticancer mechanisms of oridonin include apoptosis, autophagy, cell cycle arrest, and so on. However, clinical research and application of oridonin has been hampered, due to the poor pharmaceutical property. So many studies on oridonin have been explored in the purpose of improve the pharmaceutical property, including investigation of the mechanism of action in depth and structural modification. In this review, we summarized oridonin and oridonin derivatives as potential anticancer agents, and detailed discussion on biological properties, mechanism of actions and potential for overcoming therapeutic resistance.
Article
Full-text available
Isodon rubescens (Hemsley) H. Hara is a kind of traditional medical herb which can be used for cancer treatment. In this study, the complete chloroplast genome sequence of I. rubescens was assembled. Its complete circular chloroplast DNA length was 152,761 bp. The genome was made up of a large single-copy region of 83,655 bp, a small single-copy region of 17,660 bp, and a pair of inverted repeat regions of 25,723 bp. The genome totally encoded 129 genes, containing 85 protein-coding genes, 36 tRNA genes, and eight rRNA genes. Phylogenetic analysis indicated that I. rubescens had a close relationship with basil (Ocimum basilicum).
Article
Full-text available
Abutilon fruticosum is one of the endemic plants with high medicinal and economic value in Saudi Arabia and belongs to the family Malvaceae. However, the plastome sequence and phylogenetic position have not been reported until this study. In this research, the complete chloroplast genome of A. fruticosum was sequenced and assembled, and comparative and phylogenetic analyses within the Malvaceae family were conducted. The chloroplast genome (cp genome) has a circular and quadripartite structure with a total length of 160,357 bp and contains 114 unique genes (80 protein-coding genes, 30 tRNA genes and 4 rRNA genes). The repeat analyses indicate that all the types of repeats (palindromic, complement, forward and reverse) were present in the genome, with palindromic occurring more frequently. A total number of 212 microsatellites were identified in the plastome, of which the majority are mononucleotides. Comparative analyses with other species of Malvaceae indicate a high level of resemblance in gene content and structural organization and a significant level of variation in the position of genes in single copy and inverted repeat borders. The analyses also reveal variable hotspots in the genomes that can serve as barcodes and tools for inferring phylogenetic relationships in the family: the regions include trnH-psbA, trnK-rps16, psbI-trnS, atpH-atpI, trnT-trnL, matK, ycf1 and ndhH. Phylogenetic analysis indicates that A. fruticosum is closely related to Althaea officinalis, which disagrees with the previous systematic position of the species. This study provides insights into the systematic position of A. fruticosum and valuable resources for further phylogenetic and evolutionary studies of the species and the Malvaceae family to resolve ambiguous issues within the taxa.
Article
Full-text available
Zingiber montanum (Z. montanum) and Zingiber zerumbet (Z. zerumbet) are important medicinal and ornamental herbs in the genus Zingiber and family Zingiberaceae. Chloroplast-derived markers are useful for species identification and phylogenetic studies, but further development is warranted for these two Zingiber species. In this study, we report the complete chloroplast genomes of Z. montanum and Z. zerumbet, which had lengths of 164,464 bp and 163,589 bp, respectively. These genomes had typical quadripartite structures with a large single copy (LSC, 87,856–89,161 bp), a small single copy (SSC, 15,803–15,642 bp), and a pair of inverted repeats (IRa and IRb, 29,393–30,449 bp). We identified 111 unique genes in each chloroplast genome, including 79 protein-coding genes, 28 tRNAs and 4 rRNA genes. We analyzed the molecular structures, gene information, amino acid frequencies, codon usage patterns, RNA editing sites, simple sequence repeats (SSRs) and long repeats from the two chloroplast genomes. A comparison of the Z. montanum and Z. zerumbet chloroplast genomes detected 489 single-nucleotide polymorphisms (SNPs) and 172 insertions/deletions (indels). Thirteen highly divergent regions, including ycf1, rps19, rps18-rpl20, accD-psaI, psaC-ndhE, psbA-trnK-UUU, trnfM-CAU-rps14, trnE-UUC-trnT-UGU, ccsA-ndhD, psbC-trnS-UGA, start-psbA, petA-psbJ, and rbcL-accD, were identified and might be useful for future species identification and phylogeny in the genus Zingiber. Positive selection was observed for ATP synthase (atpA and atpB), RNA polymerase (rpoA), small subunit ribosomal protein (rps3) and other protein-coding genes (accD, clpP, ycf1, and ycf2) based on the Ka/Ks ratios. Additionally, chloroplast SNP-based phylogeny analyses found that Zingiber was a monophyletic sister branch to Kaempferia and that chloroplast SNPs could be used to identify Zingiber species. The genome resources in our study provide valuable information for the identification and phylogenetic analysis of the genus Zingiber and family Zingiberaceae.
Article
Full-text available
Isodon brevipedunculatus, a new species from southern China, is described and illustrated. The phylogenetic position of the new species within the genus was analyzed based on two nuclear ribosomal DNA regions and an ingroup sampling of about 80% of Asian species of Isodon. The results show that I. brevipedunculatus is recovered in a clade that consists of species mainly with glandular mericarps and that are distributed in the Sino-Japanese region. Combining molecular and geographical evidence, our study reveals that I. brevipedunculatus is most closely related to I. amethystoides and I. bifidocalyx, but differs from the former in lamina shape, number of flowers per cyme, and peduncle length, and from the latter in lamina indumentum, calyx morphology, and corolla length.
Article
Paris L. genus has been a precious traditional herb for more than 2000 years in China. However, due to overexploitation and habitat destruction, Paris is threatened by extinction. Similar morphological features cause the classification of Paris species in dispute. The chloroplast (cp) genome approach has been used to investigate the evolution of Paris. However, some studies confirm that the cp genome may result in misleading relationships because of the length variation, gaps/indels deletion, and incorrect models of sequence evolution in concatenated datasets. Therefore, there is a high demand for a reconstructed phylogenetic relationship and developed genetic markers to conserve these species. Recent studies have demonstrated that the protein-coding genes could provide a better phylogenetic relationship in the phylogenetic investigation. In this study, the complete cp genomes of five species were characterized, and the length of five cp genomes ranges from 162,927 bp to 165,267 bp, covering 89 protein-coding genes, 38 tRNA, and eight rRNA. The analysis of the repeat sequences, codon usage, RNA-editing sites, and comparison of cp genomes shared a high degree of conservation. Based on the protein-coding genes, the phylogenetic tree confirmed Paris’s position in the order Melanthiaceae, providing maximum support for a sister relationship of the subgenera Paris sensu strict (Paris s.s.) with the Daiswa and Trillium. In addition, the molecular clock showed that subgenus Paris was inferred to have occurred at about 52.81 Mya, whereas subgenus Daiswa has originated at 24.56 Mya, which was consistent with the phylogenetic investigation. This study provided a valuable insight into the evolutionary dynamics of cp genome structure in the family Melanthiaceae, and it also contributes to the bioprospecting and conservation of Paris species.
Article
Cannabis sativa can be classified as either hemp (a legal crop containing less than 0.3% delta-9-tetrahydrocannabinol, THC) or marijuana (an illegal drug containing more than 0.3% THC). Despite its legalization in 33 states for medicinal or recreational use, marijuana remains the most commonly used illicit drug in the USA, and it is heavily trafficked into and within the country. Discriminating between marijuana and hemp is critical to the legal process. Genetic analysis provides a means of analyzing samples unsuitable for chemical analysis, and in addition to discriminating between crop types, DNA may be able to determine the biogeographical origin of samples. In addition, the sharing of rare haplotypes between different seizures may be useful for linking cases and providing investigative leads to law enforcement. This study evaluates the potential of two highly polymorphic regions of the chloroplast genome of C. sativa, rps16 and clpP, to be used for determination of crop type and biogeographical origin. Custom fragment analysis and SNaPshot™ assays were developed to genotype nine polymorphic loci in hemp samples from the USA and Canada, marijuana samples from USA-Mexico and Chile, and medical marijuana samples from Chile. Haplotype analysis revealed eight haplotypes. Only Canadian hemp could be completely differentiated from the other sample groups by haplotype. Phylogenetic analysis and principal component analysis suggested a closer relationship among USA-Mexico marijuana, Chilean marijuana and medical marijuana, and USA hemp. Genotyping additional polymorphisms in future studies is expected to reveal further differences between these sample groups.