ArticlePDF Available

Characteristics of Amorphophallus konjac as indicated by its genome

Springer Nature
Scientific Reports
Authors:

Abstract and Figures

Amorphophallus konjac, belonging to the genus Amorphophallus of the Araceae family, is an economically important crop widely used in health products and biomaterials. In the present work, we performed the whole-genome assembly of A. konjac based on the NovaSeq platform sequence data. The final genome assembly was 4.58 Gb with a scaffold N50 of 3212 bp. The genome includes 39,421 protein-coding genes, and 71.75% of the assemblies were repetitive sequences. Comparative genomic analysis showed 1647 gene families have expanded and 2685 contracted in the A. konjac genome. Likewise, genome evolution analysis indicated that A. konjac underwent whole-genome duplication, possibly contributing to the expansion of certain gene families. Furthermore, we identified many candidate genes involved in the tuber formation and development, cellulose and lignification synthesis. The genome of A. konjac obtained in this work provides a valuable resource for the further study of the genetics, genomics, and breeding of this economically important crop, as well as for evolutionary studies of Araceae family.
This content is subject to copyright. Terms and conditions apply.
1
Vol.:(0123456789)
Scientic Reports | (2023) 13:22684 | https://doi.org/10.1038/s41598-023-49963-9
www.nature.com/scientificreports
Characteristics of Amorphophallus
konjac as indicated by its genome
Lifang Li
1,6, Min Yang
1,6, Wei Wei
1, Jianrong Zhao
1, Xuya Yu
2, Rarisara Impaprasert
3,
Jianguang Wang
4, Jiani Liu
1, Feiyan Huang
1, George Srzednicki
5* & Lei Yu
1*
Amorphophallus konjac, belonging to the genus Amorphophallus of the Araceae family, is an
economically important crop widely used in health products and biomaterials. In the present work, we
performed the whole-genome assembly of A. konjac based on the NovaSeq platform sequence data.
The nal genome assembly was 4.58 Gb with a scaold N50 of 3212 bp. The genome includes 39,421
protein-coding genes, and 71.75% of the assemblies were repetitive sequences. Comparative genomic
analysis showed 1647 gene families have expanded and 2685 contracted in the A. konjac genome.
Likewise, genome evolution analysis indicated that A. konjac underwent whole-genome duplication,
possibly contributing to the expansion of certain gene families. Furthermore, we identied many
candidate genes involved in the tuber formation and development, cellulose and lignication
synthesis. The genome of A. konjac obtained in this work provides a valuable resource for the further
study of the genetics, genomics, and breeding of this economically important crop, as well as for
evolutionary studies of Araceae family.
e genus Amorphophallus1, a member of the Araceae family, is a perennial, herbaceous plant (Fig.1a). It is
estimated that it includes over 170 species occurring from West Africa, through subtropical and tropical Asia
and further south in the tropical regions of the western Pacic and north-eastern Australia2. e Amorphophal-
lus plants store their reserve polysaccharides, starch and glucomannan, in underground tubers. Some of these
species contain considerable amounts of konjac glucomannan (KGM). e species producing glucomannan are
generally known by the common name ‘konjac’ and are economically important as a raw material for food and
pharmaceutical products worldwide3. is common name comes from Amorphophallus konjac, species that has
been used widely in China and Japan for commercial KGM production. KGM it is used in products ranging
from emulsiers to weight loss supplements, in addition to its long-standing usage as a food and traditional
medicine. China is both, a center of diversity for Amorphophallus and one of the major producers of this plant
worldwide. It is also, along with Japan, one of the leading producers of KGM derived products. A. konjac is a
diploid species (2n = 13) and is one of the important commercial crops cultivated in the central and western
regions of China because it is the only plant species which is rich in KGM concentration4. KGM is a water-soluble,
neutral polysaccharide with a high molecular weight5,6. KGM is a β-1, 4 linked polysaccharide composed of a
-glucose (G) and -mannoses (M) backbone7. e KGM backbone possesses 5–10% acetyl-substituted residues
and the presence of substituted group benets KGM for the solubility in aqueous solution, leading to high vis-
cosity that forms a thick hydrocolloid even when used at low concentrations8. is property makes it one of the
most versatile and economically useful hydrocolloids with industrial applications including the manufacture of
foods, pharmaceuticals and chemicals. KGM is used in a wide range of commercial products throughout Asia
and increasingly throughout the rest of the world6. us, the high quality and purity of KGM obtained from A.
konjac make it the most abundant cultivated Amophophallus species in China, especially in Yunnan. Daguan
county is one of the largest plantation areas of A. konjac in Yunnan and the local A. konjac as an economically
important crop for rural revitalization in this region. en, the representative landrace A. konjac in this region
was used for whole genome sequencing.
Given the economic potential of KGM, a number of studies have been conducted on Amorphophallus species
producing this biopolymer6,911. e researchers focused on the relationship between genetic markers and KGM
biosynthesis in A. konjac, and adopted a transcriptomics approach to identify potentially useful regions in the
genome. ey also studied several other KGM producing Amorphophallus species. ese studies are on-going in
OPEN
1College of Agronomy, Yunnan Urban Agricultural Engineering and Technological Research Center, Kunming
University, Kunming, China. 2Faculty of Life Science and Technology, Kunming University of Science and
Technology, Kunming, China. 3Department of Microbiology, King Mongkut’s University of Technology Thonburi,
Bangkok, Thailand. 4School of Life Sciences, Yunnan University, Kunming, China. 5Food Science & Technology,
School of Chemical Engineering, The University of New South Wales, Sydney, Australia. 6These authors contributed
equally: Lifang Li and Min Yang. *email: georgesrz@yahoo.com; yulei0425@163.com
Content courtesy of Springer Nature, terms of use apply. Rights reserved
2
Vol:.(1234567890)
Scientic Reports | (2023) 13:22684 | https://doi.org/10.1038/s41598-023-49963-9
www.nature.com/scientificreports/
order to better understand the association between genetic diversity and KGM content in a broader population
of Amorphophallus species.
e main species of Amorphophallus genus have been studied and described in relation to their morphol-
ogy and palynology1215. Since the morphological and palynological characters are highly variable, a number
of molecular markers have been employed to determine relationships in the genus. ese markers include the
LEAFY (FLint2) gene and the chloroplast regions rbcL, matK and trnL1619. Since phylogenetic studies based on
these regions do not produce consistent cladograms (due to a high level of conicting signals in the informative
characters), further variable regions and also other non-sequencing molecular methods are needed to establish
the evolutionary history of Amorphophallus. e transcriptomics approach may lead to useful insights into
important traits such as KGM production, tuber formation and development and other characteristics.
e genomes of two important monocotyledonous species in the order of Alismatales namely Spirodela
polyrhiza20 and Zostera marina21 have been sequenced and their characteristics have been described by the
authors of these papers. Although A. konjac as a glucomannan-producing cash crop in many Asian countries,
there have been no any genomic information reports on A. konjac before we conducted whole-genome sequenc-
ing on this species. erefore, we sequenced the whole genome of A. konjac, and the data was submitted to the
NCBI database in 2020. Although Gao etal. subsequently provided a high-quality chromosome-level genome
of A. konjac22, our results can also enrich the genomic information of Amorphophallus to a certain extent. In this
study, we performed a series of genomic analyses on A. konjac including assembly, annotations, identication
of phylogenetic relationship, gene family analysis, divergence time estimation. We also identied cellulose and
lignication synthesis genes, and tuber formation and development genes. e results will provide important
insights as well as resources for future study of A. konjac.
130.7
124.6
86.2 47.5 0
Million years ago
Gene families
Expansion
Contraction
+1119
-2621
+587
-1829
+1647
-2685
+52
-874
+0
-3
+3302
-674
+534
-2580
+814
-349
MRCA
(11659)
Combined Change
Across Lineages
Z. marina
S. polyrhiza
A. konjac
Z. mays
O. sativa
124.6
124.6
86.2
86.2
130.7
130.7
47.5
47.5
A. konjac
Z. marina
O. sativa
S. polyrhize
Z. mays
0
8,000
16,000
24,000
32,000
40,000
48,000
Single-copy orthologs
Multiple-copy orthologs
Unique paralogs
Other orthologs
Unclustered genes
Number of genes
0
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2
Substitutions per synonymous site (Ks)
Percentage of gene pairs
All
Tandem
Corrected
b
c
d
a
Figure1. Overview for evolutionary analysis of A. konjac. (a) Images of the sequenced A. konjac. (b) Ortholog
clustering analysis of the protein-coding genes in the A. konjac genome. (c) Phylogenetic tree and divergence
time of A. konjac and four plant species. Phylogenetic tree was generated from the single-copy orthologs using
the maximum-likelihood method. e divergence time range is shown by red blocks. e predicted divergence
time is shown as number inside the pink blocks. e pie charts show the proportion of expanded/contracted
gene families in each plant species. (d) Distribution of substitutions per synonymous site (Ks) in A. konjac.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
3
Vol.:(0123456789)
Scientic Reports | (2023) 13:22684 | https://doi.org/10.1038/s41598-023-49963-9
www.nature.com/scientificreports/
Results
Genome assembly and annotation
e DNA sequencing data (1119.58Gb, average 110× coverage) of the A. konjac sample were obtained using the
Illumina Hiseq 2500 sequencer. A summary of the sequence data used for the assembly is presented in TableS1.
e estimated genome size is 4,512,012,462bp using 19-mer frequency distribution based on the paired-end
sequenceing data (Fig.S1), which is consistent with measurement by ow cytometry (Fig.S2). Based on the
Illumina sequencing data, 2.99Gb contigs were assembled using SOAPdenovo223 (TableS2). Aer construct-
ing scaolds and lling gaps, the 4.58Gb A. konjac reference genome was assembled, and this resulted in the
7,423,768 scaolds with a scaold N50 of 3212bp (Tables1, S2). e A. konjac genome shows signicant genomic
synteny with S. polyrhiza. e assembly performed in this study captured 75.81% (188 of 248) of core eukaryotic
genes (TableS3) and captured 624 complete BUSCOs v5.2.2 (TableS4) using core eukaryotic genes mapping
approach soware (CEGMA) and BUSCO soware24, respectively.
Combination of de novo prediction and homology-based search resulted in identication of 3,289,511,160bp
repetitive elements in A. konjac genome (TableS5), make up about 71.75% of the assembled genomes (TableS5).
Most of the repeats were de novo predicted (70.98%), the repeats detected by homologous method were relatively
few (TableS5). Among the repeats in the A. konjac genome, 69.16% were transposable elements (TEs), of which
52.06% were long terminal repeats (LTR), including 31.42% Gypsy LTRs and 11.6% Copia LTRs (TableS6).
A total of 39,241 protein-coding genes were predicted in assembled genomes following a combination of
homology and abinitio methods, with an average coding length of 1372.75bp and a mean of 2.29 exons per
gene, respectively (Table1, Fig.S3, TableS7), the gene number and average gene length is close to that of S.
polyrhiza and the average gene is longer than that of Oryza sativa and Zea mays (Fig.S4, TableS7). Moreover,
an average of 92.22% of the RNA sequencing (RNA-seq) reads of the four A. konjac tissues (leaf, stem, root and
tuber) could be mapped to the genome. In addition, 65.26% of the predicted genes (25,725/39,241) showed
expression levels (FPKM > 0.05) by aligning leaf, stem, root and tuber RNA-seq data to the set of protein-coding
genes using TopHat225, and estimating expression values based on the resulting alignments using Cuinks26. In
total, 26,456, 26,512, 25,797 and 33,715 of the predicted genes were assigned with a functional annotation in the
Swiss-Prot, KEGG, InterProScan, and Trembl databases, respectively (TableS8), a total of 34,126 of the predicted
genes (87%) were assigned with a functional annotation in at least one database (TableS8).
An overview of annotated ncRNA is shown in TableS9. 1078 miRNAs, 761 tRNAs, 2894 rRNAs and 1553
snRNAs were predicted in A. konjac.
Table 1. Summary of genome assembly and annotation.
Assembly
Assembled genome size (bp) 4,584,988,971
Genome-sequencing depth (×) 244.18
No. of scaolds 7,423,768
N50 of scaolds (bp) 3212
Longest scaold (bp) 85,347
GC content of the genome (%) 45.71
N length (bp) 887,681,325
Annotation
Percentage of repeat sequences (%) 71.75
Repeat sequence length (bp) 3,289,511,160
No. of predicted protein-coding genes 39,241
Percentage of average gene length (bp) 1,372.75
Average exon length (bp) 257.08
Average exon per gene 2.29
Total intron length (bp) 30,870,726
tRNAs 761
rRNAs 2894
snRNAs 1553
miRNAs 1078
Family number 13,190
Genes in families 22,730
Average genes per family 1.72
Unique families 3001
Un-clustered genes 16,691
Content courtesy of Springer Nature, terms of use apply. Rights reserved
4
Vol:.(1234567890)
Scientic Reports | (2023) 13:22684 | https://doi.org/10.1038/s41598-023-49963-9
www.nature.com/scientificreports/
Gene family cluster
Based on pair-wise protein sequence similarity, the gene family clustering analysis of ve species genes, Z.
marina, O. sativa, S. polyrhiza, Z. mays and A. konjac has been carried out. A total of 22,730 genes in A. konjac
were clustered into 13,190 gene families, however, A. konjac has 16,691 unclustered genes and 3001 unique gene
families (Table1,Fig.1b, Fig.S5A, TableS10), that is more than other four species, and the number of single-
copy orthologs genes in A. konjac is 4509. e Venn diagram (Fig.S5a) shows that ve species share a common
core set of 6438 gene families.
e unique gene families in A. konjac were enriched in nucleobase-containing compound biosynthetic
process, nucleobase-containing compound catabolic process, regulation of nucleobase-containing compound
metabolic process, aromatic compound biosynthetic process, heterocycle catabolic process, negative regula-
tion of growth, 1,3-beta--glucan synthase complex, cytoskeleton organization, membrane, molecular function
regulator, peptidase regulator activity, 1,3-beta--glucan synthase activity and so on (Fig.S5B). Moreover, the
unique gene families containa large number of unique paralogous genes (7847 genes) that are not orthologous
to any known genes in other four species, which were enriched in 1,3-beta--glucan synthase complex, a series
of related components of vesicle membrane and so on in cellular component. e 1,3-beta--glucan synthase
complex can catalyse the transfer of a glucose group from UDP-glucose to a (13)-beta--glucan chain, which
may be related with the high starch content in tuber and the fast-growing trait in A. konjac.
Evolution, expansion and contraction
To systematically study the evolutionary dynamics of Alismatales species, species phylogeny was performed
utilizing single-copy orthologous genes among ve species, which included 4509 single-copy orthologous genes
in A. konjac. As illustrated in Fig.1c, the estimated divergence time is 130.7 (124.6–139.9) million years ago
(MYA) between Alismatales and Poaceae, Araceae and Zosteraceae separated at about 124.6 (115.3–131.9) MYA,
the divergence time is 86.2 (78.2–96.0) MYA between S. polyrhiza and A. konjac (Fig.1c). is result based on
genomic data will provide a phylogenetic framework for interpreting the evolutionary events of the family.
Comparative analysis of the gene family expansion and contraction showed that 1647 gene families have
expanded and 2685 contracted in the A. konjac genome (Fig.1c). Based on the InterProScan functional annota-
tion, the expansive genes in A. konjac were enriched in iron coordination entity transport, vitamin E metabolic
process, vitamin E biosynthetic process, cofactor transport, heme transport and so on in the biochemical pro-
cesses (p-value < 0.05) (Fig.S6). Furthermore, the gene families that had undergone contraction in A. konjac were
enriched in reproduction, pollination, pollen-pistil interaction, multi-sprout formation, reproductive process,
cell recognition and various biochemical processes (p-value < 0.05) (Fig.S7), which may suggest that the mode
of reproduction is asexual reproduction principally in A. konjac, and the occurrence of sexual reproduction
needs particular conditions.
Whole-genome duplication (WGD) followed by gene loss has been found in most eudicots and is regarded
as the major evolutionary force that gives rise to gene neofunctionalisation in both plants and animals27. Syn-
onymous substitution rates showed a unimodal distribution, indicating that the WGD of A. konjac occurred
recently (Fig.1d), it needs better reference genome to identify that whether or not it corresponds to the SP/
βSP WGDs in Alismatales20.
Detection of positively selected genes
Positive selection was proposed to contribute to tness. Respectively 686 and 122 genes of A. konjac were deter-
mined as positively selected genes and compared with S. polyrhiza and Z. marina (TablesS11, S12). GO enrich-
ments showed that more positively selected genes in A. konjac in comparison with S. polyrhiza were involved in
RNA biosynthetic process, regulation of biosynthetic process, regulation of gene expression, protein modication
process, cell wall organization or biogenesis, transcription, DNA-templated cell synthesis, cell growth and so
on (Fig.S8). Moreover, the positively selected genes in A. konjac were more involved than those in Z. marina in
leucine biosynthetic process, regulation of signal transduction, regulation of cell communication, regulation of
signaling, regulation of response to stimulus and so on (Fig.S9).
Analysis of transcription factor families
Transcription factors regulate gene expression and protein kinases regulate cellular activities by phosphorylat-
ing target proteins in response to internal or external signals. is study identied a total of 1275 transcription
factors and 345 transcriptional regulators in A. konjac (TableS13). e number of transcription factors in A.
konjac is more than in S. polyrhiza (1115 genes), and the number of transcriptional regulators in A. konjac is
more than in both, S. polyrhiza and Z. marina (271 and 307 genes, respectively), but fewer than that in maize
(573 genes). e AP2/ERF-ERF, GRAS, HSF, SBP, ULT transcription factors are more abundant in A. konjac in
comparison with S. polyrhiza and Z. marina, as well as the AUX/IAA, mTERF, and SNF2 transcriptional regula-
tors. is dierence may be caused by dierent growth environment, A. konjac is a terrestrial plant, while other
two are hydrophilous plants. In addition, the number of BBR-BPC and ULT genes in A. konjac is higher than in
maize. In co-transfection experiments, BBR activates (GA/TC)-containing promoters27, and its overexpression in
tobacco leads to a pronounced leaf shape modication28. In Arabidopsis, the ULTRAPETALA1 (ULT1) gene is a
key negative regulator of cell accumulation shoot and oral meristems, and the mutations in ULT1 can cause the
enlargement of inorescence and oral meristems, the production of supernumerary owers and oral organs,
and a delay in oral meristem termination, downregulation of both ULT genes can lead to shoot apical meristem
arrest shortly aer germination, revealing a requirement for ULT activity in early development29.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
5
Vol.:(0123456789)
Scientic Reports | (2023) 13:22684 | https://doi.org/10.1038/s41598-023-49963-9
www.nature.com/scientificreports/
Contractive cellulose and lignication synthesis genes
Amorphophallus konjac is a lodging plant a trait that is consistent with a reduction of genes involved in cell wall
biosynthesis and lignication. According to InterProScan annotation, 50 cellulose synthase (CesA) and cellu-
lose synthase-like (Csl) genes were identied in A. konjac (Table2), which is obviously fewer than in the woody
bamboo species. Lignin, a major component of secondary cell wall, plays an important role for support, water
transport and stress responses in vascular plants19. A total of 20 genes involved in the lignin biosynthesis pathway
were detected in A. konjac (Table2), which contained 6 lignin biosynthesis gene families out of 10 families (PAL,
4CL, HCT, CCR , F5H, CAD but not C4H, C3H, CCoAMT, COMT). Overall, the absolute copy number of both
cellulose- and lignin-related genes decreased in A. konjac compared with woody species. e expression of CesA
and Csl genes also showed two dierent proles (Fig.2a), of which the expression of most genes (Cluster I and
Cluster II) was higher in tuber, bre and stem than in leaf, and expression of six genes (cluster III) were higher
in leaf than in tuber, bre and stem. For the expressed prole of lignin-related genes, the leaf and stem showed
distinct dierence against bre and tuber (Fig.2b).
Tuber formation and development genes
Sucrose metabolism is considered important for the development of a plant sink organ. In most plants, assimi-
lated carbon in source leaves is transported as sucrose into sink organs, including roots, tubers, fruit, and seeds30.
e present study investigated the genes related to starch and sucrose metabolism pathway and found that the
expressed prole of most genes in bre and tuber showed distinct dierence against the leaf and stem, which
were consistently high expression (Fig.3, TableS14). To utilise sucrose, this bond should be cleaved to generate
the two monosaccharides. Sucrose synthase (SUS) is the key enzyme that catalyzes both the synthesis and the
cleavage of sucrose30. SUS is a glycosyl transferase, which converts sucrose into UDP-glucose and fructose in
the presence of uridine diphosphate (UDP). SUS shows consistently high expression patterns in bre and tuber,
whereas low expression was observed in leaf and stem (Fig.3). On the other hand, SPS plays a major role in
photosynthetic sucrose synthesis by catalysing the rate-limiting step of sucrose biosynthesis from UDP-glucose
and fructose-6-phosphate. e expression of sucrose-phosphate synthase (SPS) gene was higher in leaf (Fig.3),
which was consistent with the role played as a limiting factor in the export of photoassimilates out of the leaf.
ese results suggest that sucrose synthase specically facilitates the storage and maturation of sinks.
Sucrose generated from photosynthates in source organs is transported to sink organs and is then converted
into starch. Plants store sugar as polymerised starch, enabling the storage of a larger amount of sugar without
problems caused by osmotic pressure30. In A. konjac, the starch synthase (glgA), granule-bound starch synthase
(WAXY), and glucose-1-phosphate adenylyltransferase (glgC) showed high expression patterns in bre and tuber
(Fig.3), which catalyse precursor substances to synthesise starch. Specially, the expression of 1,4-alpha-glucan
branching enzyme (GBE1) gene was slightly higher in leaf when comparing the three tissues. GBE catalyzes
the formation of α-1,6 branching points in starch and plays a key role in synthesis31. In general, starch synthe-
sized and accumulated directly from the products of photosynthesis in the leaf during the daytime, and is then
degraded into sugars as an energy source for the following night32. erefore, the high expression of GBE1 in
leaf may be related to the synthesis of starch through photosynthesis.In addition, 59 putative genes involved in
Table 2. Copy number variations of cellulose synthase (CesA), cellulose synthase-like (Csl), and lignication
synthesis related genes between 12 plants. PAL: Phenylalanine ammonia lyase; C4H: Cinnamate-4-
hydroxylase; C3H: ρ-Coumaroyl 3-hydroxylase/Coumaroyl 3-hydroxylase; 4CL: 4-Coumarate CoA Ligas;
HCT: Hydroxycinnamoyl-CoA: shikimate/quinate hydroxycinnamoyltransferase; CCR: Cinnamoyl-CoA
reductase; CCoAOMT: Trans-caeoyl-CoA 3-O-methyltransferase; CAD: Cinnamyl alcohol dehydrogenase;
F5H: Ferulate 5-hydroxylase; COMT: Caeic acid 3-O-methyltransferase. Akon: Amorphophallus konjac; Bam:
Bonia amplexicaulis; Ped: Phyllostachys edulis; Ola: Olyra latifolia; Rgu: Raddia guianensis; Bdi: Brachypodium
distachyon; Osa: Oryza sativa; Zma: Zea mays; Sbi: Sorghum bicolor; Ath: Arabidopsis thaliana; Ptr: Populus
trichocarpa; Spir: Spirodela polyrhiza. a Data from Guo etal.70. b data from Wang etal.20.
Akon BamaPedaOlaaRguaBdiaOsaaZmaaSbiaAthaPtraSpirb
CesA 15 27 26 12 10 19 11 20 12 10 18 10
Csl 35 55 51 40 35 24 34 33 37 29 37 21
PAL 3 13 8 6 7 8 9 10 9 4 5 3
C4H 0 6 4 1 2 3 4 4 3 1 2 3
4CL 7 11 6 4 5 5 5 3 5 4 5 9
HCT 1 5 4 2 2 2 2 2 2 1 2 20
C3H 0 3 3 1 2 1 2 2 2 3 3 1
CCoAOMT 0 2 2 2 1 1 1 2 1 1 2 1
CCR 3 7 5 5 2 2 2 1 2 2 7 21
F5H 4 3 3 2 2 2 3 2 2 2 4 3
COMT 0 2 1 2 1 1 1 1 1 1 2 5
CAD 2 2 3 5 1 1 1 1 1 2 1 4
Content courtesy of Springer Nature, terms of use apply. Rights reserved
6
Vol:.(1234567890)
Scientic Reports | (2023) 13:22684 | https://doi.org/10.1038/s41598-023-49963-9
www.nature.com/scientificreports/
Leaf
Stem
Fibr
e
T
uber
C424552781.1
scaffold570138.1
scaffold1428569.1
scaffold1231807.1
scaffold1294268.1
scaffold172367.1
scaffold793354.2
scaffold477935.2
scaffold1416963.1
scaffold778347.1
scaffold567120.2
scaffold477935.1
scaffold442503.1
scaffold1474830.1
scaffold1368005.1
scaffold1333147.1
scaffold1282474.2
C427401444.1
C427193443.1
C427100123.1
C426578615.1
C427076827.1
scaffold560982.1
scaffold662248.1
C426434765.1
scaffold1498400.1
scaffold1178008.1
C426594413.1
C427108391.1
C426912747.1
C427253058.1
scaffold1370363.1
scaffold1378781.1
C426910267.1
scaffold1320605.1
scaffold940359.1
scaffold490911.2
scaffold525577.1
C427166595.1
C427243432.1
scaffold1383231.1
scaffold1457361.1
scaffold1223788.1
scaffold823186.1
scaffold397646.1
scaffold1253996.1
scaffold337472.1
C426321109.1
scaffold470710.1
scaffold470711.1
0
0.5
1
1.5
2
2.5
Cluster I
Cluster II
Cluster III
a
Leaf
Stem
Fibre
Tuber
C427350524.1
scaffold713276.1
scaffold623084.1
scaffold234563.2
scaffold1191157.1
C426494589.1
scaffold1269903.1
scaffold1279511.1
scaffold1325657.1
scaffold1330045.1
scaffold1119499.1
scaffold234563.1
scaffold1488998.1
scaffold649897.1
scaffold1324338.2
scaffold1362070.1
C426742117.1
scaffold1174802.1
scaffold1347659.1
scaffold1235466.1
0
0.
5
1
1.
5
2
2.
5
3
b
Figure2. Heatmaps of gene expression. (a) Heatmap depicting the expressed prole of CesA and Csl genes; (b)
Heatmap depicting the expressed prole of lignin-related genes.
D-Fructose-6P
D-Fructose-6P
UDP-glucose
Sucrose
Sucrose-6P
α-D-Glucose-1P
D-Glucose-6P
Celldextrin
ADP-glucoseAm
y
lose
Dextrin
D-Glucose
Trehalose-6Po Trehalose
Cellulose
Cellobiose
β-D-Glucoside
1,3-β-Glucan
Sucrose (extracellular)
D-Fructose
D-Glucose-6P
Maltose
Trehalose (extracellular)
PYG
SUS
SPS
GBE1
malQ
HK
HK
FRK
glgC
AMY
AMY
BMY
BMY
EG
EG
malZ
malZ
bglU
INV
INV
ENPP1_3
GPI
pgm
bglX
bglX
bglB
bglU
bglX
bglB
SPP
WAXY
glgA
TPS
otsB
TPS
GN
Fibre
Tuber
Leaf
Stem
0
2
4
6
8
10
D-Glucose
bglU
bglB
Starch;Glycogen
SPS
WAXY
EG
GN
Figure3. e expression proles in FPKM (fragments per kilobase per million reads mapped) of genes
involved in the starch and sucrose metabolism pathway in the four tissues (tuber, bre, stem and leaf) from
7-month-old plant of A. konjac. Data are plotted as log10 values. PYG: glycogen phosphorylase; SUS: sucrose
synthase; GBE1: 1,4-alpha-glucan branching enzyme; glgA: starch synthase; malQ: 4-alpha-glucanotransferase;
HK: hexokinase; FRK: fructokinase; glgC: glucose-1-phosphate adenylyltransferase; otsB: trehalose 6-phosphate
phosphatase; AMY: alpha-amylase; BMY: beta-amylase; EG: endoglucanase; malZ: alpha-glucosidase; bglU:
beta-glucosidase; INV: beta-fructofuranosidase; ENPP1_3: ectonucleotide pyrophosphatase/phosphodiesterase
family member 1/3; GPI: glucose-6-phosphate isomerase; pgm: phosphoglucomutase; bglX: beta-glucosidase;
bglB: beta-glucosidase; SPP: sucrose-6-phosphatase; WAXY: granule-bound starch synthase; TPS: trehalose
6-phosphate synthase/phosphatase; GN: included GN1_2_3 (glucan endo-1,3-beta-glucosidase 1/2/3), GN4
(glucan endo-1,3-beta-glucosidase 4) and GN5_6 (glucan endo-1,3-beta-glucosidase 5/6).
Content courtesy of Springer Nature, terms of use apply. Rights reserved
7
Vol.:(0123456789)
Scientic Reports | (2023) 13:22684 | https://doi.org/10.1038/s41598-023-49963-9
www.nature.com/scientificreports/
the pathway wrere identied (Fig.4) according the previous studies on glucomannan biosynthesis22,33, and most
of them also were highly expressed in bre and tubers.
Discussion
As a major provider of KGM, A. konjac is abundant in southern China and Japan. e dierent species of genus
Amorphophallus show high genetic diversity. A. konjac is classied as a species with high KGM content. Its tubers
contain between 40 and 70% KGM33. In the natural habitat, fruiting eciency of A. konjac is less than 1% through
sexual reproduction. Although breeding strategies for A. konjac comprise asexual and sexual reproduction, sexual
reproduction happens on the condition of cross-pollination. Increasingly agricultural studies reported that special
structure of inorescence in A. konjac can facilitate the cross-pollination process and possibly increase diversity
of KGM-biosynthetic gene pool. However, genomic background of many traits of A. konjac is little known.
Here, we report the earliest sequenced A. konjac genome, which was sequenced by our research team in 2018
and uploaded to the ncbi database. e genome assembly of A. konjac exhibited a total size of 4.58Gb, which
was smaller than the another genome of A. konjac (5.60Gb) was assembled by Gao etal. using a combination of
Illumina, PacBio, and Hi-C technology22. Meanwhile, Gao etal. also identied 80.6% of the assembled sequences
as repetitive sequences, and 75.6% were transposable elements (TEs)22. Among the various TEs, long terminal
repeats (LTRs, 74.04%), especially Gypsy (40.28%) and Copia (9.58%) type, were remarkably prevalent in the
genome22. Nevertheless, we found that A. konjac genome comprised of 71.75% repeat sequences and 69.16%
were TEs, including 31.42% Gypsy LTRs and 11.6% Copia LTRs. A potential reason for the smaller genome size
and fewer repetitive sequences may be related to the second-generation sequencing data used in the present
study. e second-generation sequencing technologies are dicult to get the large repetitive sequences and lead
to incomplete assemblies34,35. Strong correlation between genome size and the proportion of TEs (especially
LTR-Copia and LTR-Gypsy) has been reported in many studies34,36, 37. In addition, previous studiep also found
that the A. konjac and the S. polyrhiza shared a recent WGD event, which is consistent with the results of this
study21. is study employed the genome analysis to characterise genetic traits of A. konjac. e results implied
that A. konjac possesses 3001 unique families and 4509 single-copy orthologs in a total of 13,190 identied
genes in comparison with the other four species (Z. marina, O. sativa, S. polyrhiza and Z. mays). In addition,
time-tree based on phylogenetic analysis showed that a more closely genetic relationship was found between S.
polyrhiza and A. Konjac (divergent time, 86.2 million years) than another three species (divergent time, over 100
million years between A. konjac and Z. marina, O. sativa and Z. mays). Moreover, the data of this study further
illustrated that some contracted genes in A. konjac genome are involve in pollination, pollen-pistil interaction
and reproductive process, which may oer genomic hints for sexual reproduction of A. konjac.
Positive selection was proposed to contribute to tness. e ratio of non-synonymous to synonymous sub-
stitutions (Ka/Ks), is widely used for the estimation of positive selection at the amino-acid site38. Analysis of the
ratios of Ka/Ks between Chrysanthemum morifolium and C. boreale two Chrysanthemum species, indicating
that 107 genes experienced positive selection, with Ka/Ks more than one, which may have been crucial for the
adaptation, domestication, and speciation of Chrysanthemum39. In current study, we identied 625 and 111
genes in A. konjac were detected under positive selection compared to S. polyrhiza and Z. marina, respectively.
Enrichment analysis suggested that those genes under positive selection are involved in biosynthetic process of
RNA and other organic substances, regulatory process of biogenesis, cellular organization and cell growth. ese
results support the fact that diverse genes were under positive selection in A. konjac, which might inuence the
FRK
Starch
GDP-mannose
UGP
Glucomannan
CSLA
CSLD
Sucrose
glucose
HXK
UDP-glucose
SUS
fructose
HXK
Fructose-6-
phospate
glucose-6-
phospate
glucose-1-
phospate
PGI PGM
Mannose-6-phospate
PMI
PMM
Mannose-1-phospate
GMPP
GDP-glucose
SS
ADP-glucose
AGP
Amylose
SBE
GMPP
INV
-1.5
-1.0
-0.5
0.0
0.5
1.0
L
S
F
T
L
S
F
T
L
S
F
T
L
S
F
T
L
S
F
T
L
S
F
T
L
S
F
T
L
S
F
T
L
S
F
T
L
S
F
T
L
S
F
T
L
S
F
T
L
S
F
T
L
S
F
T
L
S
F
T
Figure4. Putative biosynthetic pathway of KGM. SUS: Sucrose synthase, INV: invertase, PGI: phosphoglucose
isomerase, PGM: phosphoglucomutase\, PMI: phosphomannose isomerase, PMM: phosphomannomutase,
SS: starch synthase, GMPP: GDP-mannose pyrophosphorylase, UGP: UDP-glucose pyrophosphorylase, AGP:
ADP-glucose pyrophosphorylase, FRK: fructokinase, HXK: hexokinase, SBE: starch branching enzyme, CSLA:
cellulose synthase-like A, CSLD: Cellulose synthase-like D. L: Leaf, S: Stem, F: Fibre, T: Tuber.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
8
Vol:.(1234567890)
Scientic Reports | (2023) 13:22684 | https://doi.org/10.1038/s41598-023-49963-9
www.nature.com/scientificreports/
adaptation and evolution of A. konjac. Some genes under positive selection can be used as potential biomarkers
for breeding outcrossing species. So far, asexual reproduction of tubers is widely used for breeding A. konjac in
traditional agriculture. However, many problems are related to asexual breeding process, such as low breeding
eciency, long cultivation cycle, high risk of infectious diseases, and breeding degeneration. Genome analysis
in the present study partially demonstrates evolutionary scenario of A. konjac undergoing articial breeding,
and helps to screen outcrossing populations with high KGM content.
Additionally, the analysis of the data collected in the present study suggested that a total of 20 genes were
observed to act in biosynthetic pathways of lignin, which might help cells of A. konjac adapt in habitats suitable
for fast-growing.
Over a few decades, puried KGM from tubers of A. konjac, a dietary bre composed of hydro-colloidal
polysaccharide, was used widely as food additive as well as dietary supplement in many countries. Results
from nutritional studies indicated that KGM can decrease the levels of triglycerides, glucose, cholesterol, and
blood pressure, and prevent many chronic diseases through wide-ranging regulation of metabolism40. Other
studies suggested that KGM content over 50% dry matter should be used to obtain high-purity glucomannan
for development of additives and supplements since high-purity glucomannan can easily form transparent and
odourless gel with high viscosity. e cultivated A. konjac was reported to be major source of high KGM content
material (KGM content over 45% dry matter). Apart from environmental factors and cultivation conditions,
genetic factors are presumed to contribute to productive eciency of high KGM content. However, it is still not
clear which genes of A. konjac genome are involved in regulatory process of KGM biosynthesis in tubers. In
this study, genomic and transcriptomic analysis has been applied to characterise the metabolic process of starch
and sucrose in A. konjac. Previous studies have demonstrated that polysaccharide metabolism is essential both
for formation of tuber sink and biosynthetic source of KGM in A. konjac. Transcriptomic analysis of A. konjac
in the present study suggested that expression patterns of starch and sucrose metabolism diered between
tubers and leaf or stem, and sucrose metabolism related genes maintained consistently higher expression level in
tubers than in leaf and stem. For example, starch synthase (glgA), granule-bound starch synthase (WAXY), and
glucose-1-phosphate adenylyltransferase (glgC) are more expressed in tubers and bres than in leaf and stem.
Previously, some physiological tests suggested the role of sucrose-phosphate synthase (SPS) as exporting factor
of photoassimilates out ohe leaf. Down regulation of SPS can specically help A. konjac facilitate storage and
maturation of polysaccharides in tubers. e ndings in the present study partially clarify versatile functions
of polysaccharide metabolism specic to tubers of A. konjac, and thus potentially help to study biosynthetic
mechanism of formation of KGM.
Conclusions
In this study, we sequenced, assembled, annotated, and analysed the genome of the A. konjac, which belongs to
the genus Amorphophallus of the family Araceae. e 4.58Gb A. konjac genome encoded 39,421 protein-coding
genes and 3,289,511,160bp repetitive sequences, accounting for 71.75% of the genome sequences. Whole-genome
duplication event has been observed within the A. konjac genome. In addition, the sequencing of A. konjac
genome revealed the evolution and the gene expressed dierence in tuber formation and provided a genomic
resource for further study of Amorphophallus genus. Comparative genomics analyses identied the contraction
of gene families associated with reproduction and also genes related with cellulose and lignication synthesis.
e knowledge of the genomic sequences may help in improvement of A. konjac germplasm and facilitate further
studies on KGM synthesis.
Methods
DNA isolation and sequencing
Amorphophallus konjac was obtained from the Daguan county (one of the main plantation areas of A.konjac in
Yunnan), and cultivated in the glasshouse of Kunming University in Yunnan. Fresh leaves were collected from
mature A. konjac plants and frozen in liquid nitrogen. en genomic DNA was extracted from leaves using the
cetyltrimethylammonium bromide (CTAB) method41. e integrity of the extracted DNA was checked by 0.75%
agarose gel electrophoresis. e quantity and quality of the DNA were detected using a NanoDrop ND-2000
(NanoDrop products, Wilmington, DE, USA) and Qubit 2.0 Fluorometer (Invitrogen Ltd, Paisley, UK). Paired-
end libraries with insert sizes of 325bp, 434bp, 529bp, and 647bp were constructed using NEBNext Ultra II
DNA Library Prep Kit for Illumina (NEB, USA), and mate pair libraries with insert sizes of 3kb, 7kb, 12kb,
and 16kb were constructed using Illumina Nextera Mate Pair Library Preparation Kit (Illumina, USA). All the
constructed libraries were sequenced on a NovaSeq platform (Illumina, USA) using PE-150 module. In total,
about 1119.58Gb of data were generated on Illumina platforms.
All reads were preprocessed for quality control and ltered using the in-house Perl script. e raw data were
ltered by removing reads with more than 5% N or more than 40bp low-quality bases called below Q30. e
redundant reads resulting in duplicate base calls were ltered; only one copy of any duplicated paired-end reads
was retained. e yielded clean data were used for de novo assembly.
Genome size estimation
Before genome assembly, we used Illumina short reads to estimate the genome size using a k-mer based method.
An optimal k-mer value was obtained by Jellysh42, and genome size was estimated using GenomeScope v2.043
based on the 19-mer frequency distribution data. A 19-mer was the k-mer length recommended for use with the
GenomeScope 2.0 program and was not adjusted because we had high coverage and a low error rate. e genome
size was also estimated by ow cytometry using Z. mays as internal standard and propidium iodide as the stain.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
9
Vol.:(0123456789)
Scientic Reports | (2023) 13:22684 | https://doi.org/10.1038/s41598-023-49963-9
www.nature.com/scientificreports/
Assembly
e ltered reads were used to perform assembly with SOAPdenovo223 developed by BGI. First, the contigs
were constructed with k-mer = 47 using pair-end data, and the scaolds were assembled with k-mer = 33 using
both mate-pair and pair-end data. e nal assembly was generated aer gap lling with Gapcloser v1.12 in
SOAPdenovo package23.
Repeats annotation
First, the research team searched for tandem repeats across the genome using the program Tandem Repeat Finder
(TRF)44. e transposable elements (TEs) in the genome were identied by a combination of homology-based
and de novo approaches. For homolog-based prediction, known repeats were identied using RepeatMasker45
and RepeatProteinMask45 against Repbase16.1046. RepeatMasker was applied for DNA-level identication using
a custom library. At the protein level, RepeatProteinMask was used to perform an RMBLAST search against the
TE protein database. For de novo prediction, RepeatModeler (http:// repea tmask er. org/) and LTR FINDER47
were used to identify de novo evolved repeats inferred from the assembled genome.
Gene prediction and functional annotation
e research team employed EVidence Modeler (EVM)48 to consolidate RNA-seq, protein alignments with
abinitio gene predictions and homologous method annotation into a nal gene set. For transcriptome, reads
were cleaned with Trimmomatic Version 0.3249. is step removed reads containing adapter, reads containing
poly-N and low-quality reads from the raw data and yielded clean data for downstream analysis. en, the reads
were aligned to the genome with HISAT2 Version: 2.1.050. Alignments were then assembled independently with
StringTie Version: v1.3.3b51. Protein sequences of ve plant species: Arabidopsis thaliana52, Oryza sativa53, Zea
mays54, Zostera marina21 and Spirodela polyrhiza20 were used for the homology-based method. First, the tblastn
was performed with e-value cuto 1e-5, blast hits with low quality in the genome were discarded. en predicted
regions were extended by 2000bp both upstream and downstream, and aligned against protein sequence using
GeneWise55 to identify gene structure. e soware AUGUSTUS56, GenScan57, GlimmerHMM58 and SNAP59
were used for abinitio gene prediction, AUGUSTUS and GenScan prediction used the gene model parameters
trained on maize, but GlimmerHMM and SNAP prediction used gene model parameters trained on rice. All
lines of evidence were then fed into EVM using intuitive weighting (RNAseq > cDNA/protein > ab initio gene
predictions).
Gene functions were assigned according to the best match alignment using Blastp against Swiss-Prot,
TrEMBL and KEGG databases. InterProScan functional analysis and Gene Ontology IDs were obtained using
InterProScan60.
e GO enrichment was done with Ontologizer 2.061 by using one-sided Fisher’s exact test, the Parent–Child-
Union method, with a p-value cut-o of 0.05.
Genes related to cellulose synthase (CesA), cellulose synthase-like (Csl) were identied according to the
InterProScan annotation, and the genes related to phenylpropanoid-lignin biosynthesis and starch and sucrose
metabolism pathway were identied according to the KEGG annotation. Furthermore, the genes with alignment
hits covering over 200 amino acids and at least 50% protein sequence identity were considered to be candidate
genes.
Non-coding gene annotation
Soware tRNAscan-SE62 is specied for Eukaryotic tRNA and was deployed for tRNA annotation. e research
team used homologous method to identify rRNA. e rRNA sequence data downloaded from Rfam database63
was used as a reference. INFERNAL64 was used to identify snRNA.
Gene family cluster
To identify dierent sets of gene clusters, protein-coding genes sequences of O. sativa53, Z. mays54, Z. marina21and
S. polyrhiza20 were used to locate gene clusters. Aer pairwise aligning using Blastp with an e-value cuto of 1e-5
had been conducted, OrthoMCL package65 was performed to identify the gene family clusters using the Blastp
output with default parameters, nal paralogous and orthologous genes were dened using MCL soware in
OrthoMCL.
Phylogenetic tree construction
Single-copy orthologous genes dened by OrthoMCL65 were formed, and then multiple single-copy genes were
aligned using Muscle66 and the aligned sequences were extracted to feed to MrBayes (http:// mrbay es. sourc
eforge. net) to infer the species phylogeny using a maximum likelihood (ML) approach under the best-t model
GTR + G from ModelFinder. Z. mays and O. sativa were used as outgroups. To estimate the divergence time of
each species, the information about the already known divergence time data between these species from http://
www. timet ree. org/ were collected. e topology of the ML tree was fed to MCMCTREE in paml version 4.467 for
constructing a divergence time tree and calculate the divergence time. Based on the calculated phylogeny and
the divergence time, CAFÉ (Computational Analysis of Gene Family Evolution, version 2.1)68, a tool based on
the stochastic birth and death model for the statistical analysis of the evolution of gene family size, was applied
to identify gene families that had undergone expansion and/or contraction.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
10
Vol:.(1234567890)
Scientic Reports | (2023) 13:22684 | https://doi.org/10.1038/s41598-023-49963-9
www.nature.com/scientificreports/
Detection of positively selected genes
To detect genes under positive selection, Blastn was performed to align the coding sequence (CDS) libraries of
Z. marina21and S. polyrhiza20 against the A. konjac CDS library, respectively, in order to nd the gene pairs with
the best alignments. e resulting orthologous gene pairs were aligned again with the default parameters as a
preparation for KaKs_Calculator 1.269 which nally yielded a dataset of each gene pair’s Ka/Ks ratio, and the Ka/
Ks ratio > 1 was dened as a positively selected gene (signicance, P-value < 0.05).
RNA-seq
Four tissues (namely tubers, bres, stems and leaves) of A. konjac were harvested from the same 7-month-old
plant, and three biological replicates for each tissue of living plants were collected. Total RNA was extracted
from these tissues using the RNAprep pure plant kit (Tiangen). 3g of total RNA per sample were used as input
material for the RNA sample preparation. Beads with oligo (dT) were used to isolate poly (A) mRNA from total
RNA. RNA sequencing libraries were constructed from these mRNA using the TruSeq RNA Sample Preparation
Kit (Illumina, San Diego, USA). Briey, the Elution 2-Frag-Prime (94°C for 8min, 4°C hold) was used to elute,
fragment and prime the mRNA with Elute, Prime, Fragment Mix (Illumina). First strand cDNA synthesis was
performed with First Strand Master Mix and SuperScript II mix (ratio: 1l SuperScript II/7l First Strand Master
Mix) (Invitrogen). e second strand was synthesized with Second Strand Master Mix (Illumina) and Ampure
XP beads (Illumina) were used to separate the double-stranded (ds) cDNA from the 2nd strand reaction mix.
Aer end repair and the addition of a 3’-dA overhang, the cDNA was ligated to Illumina PE adapter oligo mix
(Illumina), and size-selected for 350 ± 20bp fragments by gel purication. Aer 15 cycles of PCR amplication,
the 350bp paired-end libraries were sequenced using the paired-end sequencing module (150bp at each end)
of the Illumina HiSeq 4000 platform.
e corresponding trimmed clean reads were aligned to the related reference genome employing TopHat224
soware with default settings. Calculation of gene expression level was conducted using Cuinks v2.2.125. Frag-
ments per kilobase of exon per million fragments mapped (FPKM) were used to normalize RNA-seq fragment
counts and estimate the relative abundance of each gene. e DEGs were decided based on a P-value < 0.05 and
at least a twofold change between the two FPKMs.
Ethical approval
We conrm that all the experimental research and eld studies on plants (either cultivated or wild), including
the collection of plant material, complied with relevant institutional, national, and international guidelines and
legislation. e tuber of A. konjac was collected from Daguan county, and was cultured in the green house. All
the material is owned by the authors and/or no permissions are required.
Data availability
Accession numbers: e genome sequence of A. konjac has been deposited in DDBJ/EMBL/GenBank nucleo-
tide core database under accession code SUB7124908 (https:// www. ncbi. nlm. nih. gov/ sra/ PRJNA 608095). e
sequencing reads of Illumina sequencing libraries have been deposited under NCBI Sequence Read Archive with
Project ID PRJNA608095. e Project ID of all the RNA-seq data is SRP251185.
Received: 1 June 2023; Accepted: 14 December 2023
References
1. Decaisne, J. Description dúne herbier de li’le de Timor. Nouvelles Annales du Museum d’Histoire Naturelle 3, 333–501 (2004).
2. Banerjee, A. et al. A so computing tool for species classication and prediction of glucomannan content in Amorphophallus genus.
Eng. Life. Sci. 17(12), 1254–1263. https:// doi. org/ 10. 1002/ elsc. 20170 0040 (2017).
3. Chua, M., Baldwin, T. C., Hocking, T. J. & Chan, K. Traditional uses and potential health benets of Amorphophallus konjac K.
Koch ex N.E.Br. J. Ethnopharmacol. 128(2), 268–278. https:// doi. org/ 10. 1016/j. jep. 2010. 01. 021 (2010).
4. Wu, J. P. et al. Amorphophallus konjac anthracnose caused by Colletotrichum siamense in China. J. Appl. Microbiol. 128(1), 225–231
(2020).
5. Nishinari, K. Konjac Glucomannan. Dev. Food Sci. 41, 309–330 (2000).
6. Mekkerdchoo, O. et al. Tracing the evolution and economic potential of konjac glucomannan in Amorphophallus species (Araceae)
using molecular phylogeny and RAPD markers. Phytotaxa 282(2), 081–106 (2016).
7. Shahbuddin, M. et al. High molecular weight plant heteropolysaccharides stimulate broblasts but inhibit keratinocytes. Carbohydr.
Res. 375, 90–99. https:// doi. org/ 10. 1016/j. carres. 2013. 04. 006 (2013).
8. Li, B., Xia, J., Wang, Y. & Xie, B. Grain-size eect on the structure and antiobesity activity of konjac our. J. Agric. Food. Chem.
53(19), 7404–7407. https:// doi. org/ 10. 1021/ jf050 751q (2005).
9. Gille, S. et al. Deep sequencing of voodoo lily (Amorphophallus konjac): An approach to identify relevant genes involved in the
synthesis of the hemicellulose glucomannan. Planta 234(3), 515–526. https:// doi. org/ 10. 1007/ s00425- 011- 1422-z (2011).
10. Zheng, X. et al. Development of microsatellite markers by transcriptome sequencing in two species of Amorphophallus (Araceae).
BMC Genomics 14, 490. https:// doi. org/ 10. 1186/ 1471- 2164- 14- 490 (2013).
11. Diao, Y. et al. De novo transcriptome and small RNA analyses of two Amorphophallus species. PLoS ONE 9(4), e95428. https://
doi. org/ 10. 1371/ journ al. pone. 00954 28 (2014).
12. Hetterscheid, W. L. A. & Ittenbach, S. Everything you always wanted to know about Amorphophallus, but were afraid to stick your
nose into. Aroideana 19, 7–131 (1996).
13. Hetterscheid, W. L. A. Notes on the Genus Amorphophallus (Araceae) 15. New Species from SE Asia. Aroideana 29, 53–79 (2006).
14. Claudel, C. et al. Large-scale phylogenetic analysis of Amorphophallus (Araceae) derived from nuclear and plastid sequences reveals
new subgeneric delineation. Bot. J. Linn. Soc. 184(1), 32–45 (2017).
15. Punekar, S. A. & Kumaran, K. P. N. Pollen morphology and pollination ecology of Amorphophallus species from North Western
Ghats and Konkan region of India. Flora 205, 326–336. https:// doi. org/ 10. 1016/j. ora. 2009. 12. 024 (2010).
Content courtesy of Springer Nature, terms of use apply. Rights reserved
11
Vol.:(0123456789)
Scientic Reports | (2023) 13:22684 | https://doi.org/10.1038/s41598-023-49963-9
www.nature.com/scientificreports/
16. Grob, G., Gravendeel, B., Eurlings, M. C. M. & Hetterscheid, W. L. A. Phylogeny of the tribe thomsonieae (araceae) based on
chloroplast matk and trnl intron sequences. Syst. Bot. 27(3), 453–467. https:// doi. org/ 10. 1043/ 0363- 6445- 27.3. 453 (2002).
17. Grob, G. B. J., Gravendeel, B. & Eurlings, M. C. M. Potential phylogenetic utility of the nuclear FLORICAULA/LEAFY second
intron: Comparison with three chloroplast DNA regions in Amorphophallus (Araceae). Mol. Phylogenet. Evol. 30(1), 13–23. https://
doi. org/ 10. 1016/ s1055- 7903(03) 00183-0 (2004).
18. Sedayu, A., Eurlings, M. C. M., Gravendeel, B. & Hetterscheid, W. L. A. Morphological character evolution of Amorphophallus
(Araceae) based on a combined phylogenetic analysis of trnL, rbcL and LEAFY second intron sequences. Bot. Stud. 51, 473–490
(2010).
19. Wahyudi, D., Azrianingsih, R. & Mastuti, R. Genetic variability of porang populations (Amorphophallus muelleri) in West Java
and Central Java based on trnL intron sequences. J. Biodivers. Environ. Sci. 3, 31–41 (2013).
20. Wang, W. et al. e Spirodela polyrhiza genome reveals insights into its neotenous reduction fast growth and aquatic lifestyle. Nat.
Commun. 5, 3311. https:// doi. org/ 10. 1038/ ncomm s4311 (2014).
21. Olsen, J. L. et al. e genome of the seagrass Zostera marina reveals angiosperm adaptation to the sea. Nature 530(7590), 331–335.
https:// doi. org/ 10. 1038/ natur e16548 (2016).
22. Gao, Y. et al. A chromosome-level genome assembly of Amorphophallus konjac provides insights into konjac glucomannan bio-
synthesis. Comput. Struct. Biotechnol. J. 20, 1002–1011. https:// doi. org/ 10. 1016/j. csbj. 2022. 02. 009 (2022).
23. Luo, R., Liu, B. & Wang, J. SOAPdenovo2: An empirically improved memory-ecient short-read de novo assembler. GigaScience
1, 18. https:// doi. org/ 10. 1186/ 2047- 217X-1- 18 (2012).
24. Simao, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: Assessing genome assembly and
annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212. https:// doi. org/ 10. 1093/ bio in forma tics/ btv351
(2015).
25. Kim, D. et al. TopHat2: Accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome
Biol. 14(4), R36. https:// doi. org/ 10. 1186/ gb- 2013- 14-4- r36 (2013).
26. Trapnell, C. et al. Dierential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cuinks. Nat.
Protoc. 7(3), 562–578. https:// doi. org/ 10. 1038/ nprot. 2012. 016 (2012).
27. Huang, S. et al. Dra genome of the kiwifruit Actinidia chinensis. Nat. Commun. 4, 2640. https:// doi. org/ 10. 1038/ ncomm s3640
(2013).
28. Santi, L. et al. e GA octodinucleotide repeat binding factor BBR participates in the transcriptional regulation of the homeobox
gene Bkn3. Plant J. 34(6), 813–826. https:// doi. org/ 10. 1046/j. 1365- 313x. 2003. 01767.x (2003).
29. Carles, C. C., Chones-Inada, D., Reville, K., Lertpiriyapong, K. & Fletcher, J. C. ULTRAPETALA1 encodes a SAND domain
putative transcriptional regulator that controls shoot and oral meristem activity in Arabidopsis. Development 132(5), 897–911.
https:// doi. org/ 10. 1242/ dev. 01642 (2005).
30. Mitsui, Y. Gene expression proles during tuberous root development. e Radish Genome, 109–119 https:// doi. org/ 10. 1007/
978-3- 319- 59253-4_8 (2017).
31. Jiang, H. et al. Flexible loop in carbohydrate-binding module 48 allosterically modulates substrate binding of the 1,4-α-glucan
branching enzyme. J. Agric. Food Chem. 69(20), 5755–5763. https:// doi. org/ 10. 1021/ acs. jafc. 1c002 93 (2021).
32. Yu, J., Wang, K. & Beckles, D. M. Starch branching enzymes as putative determinants of postharvest quality in horticultural crops.
BMC Plant Biol. 21(1), 479. https:// doi. org/ 10. 1186/ s12870- 021- 03253-6 (2021).
33. Liu, P. Y. et al. Professional Standard for the Classication, Requirements and Test Methods of Konjac Flour, Technical Report NY/T
494 (Chinese Ministry of Agriculture, 2002).
34. Zhang, J. et al. e high-quality genome of diploid strawberry (Fragaria nilgerrensis) provides new insights into anthocyanin
accumulation. Plant Biotechnol. J. 18(9), 1908–1924. https:// doi. org/ 10. 1111/ pbi. 13351 (2020).
35. Michael, T. P. & Van Buren, R. Progress, challenges and the future of crop genomes. Curr. Opin. Plant Biol. 24, 71–81 (2015).
36. Tenaillon, M. I. et al. Genome size and transposable element content as determined by high-throughput sequencing in maize and
Zea luxurians. Genome Biol. Evol 3, 219–229 (2011).
37. Huang, L. et al. Genome assembly provides insights into the genome evolution and owering regulation of orchardgrass. Plant
Biotechnol. J 18, 373–388 (2019).
38. Yadav, I. S. et al. Whole genome resequencing and comparative genome analysis of three Puccinia striiformis f. sp. tritici pathotypes
prevalent in India. PLoS ONE 17(11), e0261697 (2022).
39. Won, S. Y. et al. Comparative transcriptome analysis reveals whole-genome duplications and gene selection patterns in cultivated
and wild Chrysanthemum species. Plant Mol Biol. 95(4–5), 451–546. https:// doi. org/ 10. 1007/ s11103- 017- 0663-z (2017).
40. Devaraj, R. D., Reddy, C. K. & Xu, B. Health-promoting eects of konjac glucomannan and its practical applications: A critical
review. Int. J. Biol. Macromol. 126, 273–281. https:// doi. org/ 10. 1016/j. ijbio mac. 2018. 12. 203 (2019).
41. Doyle, Jj. & Doyle, Jl. A rapid DNA isolation procedure from small quantities of fresh leaf tissues. Phytochem. Bull. 19(1), 11–15
(1987).
42. Marçais, G. & Kingsford, C. A fast, lock-free approach for ecient parallel counting of occurrences of k-mers. Bioinformatics 27(6),
764–770. https:// doi. org/ 10. 1093/ bioin forma tics/ btr011 (2011).
43. Vurture, G. W. et al. GenomeScope: Fast reference-free genome proling from short reads. Bioinformatics 33(14), 2202–2204.
https:// doi. org/ 10. 1093/ bioin forma tics/ btx153 (2017).
44. Benson, G. Tandem repeats nder: A program to analyze DNA sequences. Nucleic Acids Res. 27(2), 573–580. https:// doi. org/ 10.
1093/ nar/ 27.2. 573 (1999).
45. Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr. Protocols Bioin-
form. https:// doi. org/ 10. 1002/ 04712 50953. bi041 0s25 (2009).
46. Jurka, J. et al. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet. Genome Res. 110(1–4), 462–467. https://
doi. org/ 10. 1159/ 00008 4979 (2005).
47. Xu, Z. & Wang, H. LTR_FINDER: An ecient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res.
35(Web Server issue), W265–W268. https:// doi. org/ 10. 1093/ nar/ gkm286 (2007).
48. Haas, B. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced
Alignments. Genome Biol. 9(1), R7. https:// doi. org/ 10. 1186/ gb- 2008-9- 1- r7 (2008).
49. Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: A exible trimmer for Illumina sequence data. Bioinformatics 30(15), 2114–
2120. https:// doi. org/ 10. 1093/ bioin forma tics/ btu170 (2014).
50. Kim, D., Langmead, B. & Salzberg, S. L. HISAT: A fast spliced aligner with low memory requirements. Nat. Methods 12(4), 357–360.
https:// doi. org/ 10. 1038/ nmeth. 3317 (2015).
51. Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33(3), 290–295.
https:// doi. org/ 10. 1038/ nbt. 3122 (2015).
52. Arabidopsis Genome Initiative. Analysis of the genome sequence of the owering plant Arabidopsis thaliana. Nature 408(6814),
796–815. https:// doi. org/ 10. 1038/ 35048 692 (2000).
53. Go, S. A. et al. A dra sequence of the rice genome (Oryza sativa L. ssp. japonica). Science 296(5565), 92–100. https:// doi. org/
10. 1126/ scien ce. 10682 75 (2002).
54. Schnable, P. S. et al. e B73 maize genome: Complexity, diversity, and dynamics. Science 326(5956), 1112–1115. https:// doi. org/
10. 1126/ scien ce. 11785 34 (2009).
Content courtesy of Springer Nature, terms of use apply. Rights reserved
12
Vol:.(1234567890)
Scientic Reports | (2023) 13:22684 | https://doi.org/10.1038/s41598-023-49963-9
www.nature.com/scientificreports/
55. Birney, E., Clamp, M. & Durbin, R. GeneWise and genomewise. Genome Res. 14(5), 988–995. https:// doi. org/ 10. 1101/ gr. 18655 04
(2004).
56. Stanke, M., Steinkamp, R., Waack, S. & Morgenstern, B. AUGUSTUS: A web server for gene nding in eukaryotes. Nucleic Acids
Res. 32, W309–W312. https:// doi. org/ 10. 1093/ nar/ gkh379 (2004).
57. Burge, C. & Karlin, S. Prediction of complete gene structures in human genomic DNA. J. Mol. Biol. 268(1), 78–94. https:// doi. org/
10. 1006/ jmbi. 1997. 0951 (1997).
58. Majoros, W. H., Pertea, M. & Salzberg, S. L. TigrScan and GlimmerHMM: Two open source ab initio eukaryotic gene-nders.
Bioinformatics 20(16), 2878–2879. https:// doi. org/ 10. 1093/ bioin forma tics/ bth315 (2004).
59. Korf, I. Gene nding in novel genomes. BMC Bioinform. 5, 59. https:// doi. org/ 10. 1186/ 1471- 2105-5- 59 (2004).
60. Zdobnov, E. M. & Apweiler, R. InterProScan–an integration platform for the signature-recognition methods in InterPro. Bioin-
formatics 17(9), 847–848. https:// doi. org/ 10. 1093/ bioin forma tics/ 17.9. 847 (2001).
61. Bauer, S., Grossmann, S., Vingron, M. & Robinson, P. N. Ontologizer 2.0—A multifunctional tool for GO term enrichment analysis
and data exploration. Bioinformatics 24(14), 1650–1651. https:// doi. org/ 10. 1093/ bioin forma tics/ btn250 (2008).
62. Lowe, T. M. & Eddy, S. R. tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic
Acids Res. 25(5), 955–964. https:// doi. org/ 10. 1093/ nar/ 25.5. 955 (1997).
63. Burge, S. W. et al. Rfam 11.0: 10 years of RNA families. Nucleic Acids Res. 41(Database issue), D226–D232. https:// doi. org/ 10. 1093/
nar/ gks10 05 (2013).
64. Nawrocki, E. P. & Eddy, S. R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29(22), 2933–2935. https:// doi.
org/ 10. 1093/ bioin forma tics/ btt509 (2013).
65. Li, L., Stoeckert, C. J. Jr. & Roos, D. S. OrthoMCL: Identication of ortholog groups for eukaryotic genomes. Genome Res. 13(9),
2178–2189. https:// doi. org/ 10. 1101/ gr. 12245 03 (2003).
66. Edgar, R. C. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32(5), 1792–1797.
https:// doi. org/ 10. 1093/ nar/ gkh340 (2004).
67. Puttick, M. N. MCMCtreeR: Functions to prepare MCMCtree analyses and visualize posterior ages on trees. Bioinformatics 35(24),
5321–5322. https:// doi. org/ 10. 1093/ bioin forma tics/ btz554 (2019).
68. De Bie, T., Cristianini, N., Demuth, J. P. & Hahn, M. W. CAFE: A computational tool for the study of gene family evolution. Bio-
informatics 22(10), 1269–1271. https:// doi. org/ 10. 1093/ bioin forma tics/ btl097 (2006).
69. Zhang, Z. et al. KaKs_Calculator: Calculating Ka and Ks through model selection and model averaging. Genomics Proteomics
Bioinform. 4(4), 259–263. https:// doi. org/ 10. 1016/ S1672- 0229(07) 60007-2 (2006).
70. Guo. Z. H. et al. Genome sequences provide insights into the reticulate origin and unique traits of woody bamboos. Mol. Plant.
12(10), 1353–1365. https:// doi. org/ 10. 1016/j. molp. 2019. 05. 009 (2019).
Acknowledgements
is study was supported by Yunnan Province Youth Talent Support Program (Grant No.YNWR-QNBJ-2018-32);
Yunnan Fundamental Research Projects (Grant NO. 202101BA070001-163); Yunnan Education Department
Research Project (Grant No. 2022J0644, 2023J0827); Yunnan Provincial Science and Technology Department
(No. 2019FH001-008, 2019FH001-051). e funders had no role in study design, data collection and analysis,
decision to publish, or preparation of the manuscript.
Author contributions
L.L. and L.Y. conceived the project and its components and wrote the manuscript. W.W., J.L. and J.Z. prepared
the sample material for sequencing. L.L., L.Y., M.Y., X.Y. and R.I. conducted the genome sequencing and assem-
bling. L.L., G.S., L.Y. and M.Y. performed the data analysis. M.Y., R.I. and F.H. prepared the gures and tables.
All authors reviewed the manuscript.
Competing interests
e authors declare no competing interests.
Additional information
Supplementary Information e online version contains supplementary material available at https:// doi. org/
10. 1038/ s41598- 023- 49963-9.
Correspondence and requests for materials should be addressed to G.S.orL.Y.
Reprints and permissions information is available at www.nature.com/reprints.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and
institutional aliations.
Open Access is article is licensed under a Creative Commons Attribution 4.0 International
License, which permits use, sharing, adaptation, distribution and reproduction in any medium or
format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the
Creative Commons licence, and indicate if changes were made. e images or other third party material in this
article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the
material. If material is not included in the article’s Creative Commons licence and your intended use is not
permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from
the copyright holder. To view a copy of this licence, visit http:// creat iveco mmons. org/ licen ses/ by/4. 0/.
© e Author(s) 2023
Content courtesy of Springer Nature, terms of use apply. Rights reserved
1.
2.
3.
4.
5.
6.
Terms and Conditions
Springer Nature journal content, brought to you courtesy of Springer Nature Customer Service Center GmbH (“Springer Nature”).
Springer Nature supports a reasonable amount of sharing of research papers by authors, subscribers and authorised users (“Users”), for small-
scale personal, non-commercial use provided that all copyright, trade and service marks and other proprietary notices are maintained. By
accessing, sharing, receiving or otherwise using the Springer Nature journal content you agree to these terms of use (“Terms”). For these
purposes, Springer Nature considers academic use (by researchers and students) to be non-commercial.
These Terms are supplementary and will apply in addition to any applicable website terms and conditions, a relevant site licence or a personal
subscription. These Terms will prevail over any conflict or ambiguity with regards to the relevant terms, a site licence or a personal subscription
(to the extent of the conflict or ambiguity only). For Creative Commons-licensed articles, the terms of the Creative Commons license used will
apply.
We collect and use personal data to provide access to the Springer Nature journal content. We may also use these personal data internally within
ResearchGate and Springer Nature and as agreed share it, in an anonymised way, for purposes of tracking, analysis and reporting. We will not
otherwise disclose your personal data outside the ResearchGate or the Springer Nature group of companies unless we have your permission as
detailed in the Privacy Policy.
While Users may use the Springer Nature journal content for small scale, personal non-commercial use, it is important to note that Users may
not:
use such content for the purpose of providing other users with access on a regular or large scale basis or as a means to circumvent access
control;
use such content where to do so would be considered a criminal or statutory offence in any jurisdiction, or gives rise to civil liability, or is
otherwise unlawful;
falsely or misleadingly imply or suggest endorsement, approval , sponsorship, or association unless explicitly agreed to by Springer Nature in
writing;
use bots or other automated methods to access the content or redirect messages
override any security feature or exclusionary protocol; or
share the content in order to create substitute for Springer Nature products or services or a systematic database of Springer Nature journal
content.
In line with the restriction against commercial use, Springer Nature does not permit the creation of a product or service that creates revenue,
royalties, rent or income from our content or its inclusion as part of a paid for service or for other commercial gain. Springer Nature journal
content cannot be used for inter-library loans and librarians may not upload Springer Nature journal content on a large scale into their, or any
other, institutional repository.
These terms of use are reviewed regularly and may be amended at any time. Springer Nature is not obligated to publish any information or
content on this website and may remove it or features or functionality at our sole discretion, at any time with or without notice. Springer Nature
may revoke this licence to you at any time and remove access to any copies of the Springer Nature journal content which have been saved.
To the fullest extent permitted by law, Springer Nature makes no warranties, representations or guarantees to Users, either express or implied
with respect to the Springer nature journal content and all parties disclaim and waive any implied warranties or warranties imposed by law,
including merchantability or fitness for any particular purpose.
Please note that these rights do not automatically extend to content, data or other material published by Springer Nature that may be licensed
from third parties.
If you would like to use or distribute our Springer Nature journal content to a wider audience or on a regular basis or in any other manner not
expressly permitted by these Terms, please contact Springer Nature at
onlineservice@springernature.com
... Konjac (Amorphophallus sp.), a shade-tolerant perennial herbaceous tuber plant belonging to the Araceae family of Amorphophallus, has a long history of cultivation in China (Li et al. 2023). 26 konjac species have been discovered and named in China, accounting for 22.6% of the world's konjac species. ...
Article
Full-text available
The karst area has become a high-risk of Cd exposure. Konjac is a tuber vegetable, and tuberous vegetables tend to accumulate heavy metals. However, there is limited research on accumulation, translocation, and assessment of heavy metals in konjac. In this study, soil and Konjac plant samples were collected from a typical karst area to analyze heavy metals. The results showed that The Cd, Cu, and As of soil exceed screening values (GB 15618-2018), and exceeding points were 100%, 70% and 100%, respectively. The concentrations of As, Cu, Ni, and Zn in konjac followed the order: Stem <Tuber < leaf < root < soil. The concentrations of available Cd was much greatest, and BF of Cd in konjac roots were considerably greatest, indicating that Cd was easily absorbed. The Cd and Cr in konjac tubers exceeded food safety standards in China (GB 2762-2017). The HRIs of Cd, As, Pb, Cr, Cu, Ni, and Zn in konjac tuber were 1.156, 5.052, 0.084, 0.007, 0.518, 0.947, 0.180 and 7.943 respectively. There was a significant health risk associated with As and Cd consuming konjac tubers for adult inhabitants. Therefore, heavy metals in konjac tubers should be paid attention should be carried out.
... Previous studies on evolutionary analysis of CDPK gene family members have found that CDPK family members are often divided into four subgroups: Arabidopsis [38], rice [39,40], patchouli [41], and white clover [19,20]]. Similarly, AkCDPK family members were divided into four subgroups. ...
Preprint
Full-text available
Background Konjac is an economically important crop that is often threatened by low temperatures, drought, salt, pests, and diseases, leading to reduced yield and quality. Calcium-dependent protein kinases (CDPKs) play crucial roles in plant growth and stress responses, yet their presence and functions in konjac remain unexplored. This study aimed to identify and analyse the CDPK gene family in the Amorphophallus konjac genome. Results We identified 29 CDPK genes categorised into four subgroups that unevenly distributed across 12 chromosomes. Most AkCDPK genes have undergone purifying selection during evolution. Cis-acting element analysis revealed that these genes were involved in phytohormone induction, defence, stress response, and plant development. Expression analysis indicated tissue specificity and responses to salt, drought, low temperature, and Pcc stress. Moreover, AkCDPK15 was cloned and its physicochemical properties and functions were analysed. We found that the protein encoded by AkCDPK15 is mainly localised on the cell membrane, while a small amount aggregates in the nucleus. This protein has eight potential phosphorylation sites and was found to positively promote drought tolerance by regulating the antioxidant system. Conclusions These findings provide a theoretical foundation for future research on the CDPK gene family’s functions in A. konjac , potentially aiding in the development of stress-resistant konjac varieties.
... Plants of the konjac species (Amorphophallus spp.) are rich in dietary fiber known as konjac glucomannan (KGM), which is currently widely used in food, pharmaceutical health care, and chemical industries. Therefore, konjac is extensively cultivated in China as a cash crop, and China is also the world's largest producer and grower of konjac 26 . Amorphophallus muelleri is a unique species of konjac that can reproduce with seeds (triploid apomixis 2n = 39) 27,28 . ...
Article
Full-text available
Konjac seeds of Amorphophallus muelleri are produced through a unique form of apomixis in triploid parthenogenesis, and typically require a longer maturation period (approximately 8 months). To date, the relevant functions of endophytic microbial taxa during A. muelleri seed development and maturation remain largely unexplored. In this study, we analyzed the functional adaptability and temporal dynamics of endophytic microbial communities during three stages of A. muelleri seed maturation. Through metagenomic sequencing, we determined that the functions of the endophytic microbiome in A. muelleri seeds were driven by the seed maturation status, and the functions of the microbial communities in the seed coats and seeds differed significantly. The species annotation results show that Proteobacteria, Actinobacteria, Ascomycota, and Basidiomycota were the dominant bacterial and fungal communities in A. muelleri seeds at different maturation stages. The KEGG and COG functional gene annotation results revealed that the seed samples during the three maturation stages had higher KO functional diversity than the seed coat samples, and the COG functional diversity of the green and red seed samples was also significantly higher than that of the seed coat samples. At different maturation stages, microbial functional genes involved in energy production and conversion as well as carbon fixation were enriched in the A. muelleri seed coats, while microbial functional genes involved in signal transduction mechanisms, amino acid transport and metabolism, carbohydrate metabolism, and lipid metabolism were more highly expressed in the seeds. Moreover, in the middle to late stages of seed maturation, the microbial functional genes involved in the biosynthesis of resistant compounds such as phenols, flavonoids, and alkaloids were significantly enriched to enhance the resistance and environmental adaptation of A. muelleri seeds. The results verified that the functions of the endophytic microbial communities change dynamically during A. muelleri seed maturation to adapt to the current needs of the host plant, which has significant implications for the exploration and utilization of functional microbial resources in A. muelleri seeds.
... However, A. konjac cultivation has been considerably affected by increasing environmental pollution, which has contributed to extreme temperatures and the greenhouse effect-induced cold damage, water scarcitytriggered drought, and contamination of irrigation water. The main manifestations include reduced resistance of the plants to abiotic stress, slow growth and development, severe sunburn of leaves under high-temperature stress, increased susceptibility to white rot and soft rot diseases, decreased yield, and rotting of tubers during digging, harvesting, and storage [13][14][15][16]. ...
Article
Full-text available
Background Amorphophallus konjac (A. konjac), a perennial tuberous plant, is widely cultivated for its high konjac glucomannan (KGM) content, a heteropolysaccharide with diverse applications. The cellulose synthase-like (CSL) gene family is known to be a group of processive glycan synthases involved in the synthesis of cell-wall polysaccharides and plays an important role in the biological process of KGM. However, in A. konjac the classification, structure, and function of the AkCSLA superfamily have been studied very little. Results Bioinformatics methods were used to identify the 11 AkCSLA genes from the whole genome of Amorphophallus konjac and to systematically analyze their characteristics, phylogenetic evolution, promoter cis-elements, expression patterns, and subcellular locations. Phylogenetic analysis revealed that the AkCSLA gene family can be divided into three subfamilies (Groups I- III), which have close relationships with Arabidopsis. The promoters of most AkCSLA family members contain MBS elements and ABA response elements. Analysis of expression patterns in different tissues showed that most AkCSLAs are highly expressed in the corms. Notably, PEG6000 induced down-regulation of the expression of most AkCSLAs, including AkCSLA11. Subcellular localization results showed that AkCSLA11 was localized to the plasma membrane, Golgi apparatus and endoplasmic reticulum. Transgenic Arabidopsis experiments demonstrated that overexpression of AkCSLA11 reduced the plant’s drought tolerance. This overexpression also inhibited the expression of drought response genes and altered the sugar components of the cell wall. These findings provide new insights into the response mechanisms of A. konjac to drought stress and may offer potential genetic resources for improving crop drought resistance. Conclusion In conclusion, the study reveals that the AkCSLA11 gene from A. konjac negatively impacts drought tolerance when overexpressed in Arabidopsis. This discovery provides valuable insights into the mechanisms of plant response to drought stress and may guide future research on crop improvement for enhanced resilience.
Article
Full-text available
Introduction Amorphophallus albus, a perennial herb in the Araceae family, is a valuable cash crop known for its high production of konjac glucomannan and high disease resistance. Methods In this study, we present a high-quality, chromosome-scale genome assembly of A. albus using a combination of PacBio HiFi sequencing, DNBSEQ short-read sequencing, and Hi-C technology. To elucidate the molecular mechanisms underlying southern blight resistance, we performed an integrated analysis of transcriptomic and metabolomic profiles across three infection stages of A. albus. Results and discussion Here, we assembled and annotated the complete genome of A. albus, providing a chromosome-level assembly with a total genome size of 5.94 Gb and a contig N50 of 5.61 Mb. The A. albus genome comprised 19,908 gene families, including 467 unique families.The slightly larger genome size of A. albus compared to A. konjac may have been affected by a recent whole-genome duplication event. Transcriptional and metabolic analyses revealed significant enrichment of differentially expressed genes (DEGs) and differentially accumulated metabolites (DAMs) involved in phenylpropanoid biosynthesis, plant hormone signal transduction, phenylalanine metabolism, and the biosynthesis of phenylalanine, tyrosine, and tryptophan. These findings not only advance the understanding of genetic and evolutionary characteristics of A. albus but also provide a foundation for future research on the resistance mechanisms of konjac against southern blight disease.
Article
Full-text available
Amorphophallus paeoniifolius (Dennst.) Nicolson, 1885, often known as elephant foot yam, is a tropical tuber crop that originates from south-east Asia and belongs to the Araceae family. It is known for its high production potential and popularity as a medicinal plant. However, the phylogeny and genes for this species are still unavailable. In this study, the first complete chloroplast genome of A. paeoniifolius was reported and phylogenetic analysis was conducted with Araceae species. The chloroplast genome was 176,258 bp in length with 34.80% overall GC content and includes a large single-copy (LSC) region (93,951 bp), a small single-copy (SSC) region (15,013 bp), and a pair of inverted repeat (IRs) regions (33,647 bp). The chloroplast genome has 130 genes, which include 85 protein-coding genes, 37 tRNA genes, and eight rRNA genes. A maximum-likelihood (ML) phylogenetic analysis indicated that all Amorphophallus species formed a single monophyletic clade with a high bootstrap value and A. paeoniifolius was closely related to A. konjac, A. albus, A. krausei, and A. titanum. The chloroplast genome reported in this study will be useful for further taxonomic and evolutionary studies of Amorphophallus.
Article
Full-text available
Stripe rust disease of wheat, caused by Puccinia striiformis f. sp. tritici, (Pst) is one of the most serious diseases of wheat worldwide. In India, virulent stripe rust races have been constantly evolving in the North-Western Plains Zone leading to the failure of some of the most widely grown resistant varieties in the region. With the goal of studying the recent evolution of virulent races in this region, we conducted whole-genome re-sequencing of three prevalent Indian Pst pathotypes Pst46S119, Pst78S84 and Pst110S119. We assembled 58.62, 58.33 and 55.78 Mb of Pst110S119, Pst46S119 and Pst78S84 genome, respectively and found that pathotypes were highly heterozygous. Comparative phylogenetic analysis indicated the recent evolution of pathotypes Pst110S119 and Pst78S84 from Pst46S119. Pathogenicity-related genes classes (CAZyme, proteases, effectors, and secretome proteins) were identified and found to be under positive selection. Higher rate of gene families expansion were also observed in the three pathotypes. A strong association between the effector genes and transposable elements may be the source of the rapid evolution of these strains. Phylogenetic analysis differentiated the Indian races in this study from other known United States, European, African, and Asian races. Diagnostic markers developed for the identification of three Pst pathotypes will help tracking of yellow rust at farmers field and strategizing resistance gene deployment.
Article
Full-text available
Amorphophallus konjac, a perennial herb in the Araceae family, is a cash crop that can produce a large amount of konjac glucomannan. To explore mechanisms underlying such large genomes in the genus Amorphophallus as well as the gene regulation of glucomannan biosynthesis, we present a chromosomelevel genome assembly of A. konjac with a total genome size of 5.60 Gb and a contig N50 of 1.20 Mb. Comparative genomic analysis reveals that A. konjac has undergone two whole-genome duplication (WGD) events in quick succession. Two recent bursts of transposable elements are identified in the A. konjac genome, which contribute greatly to the large genome size. Our transcriptomic analysis of the developmental corms characterizes key genes involved in the biosynthesis of glucomannan and related starches. High expression of cellulose synthase-like A, Cellulose synthase-like D, mannan-synthesis related 1, GDP-mannose pyrophosphorylase and phosphomannomutase fructokinase contributes to glucomannan synthesis during the corm expansion period while high expression of starch synthase, starch branching enzyme and phosphoglucomutase is responsible for starch synthesis in the late corm development stage. In conclusion, we generate a high-quality genome of A. konjac with different sequencing technologies. The expansion of transposable elements has caused the large genome of this species. And the identified key genes in the glucomannan biosynthesis provide valuable candidates for molecular breeding of this crop in the future.
Article
Full-text available
Starch branching enzymes (SBEs) are key determinants of the structure and amount of the starch in plant organs, and as such, they have the capacity to influence plant growth, developmental, and fitness processes, and in addition, the industrial end-use of starch. However, little is known about the role of SBEs in determining starch structure-function relations in economically important horticultural crops such as fruit and leafy greens, many of which accumulate starch transiently. Further, a full understanding of the biological function of these types of starches is lacking. Because of this gap in knowledge, this minireview aims to provide an overview of SBEs in horticultural crops, to investigate the potential role of starch in determining postharvest quality. A systematic examination of SBE sequences in 43 diverse horticultural species, identified SBE1, 2 and 3 isoforms in all species examined except apple, olive, and Brassicaceae, which lacked SBE1, but had a duplicated SBE2. Among our findings after a comprehensive and critical review of published data, was that as apple, banana, and tomato fruits ripens, the ratio of the highly digestible amylopectin component of starch increases relative to the more digestion-resistant amylose fraction, with parallel increases in SBE2 transcription, fruit sugar content, and decreases in starch. It is tempting to speculate that during the ripening of these fruit when starch degradation occurs, there are rearrangements made to the structure of starch possibly via branching enzymes to increase starch digestibility to sugars. We propose that based on the known action of SBEs, and these observations, SBEs may affect produce quality, and shelf-life directly through starch accumulation, and indirectly, by altering sugar availability. Further studies where SBE activity is fine-tuned in these crops, can enrich our understanding of the role of starch across species and may improve horticulture postharvest quality.
Article
Full-text available
Fragaria nilgerrensis is a wild diploid strawberry species endemic to east and southeast region in Asia and provides a rich source of genetic variations for strawberry improvement. Here, we present a chromosome‐scale assembly of F. nilgerrensis using single‐molecule real‐time (SMRT) Pacific Biosciences sequencing and chromosome conformation capture (Hi‐C) genome scaffolding. The genome assembly size was 270.3 Mb, with a contig N50 of ∼8.5 Mb. A total of 28,780 genes and 117.2 Mb of transposable elements were annotated for this genome. Next, detailed comparative genomics with the high‐quality F. vesca reference genome was conducted to obtain the difference among transposable elements, SNPs, Indels, and so on. The genome size of F. nilgerrensis was enhanced by around 50 Mb relatively to F. vesca, which is mainly due to expansion of transposable elements. In comparison to the F. vesca genome, we identified 4,561,825 SNPs, 846,301 Indels, 4,243 inversions, 35,498 translocation, and 10,099 relocation. We also found a marked expansion of genes involved in phenylpropanoid biosynthesis, starch and sucrose metabolism, cyanoamino acid metabolism, plant‐pathogen interaction, brassinosteroid biosynthesis, and plant hormone signal transduction in F. nilgerrensis, which may account for its specific phenotypes and considerable environmental adaptability. Interestingly, we found sequence variations in the upstream regulatory region of FnMYB10, a core transcriptional activator of anthocyanin biosynthesis, resulted in the low expression level of the FnMYB10 gene, which is likely responsible for white fruit phenotype of F. nilgerrensis. The high‐quality F. nilgerrensis genome will be a valuable resource for biological research and comparative genomics research.
Article
Full-text available
Orchardgrass (Dactylis glomerata L.) is an important forage grass for cultivating livestock worldwide. Here, we report an ~1.84‐Gb chromosome‐scale diploid genome assembly of orchardgrass, with a contig N50 of 0.93 Mb, a scaffold N50 of 6.08 Mb, and a super‐scaffold N50 of 252.52 Mb, which is the first chromosome‐scale assembled genome of a cool‐season forage grass. The genome includes 40,088 protein‐coding genes, and 69% of the assembled sequences are transposable elements, with long terminal repeats (LTRs) being the most abundant. The LTR retrotransposons may have been activated and expanded in the grass genome in response to environmental changes during the Pleistocene between 0 and 1 million years ago. Phylogenetic analysis reveals that orchardgrass diverged after rice but before three Triticeae species, and evolutionarily conserved chromosomes were detected by analysing ancient chromosome rearrangements in these grass species. We also re‐sequenced the whole genome of 76 orchardgrass accessions and found that germplasm from Northern Europe and East Asia clustered together, likely due to the exchange of plants along the ‘Silk Road' or other ancient trade routes connecting the East and West. Last, a combined transcriptome, quantitative genetic, and bulk segregant analysis provided insights into the genetic network regulating flowering time in orchardgrass and revealed four main candidate genes controlling this trait. This chromosome‐scale genome and the online database of orchardgrass developed here will facilitate the discovery of genes controlling agronomically important traits, stimulate genetic improvement of and functional genetic research on orchardgrass, and provide comparative genetic resources for other forage grasses. This article is protected by copyright. All rights reserved.
Article
Full-text available
Polyploidization is a major driver of speciation and its importance to plant evolution has been well recognized. Bamboos comprise of one diploid herbaceous and three polyploid woody lineages, and are the only major subfamily in grasses which diversified in forests with tree-like lignified culm for its woody members. Here we generated four draft genomes of major bamboo lineages at three different ploidy levels (diploid, tetraploid and hexaploid). We also constructed a high-density genetic linkage map of bamboo for a hexaploid species, providing a linkage-map-based strategy for assembly and identification of subgenomes in polyploids. Further phylogenomic analyses based on a large dataset of syntenic genes with expected copies revealed that woody bamboos originated subsequent to the divergence of the herbaceous bamboo lineage, and experienced complex reticulate evolution by three independent allopolyploid events involving four extinct diploid ancestors. A shared but distinct subgenome was identified in all polyploid forms, and its progenitor could be critical in ancient polyploidizations and origin of woody bamboos. We also found important genetic clues to the unique flowering behavior and woody trait in bamboos. Taken together, our study provides significant insights into ancient reticulate evolution at the subgenome level in the absence of extant donor species, and offers a potential model scenario for a broad-scale study of plant origins by allopolyploidization in angiosperms.
Article
Aims: Amorphophallus konjac is an important commercial crop grown in China because it is the only plant species which is rich in glucomannan concentration. Recently, an outbreak of anthracnose (incidence ranging from 10-15%) was observed in a field survey conducted from June to August 2018. This study aims to identify the causal agent of Amorphophallus konjac anthracnose. Methods and results: The pathogen was isolated on PDA medium. The fungal colony on PDA was grayish to dark gray. Conidia were falcate, one-celled and hyaline. Based on the micro-morphological and cultural characteristics, the pathogen was identified as Colletotrichum sp. BLAST search and phylogenetic analysis of the ITS, GAPDH, CHS1, ACT, CAL and TUB2 genes revealed the pathogen as Colletotrichum siamense. Koch's postulates were conducted on 2-month konjac leaves with conidial suspension. Development of typical anthracnose disease was recorded 5 days after inoculation and the pathogen's identity was confirmed by re-isolation and molecular identification. Conclusions: Amorphophallus konjac anthracnose was caused by Colletotrichum siamense in China. Significance and impact of study: Identification of causal agent of Amorphophallus konjac anthracnose will be helpful in designing effective disease control strategies.
Article
The fossil record is incomplete, so molecular divergence time analysis is a crucial tool in estimating evolutionary timescales. The software MCMCtree contained in the PAML software provides Bayesian methods to estimate divergence times of genomic-sized sequences. Here, I present MCMCtreeR, a flexible R package to prepare time priors for MCMCtree analysis and plot time-scaled phylogenies. The package provides functions to refine parameters and visualise time-calibrated node prior distributions so that these priors accurately reflect confidence in known, usually fossil, time information. After the parameters have been chosen, the package produces output files ready for MCMCtree analysis. Following analysis, the package has tools to compare prior and posterior calibrated node age distributions and produce plots of the time-scaled phylogenies. The plotting functions allow for the inclusion of age uncertainty on time-scaled phylogenies, including the display of full posterior distributions on nodes. Options also allow for the inclusion of the geological timescale, and these plotting functions are applicable with posterior age estimates from any Bayesian divergence-time estimation software. Availability: MCMCtreeR is an R package available on CRAN (https://CRAN.R-project.org/package=MCMCtreeR). MCMCtreeR depends on the R packages ape, sn, and stats4.