ArticlePublisher preview available
To read the full-text of this research, you can request a copy directly from the authors.

Abstract and Figures

Synthesis of custom de novo DNA sequences is highly demanded by fast-growing field of synthetic biology. Usually DNA sequences with length more than 1 kb are assembled from smaller synthetic DNA fragments (synthons) obtained by PCR assembly. The ability to synthesize longer synthons sufficiently reduces efforts and time for DNA synthesis. We developed a novel rational oligonucleotide design and programmed approach for the assembly of synthetic DNA synthons up to 1550 bp. The developed procedure was thoroughly investigated by synthesis of cholesterol oxidase gene from Streptomyces lavendulae (1544 bp). Our approach is based on combined design, oligonucleotide concentration gradient, and specialized assembly program that directs assembly reaction to full-length gene in a stepwise manner. The process includes conventional thermodynamically balanced assembly, thermodynamically balanced inside-out elongation, and further amplification. The ability of DNA polymerase to perform programmed assembly is highly influenced by the presence of 5′ → 3′-exonuclease activity. Oligonucleotide probing of PCR assembly products allowed us to shed light on the nature of high molecular weight spurious by-products and to understand the mechanism of their formation. For the first time, we applied light scattering techniques for tracking of oligonucleotide annealing, analysis of gene assembly products, and even for real-time monitoring of gene assembly process.
This content is subject to copyright. Terms and conditions apply.
METHODS AND PROTOCOLS
Programmed assembly of long DNA synthons: design, mechanism,
and online monitoring
Veronika V. Shchur
1
&Yuliya P. Burankova
1
&Michail A. Shapira
1
&Dmitry V. Klevzhits
1
&Sergei A. Usanov
1
&
Aleksei V. Yantsevich
1
Received: 24 May 2019 /Revised: 7 August 2019 /Accepted: 21 August 2019
#Springer-Verlag GmbH Germany, part of Springer Nature 2019
Abstract
Synthesis of custom de novo DNA sequences is highly demanded by fast-growing field of synthetic biology. Usually DNA
sequences with length more than 1 kb are assembled from smaller synthetic DNA fragments (synthons) obtained by PCR
assembly. The ability to synthesize longer synthons sufficiently reduces efforts and time for DNA synthesis. We developed a
novel rational oligonucleotide design and programmed approach for the assembly of synthetic DNA synthons up to 1550 bp. The
developed procedure was thoroughly investigated by synthesis of cholesterol oxidase gene from Streptomyces lavendulae (1544
bp). Our approach is based on combined design, oligonucleotide concentration gradient, and specialized assembly program that
directs assembly reaction to full-length gene in a stepwise manner. The process includes conventional thermodynamically
balanced assembly, thermodynamically balanced inside-out elongation, and further amplification. The ability of DNA polymer-
ase to perform programmed assembly is highly influenced by the presence of 53-exonuclease activity. Oligonucleotide
probing of PCR assembly products allowed us to shed light on the nature of high molecular weight spurious by-products and to
understand the mechanism of their formation. For the first time, we applied light scattering techniques for tracking of oligonu-
cleotide annealing, analysis of gene assembly products, and even for real-time monitoring of gene assembly process.
Keywords Gene synthesis .Concatemer .Dynamic light scattering .Polymerase chain reaction .Cholesterol oxidase
Introduction
During the last few decades, the progress in the methodology
of the DNA synthesis created fundamentally new opportuni-
ties for molecular biology. These opportunities formed the
basis for a new scientific discipline synthetic biology.
The ability to synthesize custom DNA molecules along with
the insight in structure-function relations for biosystems opens
unlimited possibilities for biotechnology in the nearest future.
Nowadays, artificial DNA sequences or synthetic genes are a
powerful tool in molecular biology and biotechnology field.
Synthetic genes have already found applications for protein
engineering (Cox et al. 2007; Plesa et al. 2018), metabolic
engineering, species-specific sequence optimization (Kudla
et al. 2009; Lichtenberg et al. 2009), and even whole synthetic
genomes (Gibson et al. 2008; Smith et al. 2003). All currently
described methods of DNA synthesis are based on ligase
chain reaction (LCR) (Smith et al. 2003; Strizhov et al.
1996) or polymerase chain reaction (PCR) (Stemmer et al.
1995) that allows to assemble pools of oligonucleotides, ob-
tained by phosphoramidite chemistry. LCR techniques are
rarely used in practice.
The most commonly used artificial DNA synthesis tech-
niques currently rely on PCR-mediated assembly of target
DNA sequence (PCA) from oligonucleotides with length
40100 bases. Usually the process consists of two stages:
assembly of target sequence from a pool of oligonucleotides
and enrichment of target gene from a crude PCR mixture by
additional amplification by flanking primers.
There are a lot of developed designs for PCA starting from
the procedure described by Stemmer in 1995 (Stemmer et al.
Electronic supplementary material The online version of this article
(https://doi.org/10.1007/s00253-019-10099-4) contains supplementary
material, which is available to authorized users.
*Sergei A. Usanov
usanov@iboch.by
*Aleksei V. Yantsevich
al.yantsevich@gmail.com
1
Institute of Bioorganic Chemistry, National Academy of Sciences of
Belarus, Acad. Kuprevicha Street, 5/2, 220141 Minsk, Belarus
https://doi.org/10.1007/s00253-019-10099-4
Applied Microbiology and Biotechnology (2019) 103:91039117
/Published online: 12 September 2019
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
... Besides that, unmodified PCA has been simplified by combining the gene assembly and PCR reaction into one reaction [11][12][13]. Moreover, a combination of PCA and TBIO techniques has been shown to enable the synthesis of genes up to 1550 bp [14]. ...
Article
Full-text available
Synthetic gene construction is one of the components in synthetic biology. It can be used for various purposes such as to optimize the gene expression. In this study, we proposed six predetermined criteria for designing oligos for the synthesis of Beauveria bassiana protease gene. These criteria were set up to optimize the cost and to accommodate the oligos assembly. A total of 44 overlapping oligos were designed and synthesized 0.5 µM of oligos mixture was used in assembly PCR together with high fidelity DNA polymerase to produce 1.1 kbp fragment. The gene was visualized by agarose gel electrophoresis before subcloned into pCR™2.1-TOPO. The sequence of the gene was verified by DNA sequencing. Site-directed mutagenesis was performed to repair errors resulted from the gene synthesis. A sharp and distinguished band of the expected size of the protease gene was observed in agarose gel electrophoresis. Errors in the sequence which was detected by DNA sequencing was successfully repaired using our simplified site-directed mutagenesis protocol. The result indicated long DNA sequences (>1 kbp) can be synthesized with less error using our method. Additionally, this method was easy to perform because it would require minimum optimization to synthesize other genes by following our guideline.
... Олигонуклеотид синтезировали, подвергали депротекции и очищали на картридже для ТФЭ в соответствии с методом, описанным в [15]. Для сохранения ДМТ-группы на 5ʹ-конце олигонуклеотида из процедуры очистки исключена стадия детритилирования (обработка 2 %-ной трифторуксусной кислотой). ...
Article
The functionalization of 5ʹ-OH group in nucleic acids is of significant value for molecular biology. In the current work we discovered that acid-labile 4,4ʹ-dimethoxytrityl protecting group (DMT) of oligonucleotides (ONs) is stable under PCR conditions and does not interfere with activity of DNA polymerases. So application of 5ʹ-DMT-protected ONs could allow producing both symmetric and asymmetric 5ʹ-DMT-blocked double-stranded DNA (dsDNA) fragments. We demonstrated that the presence of thiol compounds (mercaptoethanol and dithiothreitol) in PCR mixture is undesirable for the stability of DMT-group. DMT-ONs can be successfully used during polymerase chain assembly of synthetic genes. We tested 5ʹ-DMT dsDNA in blunt-end DNA ligation reaction by T4 DNA ligase and found that it could not be ligated with 5ʹ-phosphorylated DNA fragments, namely linearized plasmid vector pJET1.2/blunt. Possible reason for this is steric hindrance created by bulky and rigid DMT-group, that prevents entering enzyme active site. We also demonstrated that 5ʹ-DMT modification of dsDNA does not affect activity of T5 5ʹ,3ʹ-exonuclease towards both ssDNA and dsDNA. Further screening of the exonucleases, sensitive to 5ʹ-DMT-modification or search of ways to separate long 5ʹ-DMT-ssDNA and 5ʹ-OH-ssDNA could allow finding application of 5ʹ-DMT-modified oligo- and polynucleotides.
Article
Full-text available
DNA assembly forms the cornerstone of modern synthetic biology. Despite the numerous available methods, scarless multi-fragment assembly of large plasmids remains challenging. Furthermore, the upcoming wave in molecular biological automation demands a rethinking of how we perform DNA assembly. To streamline automation workflow and minimize operator intervention, a non-enzymatic assembly method is highly desirable. Here, we report the optimization and operationalization of a process called Twin-Primer Assembly (TPA), which is a method to assemble polymerase chain reaction-amplified fragments into a plasmid without the use of enzymes. TPA is capable of assembling a 7 kb plasmid from 10 fragments at ∼80% fidelity and a 31 kb plasmid from five fragments at ∼50% fidelity. TPA cloning is scarless and sequence independent. Even without the use of enzymes, the performance of TPA is on par with some of the best in vitro assembly methods currently available. TPA should be an invaluable addition to a synthetic biologist's toolbox.
Article
Full-text available
Next-generation sequencing technology has enabled the detection of rare genetic or somatic mutations and contributed to our understanding of disease progression and evolution. However, many next-generation sequencing technologies first rely on DNA amplification, via the Polymerase Chain Reaction (PCR), as part of sample preparation workflows. Mistakes made during PCR appear in sequencing data and contribute to false mutations that can ultimately confound genetic analysis. In this report, a single-molecule sequencing assay was used to comprehensively catalog the different types of errors introduced during PCR, including polymerase misincorporation, structure-induced template-switching, PCR-mediated recombination and DNA damage. In addition to well-characterized polymerase base substitution errors, other sources of error were found to be equally prevalent. PCR-mediated recombination by Taq polymerase was observed at the single-molecule level, and surprisingly found to occur as frequently as polymerase base substitution errors, suggesting it may be an underappreciated source of error for multiplex amplification reactions. Inverted repeat structural elements in lacZ caused polymerase template-switching between the top and bottom strands during replication and the frequency of these events were measured for different polymerases. For very accurate polymerases, DNA damage introduced during temperature cycling, and not polymerase base substitution errors, appeared to be the major contributor toward mutations occurring in amplification products. In total, we analyzed PCR products at the single-molecule level and present here a more complete picture of the types of mistakes that occur during DNA amplification.
Article
Full-text available
Spodoptera species, representing widespread polyphagous insect pests, are resistant to Bacillus thuringiensis delta -endotoxins used thus far as insecticides in transgenic plants. Here we describe the chemical synthesis of a cryIC gene by a novel template directed ligation-PCR method. This simple and economical method to construct large synthetic genes can be used when routine resynthesis of genes is required. Chemically phosphorylated adjacent oligonucleotides of the gene to be synthesized are assembled and ligated on a single-stranded, partially homologous template derived from a wild-type gene (cryIC in our case) by a thermostable Pfu DNA ligase using repeated cycles of melting, annealing, and ligation. The resulting synthetic DNA strands are selectively amplified by PCR with short specific flanking primers that are complementary only to the new synthetic DNA. Optimized expression of the synthetic cryIC gene in alfalfa and tobacco results in the production of 0.01-0.2% of total soluble proteins as CryIC toxin and provides protection against the Egyptian cotton leafworm (Spodoptera littoralis) and the beet armyworm (Spodoptera exigua). To facilitate selection and breeding of Spodoptera-resistant plants, the cryIC gene was linked to a pat gene, conferring resistance to the herbicide BASTA.
Article
Full-text available
Generation of customized DNA binding domains targeting unique sequences in complex genomes is crucial for many biotechnological applications. The recently described DNA binding domain of the transcription activator-like effectors (TALEs) from Xanthomonas consists of a series of repeats arranged in tandem, each repeat binding a nucleotide of the target sequence. We present here a strategy for engineering of TALE proteins with novel DNA binding specificities based on the 17.5 repeat-containing AvrBs3 TALE as a scaffold. For each of the 17 full repeats, four module types were generated, each with a distinct base preference. Using this set of 68 repeat modules, recognition domains for any 17 nucleotide DNA target sequence of choice can be constructed by assembling selected modules in a defined linear order. Assembly is performed in two successive one-pot cloning steps using the Golden Gate cloning method that allows seamless fusion of multiple DNA fragments. Applying this strategy, we assembled designer TALEs with new target specificities and tested their function in vivo.
Article
Full-text available
Genome sequences can be conceptualized as arrangements of motifs or words. The frequencies and positional distributions of these words within particular non-coding genomic segments provide important insights into how the words function in processes such as mRNA stability and regulation of gene expression. Using an enumerative word discovery approach, we investigated the frequencies and positional distributions of all 65,536 different 8-letter words in the genome of Arabidopsis thaliana. Focusing on promoter regions, introns, and 3' and 5' untranslated regions (3'UTRs and 5'UTRs), we compared word frequencies in these segments to genome-wide frequencies. The statistically interesting words in each segment were clustered with similar words to generate motif logos. We investigated whether words were clustered at particular locations or were distributed randomly within each genomic segment, and we classified the words using gene expression information from public repositories. Finally, we investigated whether particular sets of words appeared together more frequently than others. Our studies provide a detailed view of the word composition of several segments of the non-coding portion of the Arabidopsis genome. Each segment contains a unique word-based signature. The respective signatures consist of the sets of enriched words, 'unwords', and word pairs within a segment, as well as the preferential locations and functional classifications for the signature words. Additionally, the positional distributions of enriched words within the segments highlight possible functional elements, and the co-associations of words in promoter regions likely represent the formation of higher order regulatory modules. This work is an important step toward fully cataloguing the functional elements of the Arabidopsis genome.
Article
Large-scale gene synthesis in tiny droplets Gene synthesis technology is important for functional characterization of DNA sequences and for the development of synthetic biology. However, current methods are limited by their low scalability and high cost. Plesa et al. developed a gene synthesis method, DropSynth, which uses barcoded beads to concentrate oligos and subsequently assemble them into synthetic genes within picoliter emulsion droplets. DropSynth allows generation of large libraries of thousands of genes and functional testing of all possible mutations of a particular sequence. Science , this issue p. 343
Article
A basic requirement for synthetic biology is the availability of efficient DNA assembly methods. We have previously reported the development of Golden Gate cloning, a method that allows parallel assembly of multiple DNA fragments in a one-tube reaction. Golden Gate cloning can be used for different levels of construct assembly: from gene fragments to complete gene coding sequences, from basic genetic elements to full transcription units, and finally from transcription units to multigene constructs. We provide here a protocol for DNA assembly using Golden Gate cloning, taking as an example the level of assembly of gene fragments to complete coding sequences, a level of cloning that can be used to perform DNA shuffling. Such protocol requires the following steps: (1) selecting fusion sites within parental sequences (sites at which parental sequences will be recombined), (2) amplifying all DNA fragments by PCR to add flanking restriction sites, (3) cloning the amplified fragments in intermediate constructs, and (4) assembling all or selected sets of intermediate constructs in a compatible recipient vector using a one-pot restriction-ligation.
Article
Three methods for assembling multiple, overlapping DNA molecules are described. Each method shares the same basic approach: (i) an exonuclease removes nucleotides from the ends of double-stranded (ds) DNA molecules, exposing complementary single-stranded (ss) DNA overhangs that are specifically annealed; (ii) the ssDNA gaps of the joined molecules are filled in by DNA polymerase, and the nicks are covalently sealed by DNA ligase. The first method employs the 3'-exonuclease activity of T4 DNA polymerase (T4 pol), Taq DNA polymerase (Taq pol), and Taq DNA ligase (Taq lig) in a two-step thermocycled reaction. The second method uses 3'-exonuclease III (ExoIII), antibody-bound Taq pol, and Taq lig in a one-step thermocycled reaction. The third method employs 5'-T5 exonuclease, Phusion® DNA polymerase, and Taq lig in a one-step isothermal reaction and can be used to assemble both ssDNA and dsDNA. These assembly methods can be used to seamlessly construct synthetic and natural genes, genetic pathways, and entire genomes and could be very useful for molecular engineering tools.
Article
DNA synthesis techniques and technologies are quickly becoming a cornerstone of modern molecular biology and play a pivotal role in the field of synthetic biology. The ability to synthesize whole genes, novel genetic pathways, and even entire genomes is no longer the dream it was 30 years ago. Using little more than a thermocycler, commercially synthesized oligonucleotides, and DNA polymerases, a standard molecular biology laboratory can synthesize several kilobase pairs of synthetic DNA in a week using existing techniques. Herein, we review the techniques used in the generation of synthetic DNA, from the chemical synthesis of oligonucleotides to their assembly into long, custom sequences. Software and websites to facilitate the execution of these approaches are explored, and applications of DNA synthesis techniques to gene expression and synthetic biology are discussed. Finally, an example of automated gene synthesis from our own laboratory is provided.