Figure 4 - uploaded by Lev Jardón-Barbolla
Content may be subject to copyright.
Ion-Torrent. The bead libraries derived from emulsion PCR are deposited into a “chip” with millions of cavities or wells that fit only one well. Each well is part of a microtransistor that integrates the chip, where voltage changes can be registered individually. Sequencing proceeds in a similar fashion as pyrosequencing, but instead of light emission, the addition of each base produces a voltage change. Modified form Niedringhaus et al. (2011). 

Ion-Torrent. The bead libraries derived from emulsion PCR are deposited into a “chip” with millions of cavities or wells that fit only one well. Each well is part of a microtransistor that integrates the chip, where voltage changes can be registered individually. Sequencing proceeds in a similar fashion as pyrosequencing, but instead of light emission, the addition of each base produces a voltage change. Modified form Niedringhaus et al. (2011). 

Source publication
Article
Full-text available
Recent years have witnessed the advent and rapid development of massive sequencing technology, commonly known as Next Generation Sequencing (NGS). This technology allows for rapid, massive and inexpensive sequencing of genome regions or entire genomes, making possible genomic studies of non-model organisms and has seen great progress in metagenomic...

Context in source publication

Context 1
... calibration for long stretches of a single nucleotide has its limits, and it is usually inaccurate when > 6 bases of the same nucleotide are added in a row (Mardis, 2008). Although in the last years the length of reads obtained by pyrosequencing has increased considerably (Shokralla et al., 2012), the coverage for this approach is still the lowest for the currently available platforms, ranging from 7-10X, depending on genome size (i.e. Barbazuk et al., 2007; Wheeler et al., 2008; Vera et al., 2008). Illumina genome analyzer (GAIIx) . This is one of the most praised massive sequencing platforms, due to its high quality sequences and great coverage (millions of simultaneously sequenced fragments). The procedure for library construction is also based on synthesis of complementary strands of DNA through PCR. However, the specifics of such synthesis have notable differences with the emulsion PCR used by 454. As most current massive sequencing protocols, DNA fragmentation and size selection are necessary steps prior to library construction. Once fragments of a desired size are selected, platform- specific adaptors are added which allows attachment to the sequencing matrix, a flow-cell device that will support bridge amplification and sequencing per se . On this matrix, each attached fragment is amplified producing multiple and identical DNA copies in a cluster (Fig. 2). Illumina ́s matrix has 8 separate channels (or lines) in each of which a library of clusters can be created. Each cluster is sequenced by synthesis, in which all 4 nucleotides are added simultaneously to the flow device, along with DNA polymerase for addition into the oligo-primed fragments that form the clusters. For this sequencing approach, the addition of a nucleotide blocks subsequent incorporation of nucleotides, interrupting the synthesis, and releases a fluorescent signal that is unique for each type of nucleotide. Imaging of the fluorescent signal follows the incorporation. After imaging, the blocking group is removed and leaves DNA strands ready for the next nucleotide incorporation by DNA polymerase. This series of steps continues for a specific number of rounds allowing read-lengths of 60-150 bases (Glenn, 2011). Applied biosystems SOLiD TM sequencer . Sequencing by oligo ligation detection (SOLiD). This platform has the best quality of sequences and the smallest error rate (Table 1), as a result of the ligation-based approach. The library construction occurs through an emulsion PCR with small magnetic beads (similar to 454/Roche). After completion of library construction, the resulting beads are attached covalently to a flow-cell glass slide. Compared to other platforms, SOLiD has a major difference in that the approach taken for sequencing the amplified fragments, uses DNA ligase as illustrated in figure 3. The slide with attached beads is exposed to cyclic enzymatic reactions. During the first cycle a primer sequence is incorporated, that is complementary to the adaptor ligated to the fragments as well as a ligase and 4 fluorescently labeled 2- base encoded probes. Non-ligated probes are then washed away, followed by the imaging of the released fluorescence that identifies the ligated probe (Landergren et al., 1988). The cycle is repeated to remove the fluorescent dye and regenerate the 5 ́-PO4 groups for 10 subsequent ligation cycles (Fig. 3). A second ligation round is performed with a “n-1” primer, which resets the interrogated sequence one base downstream, and then the 10 cycle ligation proceeds. Four more rounds of ligation cycles are performed with progressively “n-1” primers. Color calls from the 5-ligation rounds are then ordered into a linear sequence (the color space) and compared to a linear sequence to decode DNA problem sequence. The read length of SOLiD was initially 35 bp with a final output of 3 Gb per run (Liu et al., 2012). These numbers have increased up to 75 bp per read and 10 Gb per run (Shokralla et al., 2012), with high accuracy in the assignment of bases due to the 2-base sequencing method (accuracy of 99.85% after filtering; Liu et al., 2012; Table 1). Ion-torrent ( the chip is the machine ). This platform is gaining popularity in the market mainly due to the low cost of sequencing runs and the possibility of in-house daily use without many technical requirements or maintenance. The library construction procedure is almost identical to 454 or pyrosequencing, including DNA fragmentation and adaptor ligation. The bead library from the emulsion PCR is deposited into a “chip” with millions of wells (165- 600 million depending on the specific version; Shokralla et al., 2012), where only 1 bead fits in each well. The extraordinary aspect of this technology is the chip itself. Each well is integrated to the chip’s ion-sensitive layer and a proprietary ion sensor to register the very small voltage changes (per well) that result from nucleotide addition during DNA sequencing by synthesis (Rothberg et al., 2011, Fig. 4). As in 454, nucleotide addition is not followed by termination and it proceeds in cycles, 1 nucleotide after another. This results in the same problem that the pyrosequencing platforms have with homopolymer detection. The read length of ion Torrent is currently about 150 bp with a final output of 3 Gb per run (Liu et al., 2012), but it has increased up to 75 additional bp per read, and up to 10 Gb per run (Shokralla et al., 2012; Table 1). Single molecule real time (SMRT) sequencing . Technological bets on massive sequencing are focused on the possibility to sequence single molecules in real time or “single molecule real time” (SMRT) sequencing. These technological developments are referred to as the third generation sequencing and, to date, only Pacific Biosciences (PacBio) has released a commercial platform implementing SMRT sequencing. SMRT implements the possibility of attaching a polymerase to a sequencing matrix and be able to follow in real time the synthesis process of a single DNA molecule (Fig. 5). An important feature of SMRT is that it does not require library construction as a prior step to sequencing, increasing the sequence production rate. Also, this technology allows for longer reads. The read length of Pacific Biosciences system has been reported to be up to 1 500 bp with a final output of 60- 75 Mb per run (Shokralla et al., 2012 ; Table 1). However, it is difficult to score single base additions in real time. The main problem in scoring real time single base additions is the high speed at which each polymerase synthesizes DNA exhibits stochastic fluctuations (Eid et al., 2009), thus, each enzyme has to be monitored individually and nucleotide additions registered at the appropriate speed in real time. This difficulties result in the highest error rates among NGS platforms (Table 1). Currently, the main bottlenecks in genomic studies are in the post-sequence stages or assembly, which usually requires major computational capacities (Henson et al., 2012). Genomes sequences are variable (for instance, including heterozygosity in diploid organisms) and can be highly repetitive, and during assembly it is necessary to distinguish this “real” variation from sequencing errors and stay within reasonable computational times, making assembly a complex problem (Henson et al., 2012). The task would be much simpler if it could be determined whether a given set of reads corresponds to overlapping positions on the genome. In this sense, generally longer reads help to correctly find overlapping regions. Given the presence of short reads in all NGS platforms, it is clear that assembling genomes, metagenomes or transcriptomes is a task that faces enormous challenges, which are even more challenging due to high error rates. The ease and accuracy of assembly depends on the degree of overlapping between reads. Two reads are considered as overlapping when there is a sequence match between reads that is long enough to be reliably distinguished from a random event (Henson et al., 2012). Thus, high uncertainty in assembly arises from locations in which not enough overlap is present to extend the genome sequence. In combination, coverage and read length increase confidence levels of the assembly process. Experience and models show that a good assembly using Sanger reads requires each base to be covered on average by at least 3 reads (3X, Lander and Waterman, 1988). However, for the short reads of NGS platforms, this number can rise up to 30X (Farrer et al., 2009; Meyer et al., 2012). High error rates also affect assembly and thus higher coverage is needed. Published genomes have used between 5X and 10X with Sanger (Adams et al., 2000; Venter et al., 2001; Venter et al., 2004), but new publications have begun to report 50X, 100X and higher coverage with NGS (Diguistini et al., 2009; Quail et al., 2012). However, even with high coverage, overcoming the problem of repeats and derived assembly gaps sometimes need to be spanned by paired reads (2 reads generated from a single fragment of DNA and separated by known distance), which are available for most NGS platforms (Schatz et al., 2010). Much research has been published on the assembly problem, as well as development of new assembly algorithms and platforms (for recent reviews see Schatz et al., 2010; Nagarajan and Pop, 2013). Nevertheless, it has been suggested that the best approach is to use a reference genome sequence as a guide to resolve repeats, an approach known as “comparative assembly” (Pop et al., 2004). A detailed analysis by Schatz et al. (2010) on the assembly of large genomes using NGS (Illumina and 454) showed that assemblies with these platforms are inferior than those accomplished using Sanger technology, but recognized the appeal of the lower costs of NGS. In this paper, guidelines are given to decide the best way to proceed in terms of choosing the platform and assembly method when pursuing a genome sequencing project. As we mention ...

Similar publications

Article
Full-text available
The use of next-generation sequencing technologies in drinking water distribution systems (DWDS) has shed insight into the microbial communities’ composition, and interaction in the drinking water microbiome. For the past two decades, various studies have been conducted in which metagenomics data have been collected over extended periods and analyz...
Article
Full-text available
Background The phytoplankton spring bloom in freshwater habitats is a complex, recurring, and dynamic ecological spectacle that unfolds at multiple biological scales. Although enormous taxonomic shifts in microbial assemblages during and after the bloom have been reported, genomic information on the microbial community of the spring bloom remains s...
Article
Full-text available
16S rRNA community profiling continues to be a useful tool to study microbiome composition and dynamics, in part due to advances in next generation sequencing technology that translate into reductions in cost. Reliable taxonomic identification to the species-level, however, remains difficult, especially for short-read sequencing platforms, due to i...
Article
Full-text available
Motivation: Association analysis of microbiome composition with disease-related outcomes provides invaluable knowledge towards understanding the roles of microbes in the underlying disease mechanisms. Proper analysis of sparse compositional microbiome data is challenging. Existing methods rely on strong assumptions on the data structure and fail t...

Citations

... The development of HTS has allowed us to obtain large datasets and represents a more cost-efficient alternative to conventional techniques [106]. Massive sequencing technology has permitted the study of model and non-model organisms, contributing to the advancement of disciplines such as microbial ecology and evolution [107]. HTS has enabled sizeable global analysis of microbial communities including fungi associated with plant systems (mycorrhizae, endophytes, and pathogens), free-living saprotrophs, and fungi from extreme environments [48]. ...
Article
Full-text available
The deep sea (>1000 m below sea level) represents one of the most extreme environments of the ocean. Despite exhibiting harsh abiotic conditions such as low temperatures, high hydrostatic pressure, high salinity concentrations, a low input of organic matter, and absence of light, the deep sea encompasses a great fungal diversity. For decades, most knowledge on the fungal diversity of the deep sea was obtained through culture-dependent techniques. More recently, with the latest advances of high-throughput next generation sequencing platforms, there has been a rapid increment in the number of studies using culture-independent techniques. This review brings into the spotlight the progress of the techniques used to assess the diversity and ecological role of the deep-sea mycobiota and provides an overview on how the omics technologies have contributed to gaining knowledge about fungi and their activity in poorly explored marine environments. Finally, current challenges and suggested coordinated efforts to overcome them are discussed.
... Generally, pyrosequencing is based on the real-time detection of pyrophosphate (PPi) molecules, which are released during the incorporation of nucleotides by DNA polymerase. Afterwards, PPi molecules initiate a series of enzymatic reactions, leading to the production of light by the firefly enzyme luciferase (Escalante et al. 2014;Galimberti et al. 2013;Hellberg and Morrissey 2011). Abbadi et al. (2017) showed the potential of pyrosequencing as a simple, rapid and cost-effective tool for the differentiation of several bivalve species in fresh and processed seafood samples, particularly comparing with Sanger's sequencing. ...
Article
The world’s seafood supply and trade have increased in the last decades, as well as the potential for marketed species substitution. Currently, seafood safety and authenticity assessment have become central issues, directly related with the identification of improper labeling of processed foods. To detect and prevent mislabeling issues, species identification using DNA barcodes has been widely used as effective molecular markers. Therefore, this review intends to present the current status on the application of DNA barcodes to seafood species authentication. In this regard, the barcode regions, reference databases and related methodologies are described, while applications are listed and summarized. Cytochrome c oxidase subunit I (COI) gene has been the preferential targeted DNA region in animal species identification, including fish and shellfish, though other mitochondrial (cytb, 12S rRNA, 16S rRNA) and nuclear genes have been used. DNA barcoding relying on Sanger’s sequencing has been the most used approach for seafood authentication. Nevertheless, in recent years, noteworthy progresses have been advanced toward DNA barcoding strategies, involving next generation sequencing. Methods relying on real-time PCR using species-specific primers and probes or followed by high resolution melting analysis combined with DNA barcodes represent alternative and promising approaches for simple, cost-effective and high-throughput species discrimination in processed seafood. Still, polymerase chain reaction with restriction fragment length polymorphism detection, targeting DNA barcodes, continues to be a well-established and broadly accepted method in seafood authentication.
... Actualmente, se pueden secuenciar genomas completos muy rápidamente, a un costo relativamente modesto, analizar las diferencias entre todo el genoma de las plantas u otros organismos silvestres y cómo cambiaron a consecuencia de la domesticación. También se pueden usar diferentes marcadores genéticos, para evaluar geográficamente dónde se domesticaron las diferentes especies, estimar hace cuánto tiempo sucedieron los cambios fisiológicos y morfológicos, cómo cambiaron sus tamaños efectivos, cuáles características y/o genes fueron los blancos de la selección y la intensidad de esta selección, entre otros procesos evolutivos (Eguiarte et al., 2013;Escalante et al., 2014;Gaut, 2015;Gaut et al., 2015). este momento así como en el futuro. ...
... También se observa una clara diferenciación de las variedades cultivadas en la Península de Yucatán con respecto al resto de México (Sánchez de la . En esta especie aún quedan por describir los patrones de flujo génico a nivel regional asociados a las actividades humanas, así como explorar hipótesis demográficas, y en particular encontrar a las poblaciones genéticamente más cercanas al centro de origen de la especie, utilizando para esto métodos genómicos, ya que como discutiremos más adelante, con las tecnologías de secuenciación de siguiente generación podremos además explorar la historia demográfica e identificar señales de selección en genes de importancia agronómica (ver también Eguiarte et al. (2013); Escalante et al. (2014)). ...
Article
Full-text available
La domesticación de plantas y animales permite estudiar diferentes procesos evolutivos, como la selección, adaptación y especiación. En este artículo se describen avances recientes en el estudio de las calabazas, las cuales constituyen el género Cucurbita (Cucurbitaceae) siendo un grupo de plantas herbáceas americanas que incluyen entre 12 y 15 especies. Cucurbita ha tenido seis eventos de domesticación, de los cuales cuatro sucedieron en México. Este es un género relativamente reciente, que surgió en Norte América hace 16 millones de años y sus especies cultivadas mantienen una alta variación genética; Cucurbita pepo es la especie que presenta mayor variación genética,variación asociada a dos domesticaciones independientes, una en el norte de México, y otra en el Sureste de los Estados Unidos. En otra especie, Cucurbita argyrosperma, sus poblaciones de la Península de Yucatán, representan una poza genética diferenciada del resto de la especie. El estudio del genoma de C. argyrosperma y taxa cercanos ha revelado las regiones de su genoma asociadas a la domesticación. Las poblaciones de las especies de este género representan una fuente de importantes recursos genéticos frente al cambio climático y constituyen un buen sistema para el estudio de la domesticación y de diferentes procesos evolutivos.
... Las principales diferencias de estas plataformas radican en la no necesidad de realizar una PCR antes de la secuenciación (aunque los protocolos de Illumina para la construcción de bibliotecas genómicas, no usan PCR para su secuenciación), reduciendo el tiempo de análisis y los errores derivados de ésta, y en que la detección de nucleótidos se realiza en tiempo real: Single molecule real time (SMRT) sequencing. El método de SMRT implementa la posibilidad de asociar una polimerasa a una matriz de la secuencia y poder seguir en tiempo real el proceso de síntesis de una sola molécula de ADN y además no requiere la construcción de una biblioteca como un paso previo a la secuenciación, aumentando la tasa de producción de la secuencia (Escalante et al., 2014). Los secuenciadores desarrollados en esta etapa son las plataformas Pacific Biosciences y Oxford Nanopore. ...
... El principal problema en el registro es la velocidad a la que cada polimerasa sintetiza ADN, debido a las fluctuaciones propias de cada enzima, por lo que cada enzima tiene que controlarse individualmente y las adiciones de nucleótidos deben ser registradas a la velocidad adecuada en tiempo real. Como resultado, se obtienen las mayores tasas de error entre las plataformas de NGS (Escalante et al., 2014). Ver descripción gráfica del método en Anexos. ...
Presentation
Full-text available
SECUENCIAMIENTO DE ADN El secuenciamiento de ADN es el proceso de determinar el orden exacto de los nucleótidos en una molécula de ADN. Incluye cualquier método o tecnología que se utiliza para determinar el orden de las cuatro bases: adenina, guanina, citosina y timina en una hebra de ADN. La aparición de métodos rápidos de secuenciación de ADN ha acelerado grandemente descubrimiento e investigación en todos los campos del saber biológica, médico y biotecnológico. Estas modernas tecnologías de secuenciación de ADN han sido fundamentales en la obtención de secuencias completas de genomas de numerosos tipos y especies de seres vivos, incluyendo el genoma humano, animales, vegetales y especies microbianas.
... The Human Genome Project encouraged the development of techniques for sequence whole genomes at low cost and less time-consuming. These sequencing tools are known as massive parallel sequencing (MPS), also called Next Generation Sequencing (NGS) and have allowed the routine use of single nucleotide polymorphisms (SNPs) in the last decade, even in no model species (Escalante et al. 2014). These massive sequencing tools are used by breeders to sequence large populations, study genetic composition of crop varieties, understand evolutionary relationships between cultivars and wild relatives, identify genotype-genotype (G × G) and genotypes-environment (G × A) interactions, and increase the resolution of gene and quantitative trait locus (QTL) discovery, providing the basis for modeling complex genotype-phenotype relationships at the whole-genome level (Figure 3. Cobb et al. 2013, Varshney et al. 2014. ...
... How is genomic data obtained? Several massive parallel sequencing platforms are commercially available, and while the specific methods vary, they obtain millions of short DNA sequence reads from random locations in the genome (Escalante et al. 2014). Massive sequencing technologies have been widely used for de novo sequencing and several crop and other angiosperm genomes have been sequenced, which are used as reference in no model species studies AZALEA GUERRA-GARCÍA AND DANIEL PIÑERO (Table 1). ...
Article
Full-text available
p>Background: The domestication process has left signatures in the genomes of domesticated species. Before the existence of molecular markers, only phenotypic traits could be used in domestication studies and breeding programs, but these approaches required long time and effort. In the last decades, the use of molecular markers dramatically increased, and the development of massive sequencing tools have enable to obtain thousands or even millions of molecular markers. This work focuses on domesticated plants and the main goal is to bring a general and an integrative perspective of the data, approaches and questions that can be answered using massive sequencing tools, compared to classic genetic data. Results: The use of molecular markers in the last decades has increased the efficiency and accuracy of plant breeding, allowing to access information about domestication history and to identify genes affected by domestication. Some patterns have been identified: (1) genetic diversity reduction due to demographic bottlenecks and artificial selection; (2) frequently, mutations related with domestication syndrome preexisted at low frequency in natural populations; (3) accumulation of deleterious mutation; (4) gene flow between wild and cultivated populations. There are several approaches that can be used in massive sequencing tools: de novo genome sequencing, whole genome resequencing, reduction of genome complexity using restriction enzymes, transcriptome analysis and epigenetic studies. Conclusions: Despite the progress made, enormous challenges still remain: storage of large databases; development of fast, accurate and high throughput strategies to phenotype; identification of paralogous genes in polyploid species; and the analysis of large and highly diverse genomes.
... Thus, the future directions on R S research in Mexico should be: a) Develop a base-line understanding of the biophysical controls of R S across different ecosystems in Mexico, including the responses when a land use change occurs, b) establish long-term (>5 years) observatory networks to measure R S across different ecosystems and management schemes, c) within the latter, establish manipulative experiments to obtain mechanistic knowledge of how different scenarios (e.g., increasing temperature or changing timing and magnitude of precipitation) could affect R S (Norby and Luo, 2004), d) when an array of long-term measurements has been established, large-scale modelling of R S using satellite data could be carried out (i.e., Wu et al., 2014), e) develop a Mexican database of R S records, with a quality assurance and quality control (QA/QC) protocols (Carbone and Vargas, 2008), f) integrate aboveground phenological measurements and net fluxes (e.g., phenocams, Richardson et al., 2007;Vargas et al., 2013), g) integrate belowground phenological measurements (e.g., minirhizotrons, (Hasselquist et al., 2009), h) integrate emerging disciplines to explain patterns and mechanisms (e.g., ecological genomics, Escalante et al., 2014), i) isolate autotrophic and heterotrophic respiration from total R S (Hanson et al., 2000), j) integrate other greenhouse gases related to R S measurements (e.g., CH 4 , NO, NO 2 ), k) continuous interaction of universities, research centers, and government agencies, as well as with other networks (e.g., MexFlux, Mex-LTER). ...
Article
Full-text available
Soil respiration (RS) is a CO2 efflux from the soil to the atmosphere def ined as the sum of autotrophic (respiration by roots and mycorrhizae), and heterotrophic (respiration of microorganisms that decompose fractions of organic matter and of soil fauna) respiration. Globally, RS is considered to be the second largest flux of C to the atmosphere. From published literature it is clear that its main controls are soil temperature, soil moisture, photosynthesis, organic matter inputs and soil biota composition. Despite its relevance in C cycle science, there have been only twenty eight studies in Mexico in the last decade where direct measurement of gas exchange was conducted in the f ield. These studies were held mostly in agricultural and forest ecosystems, in Central and Southern Mexico where mild subtropical conditions prevail. However, arid, semi-arid, tropical and wetland ecosystems may have an important role in Mexico’s CO2 emissions because of their extent and extensive land use changes. From the twenty eight studies, only two provided continuous measurements of RS with high temporal resolution, highlighting the need for long-term studies to evaluate the complex biophysical controls of this flux and associated processes over different ecological succession stages. We conclude that Mexico represents an important opportunity to understand its complex dynamics, in national and global context, as ecosystems in the country cover a wide range of climatic conditions. This is particularly important because deforestation and degradation of Mexican ecosystems is rapidly increasing along with expected changes in climate.
... La secuenciación de Sanger reinó desde 1975 hasta 2005, con unas lecturas promedio de ~1000 bases, una tasa de error de 0,1 % y un coste aproximado de $397 por megabase (coste mínimo en octubre de 2007, consúltese evolución de costes en: http://www.genome.gov/sequencingcosts/) (Escalante et al. 2014). Actualmente el campo de la secuenciación se encuentra dominado por las tecnologías denominadas next generation sequencing (NGS). ...
... Actualmente el campo de la secuenciación se encuentra dominado por las tecnologías denominadas next generation sequencing (NGS). Centrándonos en la plataforma de NGS más común, vemos que proporciona unas lecturas promedio de 50-250 bases, una tasa de error de 0,26 % y un coste de $0,05 por megabase (coste en abril de 2015) (Escalante et al. 2014). ...
... The main benefit of this technology is its ability to produce long reads, while restricted by its high error rate in homopolymers containing regions, and a high rate of artificial amplification [50][51][52] . The error rates of NGS are higher relative to the Sanger sequencers, and also require advanced computational tools and statistical calculations before further data processing and assembly [53] . Due to the NGS platform specific errors, presently, use of barcoding strategies, simultaneous sequencing of the samples by two different NGS platforms or high coverage sequencing have been recommended to counteract the effects of errors [54][55][56] . ...
... Such an assembly requires extensive computational power and datasets containing longer reads with higher coverage are preferable [64][65][66] . When reference genomes for assembly are available, technologies that generate short reads could also be used to have a high coverage of the metagenomes [53] . August When compared in terms of publications, Illumina technology is the most widely used platform, irrespective of application. ...
... Earlier the use of this platform was not suited for virus discovery or de novo sequencing projects due to its short reads. However, regular augmentation in read length for Illumina platforms has made it suitable for de novo assembly of genomes, at a sensitivity, comparable to specific PCR [53,67,68] . However, according to the number of publications, specifically for metagenomic studies, pyrosequencing technology (Roche 454) is preferred over the other NGS approaches producing shorter reads. ...
Article
Full-text available
Rapid development and commercial availability of next-generation sequencers (NGS) systems have dramatically changed almost every field of biological research, especially microbiology and metagenomics. Different NGS systems have been adapted and used for numerous applications in virology too. These systems are capable of rapidly sequencing and analyzing a complex mixture of nucleic acid templates, in a massively parallel fashion, making them ideal tools for viral metagenomics and discovery. This manuscript reviews the prevailing NGS technologies, their application in virus discovery to serve as a guide for the readers, working in the field of virology, public health and in biothreat mitigation programs.
Article
For over a decade, molecular short standardised DNA fragments, termed DNA barcodes, have been developed for species discrimination around the world. As of 2010, the vast majority of barcoding research was biased toward particular taxonomic groups and geographic regions largely because researchers in developed countries were the ones with the resources and capacity to carry out such work. To rectify this, the International Barcode of Life Project was launched with the intent to extend the geographic and taxonomic coverage of the barcode reference library. South Africa committed to this mission in an attempt to catalogue all of its known biodiversity and, possibly, help identify new species. To date, approximately 48 000 South African faunal barcodes are housed in the Barcode of Life Data System (BOLD), which represent only 2.3% of all known South African animal species. Although insects are the best represented in absolute terms, with over 37 000 samples recorded, they are still grossly lacking with just over 1% representation. Much like the global trend, there is a general taxonomic bias, with fish, birds and mammals showing the greatest representation. Moreover, geographic bias is also present, with the Free State province particularly under-represented on BOLD, likely owing to limited human capacity. Although few studies have been published with respect to barcoding, the majority reveal that the cytochrome c oxidase 1 (CO1) gene, used in isolation or in conjunction with other molecular markers, can greatly benefit South African biodiversity research. Several limitations of DNA barcoding are discussed and recommendations specific to South Africa provided.