[show abstract][hide abstract] ABSTRACT: To examine the significance of intra-tumor genetic heterogeneity (ITGH) of the androgen receptor gene (AR) in breast cancer, patient-matched samples of laser capture micro-dissected breast tumor cells, adjacent normal breast epithelia cells, and peripheral blood leukocytes were sequenced using a novel next generation sequencing protocol. This protocol measured the frequency of distribution of a variable AR CAG repeat length, a functional polymorphism associated with breast cancer risk. All samples exhibited some degree of ITGH with up to 30 CAG repeat length variants identified. Each type of tissue exhibited a different distribution profile of CAG repeat lengths with substantial differences in the frequencies of zero and 18-25 CAG AR variants. Tissue differences in the frequency of ARs with each of these CAG repeat lengths were significant as measured by paired, twin t-tests. These results suggest that preferential selection of 18-25 CAG repeat length variants in breast tumors may be associated with breast cancer, and support the observation that shorter CAG repeats may protect against breast cancer. They also suggest that merely identifying variant genes will be insufficient to determine the critical mutational events of oncogenesis, which will require measuring the frequency of distribution of mutations within cancerous and matching normal tissues.
[show abstract][hide abstract] ABSTRACT: Chronic Hepatitis B Virus (HBV) infection is epidemiologically associated with hepatocellular-carcinoma (HCC) but its role in HCC remains poorly understood due to technological limitations. Here, we systematically characterize HBV in HCC patients. HBV sequences were enriched from 48 HCC patients using an oligo-bead-based strategy, pooled together and sequenced using the FLX-Genome-Sequencer. In the tumors, preferential integration of HBV into promoters of genes (P<0.001) and significant enrichment of integration into chromosome 10 (P<0.01) was observed. Integration into chromosome 10 was significantly associated with poorly differentiated tumors (P<0.05). Notably, in the tumors, recurrent integration into the promoter of the human telomerase reverse transcriptase (TERT) gene was found to correlate with increased TERT expression. The preferred region within the HBV genome involved in integration as well as viral structural alteration is at the 3'-end of HBx where viral replication/transcription initiates. Upon integration, the 3'-end of the HBx is often deleted. HBx-human chimeric transcripts, the most common type of chimeric transcripts, can be expressed as chimeric proteins. Sequence variation resulting in non-conservative amino acid substitutions are commonly observed in HBV genome. This study highlights HBV as highly mutable in HCC patients with preferential regions within the host and virus genome for HBV integration/structural alterations.
[show abstract][hide abstract] ABSTRACT: We sought to determine the prevalence of hepatitis B virus (HBV) lamivudine (LAM)-resistant minority variants in subjects who once received LAM but had discontinued it prior to virus sampling. We performed direct PCR Sanger sequencing and ultra-deep pyrosequencing (UDPS) of HBV RT of plasma viruses from 45 LAM-naïve subjects and 46 LAM-experienced subjects who had discontinued LAM a median of 24 months earlier. UDPS was performed to a depth of ∼3,000 reads per nucleotide. Minority variants were defined as differences from the Sanger sequence present in ≥0.5% of UDPS reads in a sample. Sanger sequencing identified ≥1 LAM-resistance mutations (rtL80I/V, rtM204I, rtA181T) in samples from five (11%) of 46 LAM-experienced and none of 45 LAM-naïve subjects (0%; p=0.06). UDPS detected ≥1 LAM-resistance mutations (rtL80I/V, rtV173L, rtL180M, rtA181T, rtM204I/V) in 10 (22%) of the 46 LAM-experienced subjects - including five in whom LAM-resistance mutations were not identified by Sanger sequencing. Overall, LAM-resistance mutations were more likely to be present in LAM-experienced (10/46, 22%) than LAM-naïve subjects (0/45, 0%; p=0.001). The median time since LAM discontinuation was 12.8 months in the 10 subjects with a LAM-resistance mutation compared with 30.5 months in the 36 LAM-experienced subjects without a LAM-resistance mutation (p<0.001). The likelihood of detecting a LAM-resistance mutation was significantly increased using UDPS compared with Sanger sequencing and was inversely associated with the time since LAM discontinuation.
Antimicrobial Agents and Chemotherapy 10/2012; · 4.57 Impact Factor
[show abstract][hide abstract] ABSTRACT: OBJECTIVES: To determine whether pan-protease inhibitor (PI)-resistant virus populations are composed predominantly of viruses with resistance to all PIs or of diverse virus populations with resistance to different subsets of PIs. METHODS: We performed deep sequencing of plasma virus samples from nine patients with high-level genotypic and/or phenotypic resistance to all licensed PIs. The nine virus samples had a median of 12 PI resistance mutations by direct PCR Sanger sequencing. RESULTS: For each of the nine virus samples, deep sequencing showed that each of the individual viruses within a sample contained nearly all of the mutations detected by Sanger sequencing. Indeed, a median of 94.9% of deep sequence reads had each of the PI resistance mutations present as a single chromatographic peak in the Sanger sequence. A median of 5.0% of reads had all but one of the Sanger mutations that were not part of an electrophoretic mixture. CONCLUSIONS: The collinearity of PI resistance mutations in the nine virus samples demonstrated that pan-PI-resistant viruses are able to replicate in vivo despite their highly mutated protease enzymes. We hypothesize that the marked collinearity of PI resistance mutations in pan-PI-resistant virus populations results from the unique requirements for multi-PI resistance and the extensive cross-resistance conferred by many of the accessory PI resistance mutations.
Journal of Antimicrobial Chemotherapy 10/2012; · 5.34 Impact Factor
[show abstract][hide abstract] ABSTRACT: Human leukocyte antigen (HLA) genes are the most polymorphic in the human genome. They play a pivotal role in the immune response and have been implicated in numerous human pathologies, especially autoimmunity and infectious diseases. Despite their importance, however, they are rarely characterized comprehensively because of the prohibitive cost of standard technologies and the technical challenges of accurately discriminating between these highly related genes and their many allelles. Here we demonstrate a high-resolution, and cost-effective methodology to type HLA genes by sequencing, which combines the advantage of long-range amplification, the power of high-throughput sequencing platforms, and a unique genotyping algorithm. We calibrated our method for HLA-A, -B, -C, and -DRB1 genes with both reference cell lines and clinical samples and identified several previously undescribed alleles with mismatches, insertions, and deletions. We have further demonstrated the utility of this method in a clinical setting by typing five clinical samples in an Illumina MiSeq instrument with a 5-d turnaround. Overall, this technology has the capacity to deliver low-cost, high-throughput, and accurate HLA typing by multiplexing thousands of samples in a single sequencing run, which will enable comprehensive disease-association studies with large cohorts. Furthermore, this approach can also be extended to include other polymorphic genes.
Proceedings of the National Academy of Sciences 05/2012; 109(22):8676-81. · 9.74 Impact Factor
[show abstract][hide abstract] ABSTRACT: The Saccharomyces cerevisiae strains widely used for industrial fuel-ethanol production have been developed by selection, but their underlying beneficial genetic polymorphisms remain unknown. Here, we report the draft whole-genome sequence of the S. cerevisiae strain CAT-1, which is a dominant fuel-ethanol fermentative strain from the sugarcane industry in Brazil. Our results indicate that strain CAT-1 is a highly heterozygous diploid yeast strain, and the ~12-Mb genome of CAT-1, when compared with the reference S228c genome, contains ~36,000 homozygous and ~30,000 heterozygous single nucleotide polymorphisms, exhibiting an uneven distribution among chromosomes due to large genomic regions of loss of heterozygosity (LOH). In total, 58 % of the 6,652 predicted protein-coding genes of the CAT-1 genome constitute different alleles when compared with the genes present in the reference S288c genome. The CAT-1 genome contains a reduced number of transposable elements, as well as several gene deletions and duplications, especially at telomeric regions, some correlated with several of the physiological characteristics of this industrial fuel-ethanol strain. Phylogenetic analyses revealed that some genes were likely associated with traits important for bioethanol production. Identifying and characterizing the allelic variations controlling traits relevant to industrial fermentation should provide the basis for a forward genetics approach for developing better fermenting yeast strains.
[show abstract][hide abstract] ABSTRACT: Optimization of targeted cell capture with microfluidic devices continues to be a challenge. On the one hand, microfluidics allow working with microliter volumes of liquids, whereas various applications in the real world require detection of target analyte in large volumes, such as capture of rare cell types in several ml of blood. This contrast of volumes (microliter vs. ml) has prevented the emergence of microfluidic cell capture sensors in the clinical setting. Here, we study the improvement in cell capture and throughput achieved using parallel bioactivated microfluidic channels. The device consists of channels in parallel with each other tied to a single channel. We discuss fabrication and testing of our devices, and show the ability for an improvement in throughput detection of target cells.
[show abstract][hide abstract] ABSTRACT: Whole genome comparisons identified introgression from archaic to modern humans. Our analysis of highly polymorphic human leukocyte antigen (HLA) class I, vital immune system components subject to strong balancing selection, shows how modern humans acquired the HLA-B*73 allele in west Asia through admixture with archaic humans called Denisovans, a likely sister group to the Neandertals. Virtual genotyping of Denisovan and Neandertal genomes identified archaic HLA haplotypes carrying functionally distinctive alleles that have introgressed into modern Eurasian and Oceanian populations. These alleles, of which several encode unique or strong ligands for natural killer cell receptors, now represent more than half the HLA alleles of modern Eurasians and also appear to have been later introduced into Africans. Thus, adaptive introgression of archaic alleles has significantly shaped modern human immune systems.
[show abstract][hide abstract] ABSTRACT: The construction of synthetic biological systems involving millions of nucleotides is limited by the lack of high-quality synthetic DNA. Consequently, the field requires advances in the accuracy and scale of chemical DNA synthesis and in the processing of longer DNA assembled from short fragments. Here we describe a highly parallel and miniaturized method, called megacloning, for obtaining high-quality DNA by using next-generation sequencing (NGS) technology as a preparative tool. We demonstrate our method by processing both chemically synthesized and microarray-derived DNA oligonucleotides with a robotic system for imaging and picking beads directly off of a high-throughput pyrosequencing platform. The method can reduce error rates by a factor of 500 compared to the starting oligonucleotide pool generated by microarray. We use DNA obtained by megacloning to assemble synthetic genes. In principle, millions of DNA fragments can be sequenced, characterized and sorted in a single megacloner run, enabling constructive biology up to the megabase scale.
[show abstract][hide abstract] ABSTRACT: In this paper we present a scalable method based on the use of microfluidics and shear force spectroscopy which can be used for determining the affinity between molecules. Our method involves the use of functionalization of the surface of microfluidic channels with ligand molecules, and the surface of microspheres with receptor molecules. Bound beads are detached from the surface of the microchannels using pressure driven flow. The drag force required to detach the beads is used to determine the affinity of the bond holding the two molecules together. The minimum force we are able to detect is 5 pN. We have used this method to determine the binding force between protein-protein interactions and DNA base-pair interactions. We also have shown the ability of this technique to distinguish between strong and weak protein-protein interactions. Using this approach, it may be possible to multiplex an array of these functionalized channels onto a chip and probe the interactions between large varieties of biomolecules.
[show abstract][hide abstract] ABSTRACT: Developing T cells face a series of cell fate choices in the thymus and in the periphery. The role of the individual T cell receptor (TCR) in determining decisions of cell fate remains unresolved. The stochastic/selection model postulates that the initial fate of the cell is independent of TCR specificity, with survival dependent on additional TCR/coreceptor "rescue" signals. The "instructive" model holds that cell fate is initiated by the interaction of the TCR with a cognate peptide-MHC complex. T cells are then segregated on the basis of TCR specificity with the aid of critical coreceptors and signal modulators [Chan S, Correia-Neves M, Benoist C, Mathis (1998) Immunol Rev 165: 195-207]. The former would predict a random representation of individual TCR across divergent T cell lineages whereas the latter would predict minimal overlap between divergent T cell subsets. To address this issue, we have used high-throughput sequencing to evaluate the TCR distribution among key T cell developmental and effector subsets from a single donor. We found numerous examples of individual subsets sharing identical TCR sequence, supporting a model of a stochastic process of cell fate determination coupled with dynamic patterns of clonal expansion of T cells bearing the same TCR sequence among both CD4(+) and CD8+ populations.
Proceedings of the National Academy of Sciences 01/2010; 107(4):1518-23. · 9.74 Impact Factor
[show abstract][hide abstract] ABSTRACT: The HIV-1 nucleoside RT inhibitor (NRTI)-resistance mutation, K65R confers intermediate to high-level resistance to the NRTIs abacavir, didanosine, emtricitabine, lamivudine, and tenofovir; and low-level resistance to stavudine. Several lines of evidence suggest that K65R is more common in HIV-1 subtype C than subtype B viruses.
We performed ultra-deep pyrosequencing (UDPS) and clonal dideoxynucleotide sequencing of plasma virus samples to assess the prevalence of minority K65R variants in subtype B and C viruses from untreated individuals. Although UDPS of plasma samples from 18 subtype C and 27 subtype B viruses showed that a higher proportion of subtype C viruses contain K65R (1.04% vs. 0.25%; p<0.001), limiting dilution clonal sequencing failed to corroborate its presence in two of the samples in which K65R was present in >1.5% of UDPS reads. We therefore performed UDPS on clones and site-directed mutants containing subtype B- and C-specific patterns of silent mutations in the conserved KKK motif encompassing RT codons 64 to 66 and found that subtype-specific nucleotide differences were responsible for increased PCR-induced K65R mutation in subtype C viruses.
This study shows that the RT KKK nucleotide template in subtype C viruses can lead to the spurious detection of K65R by highly sensitive PCR-dependent sequencing techniques. However, the study is also consistent with the subtype C nucleotide template being inherently responsible for increased polymerization-induced K65R mutations in vivo.
PLoS ONE 01/2010; 5(6):e10992. · 3.73 Impact Factor
[show abstract][hide abstract] ABSTRACT: We have used multiplexed high-throughput sequencing to characterize changes in small RNA populations that occur during viral infection in animal cells. Small RNA-based mechanisms such as RNA interference (RNAi) have been shown in plant and invertebrate systems to play a key role in host responses to viral infection. Although homologs of the key RNAi effector pathways are present in mammalian cells, and can launch an RNAi-mediated degradation of experimentally targeted mRNAs, any role for such responses in mammalian host-virus interactions remains to be characterized. Six different viruses were examined in 41 experimentally susceptible and resistant host systems. We identified virus-derived small RNAs (vsRNAs) from all six viruses, with total abundance varying from "vanishingly rare" (less than 0.1% of cellular small RNA) to highly abundant (comparable to abundant micro-RNAs "miRNAs"). In addition to the appearance of vsRNAs during infection, we saw a number of specific changes in host miRNA profiles. For several infection models investigated in more detail, the RNAi and Interferon pathways modulated the abundance of vsRNAs. We also found evidence for populations of vsRNAs that exist as duplexed siRNAs with zero to three nucleotide 3' overhangs. Using populations of cells carrying a Hepatitis C replicon, we observed strand-selective loading of siRNAs onto Argonaute complexes. These experiments define vsRNAs as one possible component of the interplay between animal viruses and their hosts.
[show abstract][hide abstract] ABSTRACT: Objectives: K103N, the most common nonnucleoside reverse transcriptase inhibitor (NNRTI)-resistant mutation in patients with transmitted resistance and in patients receiving a failing NNRTI-containing regimen, is fully susceptible to the new NNRTI, etravirine. Therefore, we sought to determine how often NNRTI-resistant mutations other than K103N occur as minority variants in plasma samples for which standard genotypic resistance testing detects K103N alone.
Methods: We performed ultradeep pyrosequencing (UDPS; 454 Life Sciences a Roche Company, Branford, CT) of plasma virus samples from 13 treatment-naive and 20 NNRTI-experienced patients in whom standard genotypic resistance testing revealed K103N but no other major NNRTI-resistance mutations.
Results: Samples from 0 of 13 treatment-naive patients vs. 7 of 20 patients failing an NNRTI-containing regimen had minority variants with major etravirine-associated NNRTI-resistant mutations (P = 0.03, Fisher exact test): Y181C (7.0%), Y181C (3.6%) + G190A (3.2%), L100I (14%), L100I (32%) + 190A (5.4%), K101E (3.8%) + G190A (4.9%), K101E (4.0%) + G190S (4.8%), and G190S (3.1%).
Conclusions: In treatment-naive patients, UDPS did not detect additional major NNRTI-resistant mutations suggesting that etravirine may be effective in patients with transmitted K103N. In NNRTI-experienced patients, UDPS often detected additional major NNRTI-resistant mutations suggesting that etravirine may not be fully active in patients with acquired K103N.
[show abstract][hide abstract] ABSTRACT: A novel picornavirus genome was sequenced, showing 42.6%, 35.2%, and 44.6% of deduced amino acid identities corresponding to the P1, P2, and P3 regions, respectively, of the Aichi virus. Divergent strains of this new virus,which we named salivirus, were detected in 18 stool samples from Nigeria, Tunisia, Nepal, and the United States.
A statistical association was seen between virus shedding and unexplained cases of gastroenteritis in Nepal (P =0.0056). Viruses with approximately 90% nucleotide similarity, named klassevirus, were also recently reported in
three cases of unexplained diarrhea from the United States and Australia and in sewage from Spain, reflecting a
global distribution and supporting a pathogenic role for this new group of picornaviruses.
Journal of Virology 09/2009; 83(22):12002-12006. · 5.08 Impact Factor
[show abstract][hide abstract] ABSTRACT: Glaucoma is a heterogeneous group of optic neuropathies that manifests by optic nerve head cupping or degeneration of the optic nerve, resulting in a specific pattern of visual field loss. Glaucoma leads to blindness if left untreated, and is considered the second leading cause of blindness worldwide. The subgroup primary congenital glaucoma (PCG) is characterized by an anatomical defect in the trabecular meshwork, and age at onset in the neonatal or infantile period. It is the most severe form of glaucoma. CYP1B1 was the first gene genetically linked to PCG, and CYP1B1 mutations are the cause of disease in 20-100% of patients in different populations. Here, we report that LTBP2 encoding latent transforming growth factor beta binding protein 2 is a PCG causing gene, confirming results recently reported. A disease-associated locus on chromosome 14 was identified by performing whole genome autozygosity mapping in Iranian PCG families using high density single nucleotide polymorphism chips, and two disease-segregating loss of function mutations in LTBP2, p.Ser472fsX3 and p.Tyr1793fsX55, were observed in two families while sequencing candidate genes in the locus. The p.Tyr1793fsX55 mutation affects an amino acid close to the C-terminal of the encoded protein. Subsequently, LTBP2 expression was shown in human eyes, including the trabecular meshwork and ciliary processes that are thought to be relevant to the etiology of PCG.
Human Molecular Genetics 09/2009; 18(20):3969-77. · 7.69 Impact Factor
[show abstract][hide abstract] ABSTRACT: A novel picornavirus genome was sequenced, showing 42.6%, 35.2%, and 44.6% of deduced amino acid identities corresponding to the P1, P2, and P3 regions, respectively, of the Aichi virus. Divergent strains of this new virus, which we named salivirus, were detected in 18 stool samples from Nigeria, Tunisia, Nepal, and the United States. A statistical association was seen between virus shedding and unexplained cases of gastroenteritis in Nepal (P = 0.0056). Viruses with approximately 90% nucleotide similarity, named klassevirus, were also recently reported in three cases of unexplained diarrhea from the United States and Australia and in sewage from Spain, reflecting a global distribution and supporting a pathogenic role for this new group of picornaviruses.
Journal of Virology 09/2009; 83(22):12002-6. · 5.08 Impact Factor
[show abstract][hide abstract] ABSTRACT: The dynamics of emerging nucleoside and nucleotide reverse-transcriptase inhibitor (NRTI) resistance in hepatitis B virus (HBV) are not well understood because standard dideoxynucleotide direct polymerase chain reaction (PCR) sequencing assays detect drug-resistance mutations only after they have become dominant. To obtain insight into NRTI resistance, we used a new sequencing technology to characterize the spectrum of low-prevalence NRTI-resistance mutations in HBV obtained from 20 plasma samples from 11 NRTI-treated patients and 17 plasma samples from 17 NRTI-naive patients, by using standard direct PCR sequencing and ultra-deep pyrosequencing (UDPS). UDPS detected drug-resistance mutations that were not detected by PCR in 10 samples from 5 NRTI-treated patients, including the lamivudine-resistance mutation V173L (in 5 samples), the entecavir-resistance mutations T184S (in 2 samples) and S202G (in 1 sample), the adefovir-resistance mutation N236T (in 1 sample), and the lamivudine and adefovir-resistance mutations V173L, L180M, A181T, and M204V (in 1 sample). G-to-A hypermutation mediated by the apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like family of cytidine deaminases was estimated to be present in 0.6% of reverse-transcriptase genes. Genotype A coinfection was detected by UDPS in each of 3 patients in whom genotype G virus was detected by direct PCR sequencing. UDPS detected low-prevalence HBV variants with NRTI-resistance mutations, G-to-A hypermutation, and low-level dual genotype infection with a sensitivity not previously possible.
The Journal of Infectious Diseases 04/2009; 199(9):1275-85. · 5.85 Impact Factor