Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.]

Description

  • Impact factor
    0.00
  • 5-year impact
    0.00
  • Cited half-life
    0.00
  • Immediacy index
    0.00
  • Eigenfactor
    0.00
  • Article influence
    0.00
  • ISSN
    1934-340X

Publications in this journal

  • [Show abstract] [Hide abstract]
    ABSTRACT: VisANT is a Web-based workbench for the integrative analysis of biological networks with unique features such as exploratory navigation of interaction network and multi-scale visualization and inference with integrated hierarchical knowledge. It provides functionalities for convenient construction, visualization, and analysis of molecular and higher order networks based on functional (e.g., expression profiles, phylogenetic profiles) and physical (e.g., yeast two-hybrid, chromatin-immunoprecipitation and drug target) relations from either the Predictome database or user-defined data sets. Analysis capabilities include network structure analysis, overrepresentation analysis, expression enrichment analysis etc. Additionally, network can be saved, accessed, and shared online. VisANT is able to develop and display meta-networks for meta-nodes that are structural complexes or pathways or any kind of subnetworks. Further, VisANT supports a growing number of standard exchange formats and database referencing standards, e.g., PSI-MI, KGML, BioPAX, SBML(in progress) Multiple species are supported to the extent that interactions or associations are available (i.e., public datasets or Predictome database).
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 03/2014; 8(88):8.8.1-8.8.39.
  • [Show abstract] [Hide abstract]
    ABSTRACT: Detecting somatic single nucleotide variants (SNVs) is an essential component of cancer research with next generation sequencing data. This protocol describes how to run the SomaticSniper somatic SNV detector and then filter the output to eliminate most false positives. It also includes support protocols detailing the compilation of the software.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 03/2014; 15(155):15.5.1-15.5.8.
  • [Show abstract] [Hide abstract]
    ABSTRACT: One of the greatest challenges facing modern molecular biology is understanding the complex mechanisms regulating gene expression. A fundamental step in this process requires the characterization of sequence motifs involved in the regulation of gene expression at transcriptional and post-transcriptional levels. In particular, transcription is modulated by the interaction of transcription factors (TFs) with their corresponding binding sites. Weeder, Pscan, and PscanChIP are software tools freely available for noncommercial users as a stand-alone or Web-based applications for the automatic discovery of conserved motifs in a set of DNA sequences likely to be bound by the same TFs. Input for the tools can be promoter sequences from co-expressed or co-regulated genes (for which Weeder and Pscan are suitable), or regions identified through genome wide ChIP-seq or similar experiments (Weeder and PscanChIP). The motifs are either found by a de novo approach (Weeder) or by using descriptors of the binding specificity of TFs (Pscan and PscanChIP). Curr. Protoc. Bioinform. 47:2.11.1-2.11.31. © 2014 by John Wiley & Sons, Inc.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 01/2014; 47:2.11.1-2.11.31.
  • [Show abstract] [Hide abstract]
    ABSTRACT: Cytoscape is one of the most popular open-source software tools for the visual exploration of biomedical networks composed of protein, gene, and other types of interactions. It offers researchers a versatile and interactive visualization interface for exploring complex biological interconnections supported by diverse annotation and experimental data, thereby facilitating research tasks such as predicting gene function and constructing pathways. Cytoscape provides core functionality to load, visualize, search, filter, and save networks, and hundreds of Apps extend this functionality to address specific research needs. The latest generation of Cytoscape (version 3.0 and later) has substantial improvements in function, user interface, and performance relative to previous versions. This protocol aims to jump-start new users with specific protocols for basic Cytoscape functions, such as installing Cytoscape and Cytoscape Apps, loading data, visualizing and navigating the networks, visualizing network associated data (attributes), and identifying clusters. It also highlights new features that benefit experienced users. Curr. Protoc. Bioinform. 47:8.13.1-8.13.24. © 2014 by John Wiley & Sons, Inc.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 01/2014; 47:8.13.1-8.13.24.
  • [Show abstract] [Hide abstract]
    ABSTRACT: Technological advances have enabled the use of DNA sequencing as a flexible tool to characterize genetic variation and to measure the activity of diverse cellular phenomena such as gene isoform expression and transcription factor binding. Extracting biological insight from the experiments enabled by these advances demands the analysis of large, multi-dimensional datasets. This unit describes the use of the BEDTools toolkit for the exploration of high-throughput genomics datasets. Several protocols are presented for common genomic analyses, demonstrating how simple BEDTools operations may be combined to create bespoke pipelines addressing complex questions. Curr. Protoc. Bioinform. 47:11.12.1-11.12.34. © 2014 by John Wiley & Sons, Inc.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 01/2014; 47:11.12.1-11.12.34.
  • [Show abstract] [Hide abstract]
    ABSTRACT: After raw data have been captured by mass spectrometers in biological LC-MS/MS experiments, they must be converted from vendor-specific binary files to open-format files for manipulation by most software. This protocol details the use of ProteoWizard software for this conversion, taking format features, coding options, and vendor particularities into account. This protocol will aid researchers in preparing their data for analysis by database search engines and other bioinformatics tools. Curr. Protoc. Bioinform. 46:13.24.1-13.24.9. © 2014 by John Wiley & Sons, Inc.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 01/2014; 46:13.24.1-9.
  • [Show abstract] [Hide abstract]
    ABSTRACT: Systems medicine provides insights into mechanisms of human diseases, and expedites the development of better diagnostics and drugs. To facilitate such strategies, we initiated MalaCards, a compendium of human diseases and their annotations, integrating and often remodeling information from 64 data sources. MalaCards employs, among others, the proven automatic data-mining strategies established in the construction of GeneCards, our widely used compendium of human genes. The development of MalaCards poses many algorithmic challenges, such as disease name unification, integrated classification, gene-disease association, and disease-targeted expression analysis. MalaCards displays a Web card for each of >19,000 human diseases, with 17 sections, including textual summaries, related diseases, related genes, genetic variations and tests, and relevant publications. Also included are a powerful search engine and a variety of categorized disease lists. This unit describes two basic protocols to search and browse MalaCards effectively. Curr. Protoc. Bioinform. 47:1.24.1-1.24.19. © 2014 by John Wiley & Sons, Inc.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 01/2014; 47:1.24.1-1.24.19.
  • [Show abstract] [Hide abstract]
    ABSTRACT: PeptideAtlas, SRMAtlas, and PASSEL are Web-accessible resources to support discovery and targeted proteomics research. PeptideAtlas is a multi-species compendium of shotgun proteomic data provided by the scientific community; SRMAtlas is a resource of high-quality, complete proteome SRM assays generated in a consistent manner for the targeted identification and quantification of proteins; and PASSEL is a repository that compiles and represents selected reaction monitoring data, all in an easy-to-use interface. The databases are generated from native mass spectrometry data files that are analyzed in a standardized manner including statistical validation of the results. Each resource offers search functionalities and can be queried by user-defined constraints; the query results are provided in tables or are graphically displayed. PeptideAtlas, SRMAtlas, and PASSEL are publicly available freely via the Web site http://www.peptideatlas.org. In this protocol, we describe the use of these resources, we highlight how to submit, search, collate and download data. Curr. Protoc. Bioinform. 46:13.25.1-13.25.28. © 2014 by John Wiley & Sons, Inc.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 01/2014; 46:13.25.1-13.25.28.
  • [Show abstract] [Hide abstract]
    ABSTRACT: The structures of many non-coding RNA (ncRNA) are conserved by evolution to a greater extent than their sequences. By predicting the conserved structure of two or more homologous sequences, the accuracy of secondary structure prediction can be improved as compared to structure prediction for a single sequence. This unit provides protocols for the use of four programs in the RNAstructure suite for prediction of conserved structures, Multilign, TurboFold, Dynalign, and PARTS. These programs can be run via Web servers, on the command line, or with graphical interfaces. Curr. Protoc. Bioinform. 46:12.4.1-12.4.22. © 2014 by John Wiley & Sons, Inc.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 01/2014; 46:12.4.1-12.4.22.
  • [Show abstract] [Hide abstract]
    ABSTRACT: Contemporary microbial ecology studies usually employ one or more "omics" approaches to investigate the structure and function of microbial communities. Among these, metaproteomics aims to characterize the metabolic activities of the microbial membership, providing a direct link between the genetic potential and functional metabolism. The successful deployment of metaproteomics research depends on the integration of high-quality experimental and bioinformatic techniques for uncovering the metabolic activities of a microbial community in a way that is complementary to other "meta-omic" approaches. The essential, quality-defining informatics steps in metaproteomics investigations are: (1) construction of the metagenome, (2) functional annotation of predicted protein-coding genes, (3) protein database searching, (4) protein inference, and (5) extraction of metabolic information. In this article, we provide an overview of current bioinformatic approaches and software implementations in metaproteome studies in order to highlight the key considerations needed for successful implementation of this powerful community-biology tool. Curr. Protoc. Bioinform. 46:13.26.1-13.26.14. © 2014 by John Wiley & Sons, Inc.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 01/2014; 46:13.26.1-13.26.14.
  • [Show abstract] [Hide abstract]
    ABSTRACT: Functional characterization of a protein sequence is one of the most frequent problems in biology. This task is usually facilitated by accurate three-dimensional (3-D) structure of the studied protein. In the absence of an experimentally determined structure, comparative or homology modeling can sometimes provide a useful 3-D model for a protein that is related to at least one known protein structure. Comparative modeling predicts the 3-D structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described. Curr. Protoc. Bioinform. 47:5.6.1-5.6.32. © 2014 by John Wiley & Sons, Inc.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 01/2014; 47:5.6.1-5.6.32.
  • [Show abstract] [Hide abstract]
    ABSTRACT: The advent of the next-generation sequencing data has made it possible to cost-effectively detect and characterize genomic variation in human genomes. Structural variation, including deletion, duplication, insertion, inversion and translocation, is of great importance to human genetics due to its association with many genetic diseases. BreakDancer is a bioinformatics tool that relates paired-end read alignments from a test genome to the reference genome for the purpose of comprehensively and accurately detecting various types of structural variation.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 01/2014; 2014.
  • [Show abstract] [Hide abstract]
    ABSTRACT: This unit describes how to use BWA and the Genome Analysis Toolkit (GATK) to map genome sequencing data to a reference and produce high-quality variant calls that can be used in downstream analyses. The complete workflow includes the core NGS data processing steps that are necessary to make the raw data suitable for analysis by the GATK, as well as the key methods involved in variant discovery using the GATK.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 10/2013; 11(1110):11.10.1-11.10.33.
  • [Show abstract] [Hide abstract]
    ABSTRACT: Sequence similarity searching, typically with BLAST, is the most widely used and most reliable strategy for characterizing newly determined sequences. Sequence similarity searches can identify "homologous" proteins or genes by detecting excess similarity- statistically significant similarity that reflects common ancestry. This unit provides an overview of the inference of homology from significant similarity, and introduces other units in this chapter that provide more details on effective strategies for identifying homologs. Curr. Protoc. Bioinform. 42:3.1.1-3.1.8. © 2013 by John Wiley & Sons, Inc.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 06/2013; Chapter 3:Unit3.1.
  • [Show abstract] [Hide abstract]
    ABSTRACT: The iPlant Collaborative's Discovery Environment is a unified Web portal to many bioinformatics applications and analytical workflows, including various methods of phylogenetic analysis. This unit describes example protocols for phylogenetic analyses, starting at sequence retrieval from the GenBank sequence database, through to multiple sequence alignment inference and visualization of phylogenetic trees. Methods for extracting smaller sub-trees from very large phylogenies, and the comparative method of continuous ancestral character state reconstruction based on observed morphology of extant species related to their phylogenetic relationships, are also presented. Curr. Protoc. Bioinform. 42:6.13.1-6.13.13. © 2013 by John Wiley & Sons, Inc.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 06/2013; Chapter 6:Unit6.13.
  • [Show abstract] [Hide abstract]
    ABSTRACT: Although individual tumors show surprisingly diverse genomic alterations, these events tend to occur in a limited number of pathways, and alterations that affect the same pathway tend to not co-occur in the same patient. While pathway analysis has been a powerful tool in cancer genomics, our knowledge of oncogenic pathway modules is incomplete. To systematically identify such modules, we have developed a novel method, Mutual Exclusivity Modules in Cancer (MEMo). The method searches and identifies modules characterized by three properties: (1) member genes are recurrently altered across a set of tumor samples; (2) member genes are known to or are likely to participate in the same biological process; and (3) alteration events within the modules are mutually exclusive. MEMo integrates multiple data types and maps genomic alterations to biological pathways. MEMo's mutual exclusivity uses a statistical model that preserves the number of alterations per gene and per sample. The MEMo software, source code and sample data sets are available for download at: http://cbio.mskcc.org/memo. Curr. Protoc. Bioinform. 41:8.17.1-8.17.12. © 2013 by John Wiley & Sons, Inc.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 03/2013; Chapter 8:Unit8.17.
  • [Show abstract] [Hide abstract]
    ABSTRACT: Human Proteinpedia (http://www.humanproteinpedia.org) is a publicly available proteome repository for sharing human protein data derived from multiple experimental platforms. It incorporates diverse features of the human proteome including protein-protein interactions, enzyme-substrate relationships, PTMs, subcellular localization, and expression of proteins in various human tissues and cell lines in diverse biological conditions including diseases. Through a publicly distributed annotation system developed especially for proteomic data, investigators across the globe can upload, view, and edit proteomic data even before they are published. Inclusion of information on investigators and laboratories that generated the data, as well as visualization of tandem mass spectra, stained tissue sections, protein/peptide microarrays, fluorescent micrographs, and western blots, ensures quality of proteomic data assimilated in Human Proteinpedia. Many of the protein annotations submitted to Human Proteinpedia have also been made available to the scientific community through Human Protein Reference Database (http://www.hprd.org), another resource developed by our group. In this protocol, we describe how to submit, edit, and retrieve proteomic data in Human Proteinpedia. Curr. Protoc. Bioinform. 41:1.21.1-1.21.15. © 2013 by John Wiley & Sons, Inc.
    Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] 03/2013; Chapter 1:Unit1.21.