Genetic Simulation Resources (GSR): A website for the registration and discovery of genetic data simulators.

Department of Genetics, the University of Texas MD Anderson Cancer Center.
Bioinformatics (Impact Factor: 4.62). 02/2013; DOI: 10.1093/bioinformatics/btt094
Source: PubMed

ABSTRACT Many simulation methods and programs have been developed to simulate genetic data of the human genome. These data have been widely used, for example, to predict properties of populations retrospectively or prospectively according to mathematically intractable genetic models, and to assist the validation, statistical inference and power analysis of a variety of statistical models. However, due to the differences in type of genetic data of interest, simulation methods, evolutionary features, input and output formats, terminologies and assumptions for different applications, choosing the right tool for a particular study can be a resource intensive process that usually involves searching, downloading and testing many different simulation programs. Genetic Simulation Resources (GSR) is a website provided by the National Cancer Institute (NCI) that aims to help researchers compare and choose the appropriate simulation tools for their studies. This website allows authors of simulation software to register their applications and describe them with well-defined attributes, thus allowing site users to search and compare simulators according to specified features. AVAILABILITY: CONTACT:

  • [Show abstract] [Hide abstract]
    ABSTRACT: Cancer risk is determined by a complex interplay of genetic and environmental factors. Genome-wide association studies (GWAS) have identified hundreds of common (minor allele frequency [MAF] > 0.05) and less common (0.01 < MAF < 0.05) genetic variants associated with cancer. The marginal effects of most of these variants have been small (odds ratios: 1.1-1.4). There remain unanswered questions on how best to incorporate the joint effects of genes and environment, including gene-environment (G × E) interactions, into epidemiologic studies of cancer. To help address these questions, and to better inform research priorities and allocation of resources, the National Cancer Institute sponsored a "Gene-Environment Think Tank" on January 10-11, 2012. The objective of the Think Tank was to facilitate discussions on (1) the state of the science, (2) the goals of G × E interaction studies in cancer epidemiology, and (3) opportunities for developing novel study designs and analysis tools. This report summarizes the Think Tank discussion, with a focus on contemporary approaches to the analysis of G × E interactions. Selecting the appropriate methods requires first identifying the relevant scientific question and rationale, with an important distinction made between analyses aiming to characterize the joint effects of putative or established genetic and environmental factors and analyses aiming to discover novel risk factors or novel interaction effects. Other discussion items include measurement error, statistical power, significance, and replication. Additional designs, exposure assessments, and analytical approaches need to be considered as we move from the current small number of success stories to a fuller understanding of the interplay of genetic and environmental factors.
    Genetic Epidemiology 11/2013; 37(7). DOI:10.1002/gepi.21756 · 2.95 Impact Factor
  • [Show abstract] [Hide abstract]
    ABSTRACT: Stochastic simulation software that simultaneously model genetic, population and environmental processes can inform many topics in molecular ecology. These include forecasting species and community response to environmental change, inferring dispersal ecology, revealing cryptic mating, quantifying past population dynamics, assessing in situ management options, and monitoring neutral and adaptive biodiversity change. Advances in population demographic-genetic simulation software, especially with respect to individual life history, landscapes, and genetic processes, are transforming and expanding the ways that molecular data can be used. The aim of this review is to explain the roles that such software can play in molecular ecology studies (whether as a principal component or a supporting function) so that researchers can decide whether, when and precisely how simulations can be incorporated into their work. First, I use seven case studies to demonstrate how simulations are employed, their specific advantage/necessity, and what alternative or complementary (non-simulation) approaches are available. I also explain how simulations can be integrated with existing spatial, environmental, historical and genetic datasets. I next describe simulation features that may be of interest to molecular ecologists, such as spatial and behavioral considerations and species' interactions, to provide guidance on how particular simulation capabilities serve particular needs. Lastly, I discuss the prospect of simulation software in emerging challenges (climate change, biodiversity monitoring, population exploitation) and opportunities (genomics, ancient DNA), in order to emphasize that the scope of simulation-based work is expanding. I also suggest practical considerations, priorities, and elements of best practice. This should accelerate the uptake of simulation approaches and firmly embed them as a versatile tool in the molecular ecologist's toolbox. This article is protected by copyright. All rights reserved.
    Molecular Ecology 04/2014; DOI:10.1111/mec.12741 · 5.84 Impact Factor
  • 14th Workshop on Algorithms in Bioinformatics, Wroclaw, Poland; 09/2014

Full-text (2 Sources)

Available from
Jun 3, 2014