
Cynthia L Smith- PhD
- Researcher at The Jackson Laboratory
Cynthia L Smith
- PhD
- Researcher at The Jackson Laboratory
About
114
Publications
16,031
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
11,333
Citations
Introduction
Current institution
Additional affiliations
October 2000 - present
Publications
Publications (114)
The biomedical research community addresses reproducibility challenges in animal studies through standardized nomenclature, improved experimental design, transparent reporting, data sharing, and centralized repositories. The ARRIVE guidelines outline documentation standards for laboratory animals in experiments, but genetic information is often inc...
The Alliance of Genome Resources (Alliance) is an extensible coalition of knowledgebases focused on the genetics and genomics of intensively-studied model organisms. The Alliance is organized as individual knowledge centers with strong connections to their research communities and a centralized software infrastructure, discussed here. Model organis...
Mouse Genome Informatics (MGI) is a federation of expertly curated information resources designed to support experimental and computational investigations into genetic and genomic aspects of human biology and disease using the laboratory mouse as a model system. The Mouse Genome Database (MGD) and the Gene Expression Database (GXD) are core MGI dat...
The Human Phenotype Ontology (HPO) is a widely used resource that comprehensively organizes and defines the phenotypic features of human disease, enabling computational inference and supporting genomic and phenotypic analyses through semantic similarity and machine learning algorithms. The HPO has widespread applications in clinical diagnostics and...
Gene inactivation can affect the process(es) in which that gene acts and causally downstream ones, yielding diverse mutant phenotypes. Identifying the genetic pathways resulting in a given phenotype helps us understand how individual genes interact in a functional network. Computable representations of biological pathways include detailed process d...
Gene inactivation can affect the process(es) in which that gene acts and causally downstream ones, yielding diverse mutant phenotypes. Identifying the genetic pathways resulting in a given phenotype helps us understand how individual genes interact in a functional network. Computable representations of biological pathways include detailed process d...
The Gene Ontology (GO) knowledgebase (http://geneontology.org) is a comprehensive resource concerning the functions of genes and gene products (proteins and noncoding RNAs). GO annotations cover genes from organisms across the tree of life as well as viruses, though most gene function knowledge currently derives from experiments carried out in a re...
Standardized gene nomenclature supports unambiguous communication and identification of the scientific literature associated with genes. To support the increasing number of annotated genomes that are now available for comparative studies, gene nomenclature authorities coordinate the assignment of approved gene names that can be readily propagated a...
In addition to naturally occurring sequence variation and spontaneous mutations, a wide array of technologies exist for modifying the mouse genome. Standardized nomenclature, including allele, transgene, and other mutation nomenclature, as well as persistent unique identifiers (PUID) are critical for effective scientific communication, comparison o...
Recombinase alleles and transgenes can be used to facilitate spatio-temporal specificity of gene disruption or transgene expression. However, the versatility of this in vivo recombination system relies on having detailed and accurate characterization of recombinase expression and activity to enable selection of the appropriate allele or transgene....
The Mouse Genome Informatics (MGI) database system combines multiple expertly curated community data resources into a shared knowledge management ecosystem united by common metadata annotation standards. MGI’s mission is to facilitate the use of the mouse as an experimental model for understanding the genetic and genomic basis of human health and d...
The Alliance of Genome Resources (the Alliance) is a combined effort of 7 knowledgebase projects: Saccharomyces Genome Database, WormBase, FlyBase, Mouse Genome Database, the Zebrafish Information Network, Rat Genome Database, and the Gene Ontology Resource. The Alliance seeks to provide several benefits: better service to the various communities s...
For over a century, researchers have used mice as models and adapted many new methods to create novel mutations in them. In the past 100+ years, we have gone from breeding strains for selected traits to inducing random mutations throughout the genome to creating designer alleles with multiple functions. Each method offers opportunities and challeng...
Standardized gene nomenclature supports unambiguous communication and identification of the scientific literature associated with genes. To support the increasing number of annotated genomes that are now available for comparative studies, gene nomenclature authorities coordinate the assignment of approved gene names that can be readily propagated a...
Motivation
Biomedical research findings are typically disseminated through publications. To simplify access to domain-specific knowledge while supporting the research community, several biomedical databases devote significant effort to manual curation of the literature—a labor intensive process. The first step toward biocuration requires identifyin...
Motivation: Biomedical research findings are typically disseminated through publications. To simplify access to domain-specific knowledge while supporting the research community, several biomedical databases devote significant effort to manual curation of the literature—a labor intensive process. The first step toward biocuration requires identifyi...
The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the community model organism knowledgebase for the laboratory mouse, a widely used animal model for comparative studies of the genetic and genomic basis for human health and disease. MGD is the authoritative source for biological reference data related to mouse genes, gene functions...
Host genetics, housing conditions, and antibiotic treatments impact microbiota diversity and composition, but microbes that inhabit the mouse gut and skin can also influence severity and penetrance of host mutant phenotypes.
There’s growing evidence that sex-based differences can influence phenotypes beyond those directly related to the reproductive system; to fully understand a gene’s function, researchers should consider both male and female subjects.
Differential gene expression based on the parental origin can have a big impact on phenotype
Recombinase-expressing mice are selected based on where and when they will activate conditional alleles, but some produce phenotypes in isolation that can complicate analysis of those alleles.
Strain contributions arising from the strain of origin and backcrosses can modify phenotypic severity and/or penetrance of mutant alleles in mouse models.
Creating a null mutation of a gene is a powerful way to examine gene function, but knocking out part of a gene does not always result in a null allele.
The Alliance of Genome Resources (Alliance) is a consortium of the major model organism databases and the Gene Ontology that is guided by the vision of facilitating exploration of related genes in human and well-studied model organisms by providing a highly integrated and comprehensive platform that enables researchers to leverage the extensive bod...
Many studies have reported on the identification of ‘essential genes’ in mice, but the context of the experimental model, including genetic background and specific molecular details of the allele, may impact the influence of alleles on viability.
The Human Phenotype Ontology (HPO)—a standardized vocabulary of phenotypic abnormalities associated with 7000+ diseases—is used by thousands of researchers, clinicians, informaticians and electronic health record systems around the world. Its detailed descriptions of clinical abnormalities and computable disease definitions have made HPO the de fac...
The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the community model organism genetic and genome resource for the laboratory mouse. MGD is the authoritative source for biological reference data sets related to mouse genes, gene functions, phenotypes, and mouse models of human disease. MGD is the primary outlet for official gene, a...
The mouse Gene Expression Database (GXD) is an extensive, well-curated community resource freely available at www.informatics.jax.org/expression.shtml. Covering all developmental stages, GXD includes data from RNA in situ hybridization, immunohistochemistry, RT-PCR, northern blot and western blot experiments in wild-type and mutant mice. GXD's gene...
Model organism databases (MODs) have been collecting and integrating biomedical research data for 30 years and were designed to meet specific needs of each model organism research community. The contributions of model organism research to understanding biological systems would be hard to overstate. Modern molecular biology methods and cost reductio...
Many bioinformatics resources with unique perspectives on the protein landscape are currently available. However, generating new knowledge from these resources requires interoperable workfows that support cross-resource queries. In this study, we employ federated queries linking information from the Protein Kinase Ontology, iPTMnet, Protein Ontolog...
Model organisms are vital to uncovering the mechanisms of human disease and developing new therapeutic tools. Researchers collecting and integrating relevant model organism and/or human data often apply disparate terminologies (vocabularies and ontologies), making comparisons and inferences difficult. A unified disease ontology is required that con...
The Mouse Genome Informatics (MGI), resource (www. informatics. jax. org) has existed for over 25 years, and over this time its data content, informatics infrastructure, and user interfaces and tools have undergone dramatic changes (Eppig et al., Mamm Genome 26:272–284, 2015). Change has been driven by scientific methodological advances, rapid impr...
Deep phenotyping has been defined as the precise and comprehensive analysis of phenotypic abnormalities in which the individual components of the phenotype are observed and described. The three components of the Human Phenotype Ontology (HPO; www.human-phenotype-ontology.org) project are the phenotype vocabulary, disease-phenotype annotations and t...
The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the key community mouse database which supports basic, translational and computational research by providing integrated data on the genetics, genomics, and biology of the laboratory mouse. MGD serves as the source for biological reference data sets related to mouse genes, gene funct...
The widespread use of CRISPR/Cas and other targeted endonuclease technologies in many species has led to an explosion in the generation of new mutations and alleles. The ability to generate many different mutations from the same target sequence either by homology-directed repair with a donor sequence or non-homologous end joining-induced insertions...
The capture of developmental phenotype information in a way that permits computational analysis requires a semantically formal system that is sufficiently powerful and flexible to deal with all of the traditional and increasingly novel observational modalities being applied to mouse embryos.
The Mammalian Phenotype (MP) Ontology provides a structu...
Deep phenotyping has been defined as the precise and comprehensive analysis of phenotypic abnormalities in which the individual components of the phenotype are observed and described. The three components of the Human Phenotype Ontology (HPO; www.human-phenotype-ontology.org) project are the phenotype vocabulary, disease-phenotype annotations and t...
The Mouse Genome Database (MGD: http://www.informatics.jax.org) is the primary community data resource for the laboratory mouse. It provides a highly integrated and highly curated system offering a comprehensive view of current knowledge about mouse genes, genetic markers and genomic features as well as the associations of those features with seque...
A vast array of data is about to emerge from the large scale high-throughput mouse knockout phenotyping projects worldwide. It is critical that this information is captured in a standardized manner, made accessible, and is fully integrated with other phenotype data sets for comprehensive querying and analysis across all phenotype data types. The vo...
The availability of and access to quality genetically defined, health-status known mouse resources is critical for biomedical research. By ensuring that mice used in research experiments are biologically, genetically, and health-status equivalent, we enable knowledge transfer, hypothesis building based on multiple data streams, and experimental rep...
A core part of the Mouse Genome Informatics (MGI) resource is the collection of mouse mutations and the annotation phenotypes and diseases displayed by mice carrying these mutations. These data are integrated with the rest of data in MGI and exported to numerous other resources. The use of mouse phenotype data to drive translational research into h...
The Mouse Genome Database (MGD, www.informatics.jax.org) is the international scientific database for genetic, genomic, and biological data on the laboratory mouse to support the research requirements of the biomedical community. To accomplish this goal, MGD provides broad data coverage, serves as the authoritative standard for mouse nomenclature f...
Congenital heart disease (CHD) is the most prevalent birth defect,
affecting nearly 1% of live births1; the incidence ofCHDis up to tenfold
higher inhumanfetuses2,3.Agenetic contributionis stronglysuggested
by the association of CHD with chromosome abnormalities
and high recurrence risk4. Here we report findings from a recessive
forward genetic scr...
1 Congenital heart disease (CHD) is the most prevalent birth defect, affecting nearly 1% of live births 1 ; the incidence of CHD is up to tenfold higher in human fetuses 2,3. A genetic contribution is strongly suggested by the association of CHD with chromosome abnormalities and high recurrence risk 4. Here we report findings from a recessive forwa...
The Human Phenotype Ontology (HPO) project, available at http://www.human-phenotype-ontology.org, provides a structured, comprehensive and well-defined set of 10,088 classes (terms) describing human phenotypic abnormalities and 13,326 subclass relations between the HPO classes. In addition we have developed logical definitions for 46% of all HPO cl...
Background:
The use of ontologies to standardize biological data and facilitate comparisons among datasets has steadily grown as the complexity and amount of available data have increased. Despite the numerous ontologies available, one area currently lacking a robust ontology is the description of vertebrate traits. A trait is defined as any measu...
A forum of the Human Variome Project (HVP) was held as a satellite to the 2012 Annual Meeting of the American Society of Human Genetics in San Francisco, California. The theme of this meeting was "Getting Ready for the Human Phenome Project". Understanding the genetic contribution to both rare single gene "Mendelian" disorders and to more complex c...
The laboratory mouse is the premier animal model for studying human biology because all life stages can be accessed experimentally,
a completely sequenced reference genome is publicly available and there exists a myriad of genomic tools for comparative and
experimental research. In the current era of genome scale, data-driven biomedical research, t...
Numerous disease syndromes are associated with regions of copy number variation (CNV) in the human genome, and in most cases the pathogenicity of the CNV is thought to be related to altered dosage of the genes contained within the affected segment. However, establishing the contribution of individual genes to the overall pathogenicity of CNV syndro...
The Mammalian Phenotype Ontology (MP) is a structured vocabulary for describing mammalian phenotypes and serves as a critical tool for efficient annotation and comprehensive retrieval of phenotype data. Importantly, the ontology contains broad and specific terms, facilitating annotation of data from initial observations or screens and detailed data...
Mouse phenotype data represents a valuable resource for the identification of disease-associated genes, especially where the molecular basis is unknown and there is no clue to the candidate gene's function, pathway involvement or expression pattern. However, until recently these data have not been systematically used due to difficulties in mapping...
Optimal curation of human diseases requires an ontology or structured vocabulary that contains terms familiar to end users, is robust enough to support multiple levels of annotation granularity, is limited to disease terms and is stable enough to avoid extensive reannotation following updates. At Mouse Genome Informatics (MGI), we currently use dis...
The Mouse Genome Database (MGD, http://www.informatics.jax.org) is the international community resource for integrated genetic, genomic and biological data about the laboratory mouse. Data in MGD are obtained through loads from major data providers and experimental consortia, electronic submissions from laboratories and from the biomedical literatu...
Ontologies are widely used to represent knowledge in biomedicine. Systematic approaches for detecting errors and disagreements are needed for large ontologies with hundreds or thousands of terms and semantic relationships. A recent approach of defining terms using logical definitions is now increasingly being adopted as a method for quality control...
The Mouse Genome Database (MGD) is the community model organism database for the laboratory mouse and the authoritative source
for phenotype and functional annotations of mouse genes. MGD includes a complete catalog of mouse genes and genome features
with integrated access to genetic, genomic and phenotypic information, all serving to further the u...
Phenotype ontologies are typically constructed to serve the needs of a particular community, such as annotation of genotype-phenotype associations in mouse or human. Here we demonstrate how these ontologies can be improved through assignment of logical definitions using a core ontology of phenotypic qualities and multiple additional ontologies from...
The mouse has long been an important model for the study of human genetic disease. Through the application of genetic engineering and mutagenesis techniques, the number of unique mutant mouse models and the amount of phenotypic data describing them are growing exponentially. Describing phenotypes of mutant mice in a computationally useful manner th...
The mouse has long been an important model for the study of human genetic disease. Through the application of genetic engineering and mutagenesis techniques, the number of unique mutant mouse models and the amount of phenotypic data describing them are growing exponentially. Describing phenotypes of mutant mice in a computationally useful manner th...
The Mouse Genome Informatics (MGI; "http://www.informatics.jax.org":http://www.informatics.jax.org) group is comprised of several collaborating projects including the Mouse Genome Database (MGD) Project, the Gene Expression Database (GXD) Project, the Mouse Tumor Biology (MTB) Database Project, and the Gene Ontology (GO) Project. Literature identif...
The Mouse Genome Database (MGD, http://www.informatics.jax.org/), integrates genetic, genomic and phenotypic information about the laboratory mouse, a primary animal model for studying
human biology and disease. Information in MGD is obtained from diverse sources, including the scientific literature and external
databases, such as EntrezGene, UniPr...
The mRNAs encoding postsynaptic components at the neuromuscular junction are concentrated in the synaptic region of muscle
fibers. Accumulation of these RNAs in the synaptic region is mediated, at least in part, by selective transcription of the
corresponding genes in synaptic myofiber nuclei. The transcriptional mechanisms that are responsible for...
Understanding the functions encoded in the mouse genome will be central to an understanding of the genetic basis of human disease. To achieve this it will be essential to be able to characterize the phenotypic consequences of variation and alterations in individual genes. Data on the phenotypes of mouse strains are currently held in a number of dif...
Understanding the functions encoded in the mouse genome will be central to an understanding of the genetic basis of human disease. To achieve this it will be essential to be able to characterise the phenotypic consequences of variation and alterations in individual genes. Data on the phenotypes of mouse strains are currently held in a number of dif...
The mouse is the premier model organism in human disease research because all of its life stages are accessible and there are myriad experimental tools for comparative analysis and specific manipulation of its genome. The Mouse Genome Informatics Database (MGI, http://www.informatics.jax.org) supports biological knowledge building for the laborator...
The Mammalian Phenotype (MP) Ontology enables robust annotation of mammalian phenotypes in the context of mutations, quantitative trait loci and strains that are used as models of human biology and disease. The MP Ontology supports different levels and richness of phenotypic knowledge and flexible annotations to individual genotypes. It continues t...
The Mouse Genome Database (MGD) forms the core of the Mouse Genome Informatics (MGI) system (http://www.informatics.jax.org), a model organism database resource for the laboratory mouse. MGD provides essential integration of experimental knowledge
for the mouse system with information annotated from both literature and online sources. MGD curates a...
The Mouse Genome Database (MGD) is one component of the Mouse Genome Informatics (MGI) system (http://www.informatics.jax.org), a community database resource for the laboratory mouse. MGD strives to provide a comprehensive knowledgebase about the
mouse with experiments and data annotated from both literature and online sources. MGD curates and pres...
The Mouse Genome Database (MGD) (http://www.informatics.jax.org) one component of a community database resource for the laboratory mouse, a key model organism for interpreting the human
genome and for understanding human biology. MGD strives to provide an extensively integrated information resource with experimental
details annotated from both lite...
Mice deficient in src and fyn or src and yes move and breathe poorly and die perinatally, consistent with defects in neuromuscular function. Src and Fyn are associated with acetylcholine receptors (AChRs) in muscle cells, and Src and Yes can act downstream of ErbB2, suggesting roles for Src family kinases in signaling pathways regulating neuromuscu...
Mice deficient in src and fyn or src and yes move and breathe poorly and die perinatally, consistent with defects in neuromuscular function. Src and Fyn are associated with acetylcholine receptors (AChRs) in muscle cells, and Src and Yes can act downstream of ErbB2, suggesting roles for Src family kinases in signaling pathways regulating neuromuscu...
The spätzle gene is required for proper specification of positional information along the dorsal-ventral axis of the Drosophila embryo and for induction of the innate immune response to fungal infection. It has been shown to encode a precursor of a Nerve Growth Factor-like ligand which is also a member of the cysknot protein superfamily. In dorsal-...