Chapter

Cross-Species Translation of Multi-way Biomarkers

06/2011; DOI:10.1007/978-3-642-21735-7_26 pp.209-216

ABSTRACT We present a Bayesian translational model for matching patterns in data sets which have neither co-occurring samples nor variables,
but only a similar experiment design dividing the samples into two or more categories. The model estimates covariate effects
related to this design and separates the factors that are shared across the data sets from those specific to one data set.
The model is designed to find similarities in medical studies, where there is great need for methods for linking laboratory
experiments with model organisms to studies of human diseases and new treatments.

KeywordsBayesian inference–cross-species modeling–multi-way modeling–translational modeling

0 0
 · 
0 Bookmarks
 · 
52 Views
  • Source
    Article: Cross-species common regulatory network inference without requirement for prior gene affiliation.
    [show abstract] [hide abstract]
    ABSTRACT: MOTIVATION: Cross-species meta-analyses of microarray data usually require prior affiliation of genes based on orthology information that often relies on sequence similarity. RESULTS: We present an algorithm merging microarray datasets on the basis of co-expression alone, without any requirement for orthology information to affiliate genes. Combining existing methods such as co-inertia analysis, back-transformation, Hungarian matching and majority voting in an iterative non-greedy hill-climbing approach, it affiliates arrays and genes at the same time, maximizing the co-structure between the datasets. To introduce the method, we demonstrate its performance on two closely and two distantly related datasets of different experimental context and produced on different platforms. Each pair stems from two different species. The resulting cross-species dynamic Bayesian gene networks improve on the networks inferred from each dataset alone by yielding more significant network motifs, as well as more of the interactions already recorded in KEGG and other databases. Also, it is shown that our algorithm converges on the optimal number of nodes for network inference. Being readily extendable to more than two datasets, it provides the opportunity to infer extensive gene regulatory networks. Availability and Implementation: Source code (MATLAB and R) freely available for download at http://www.mchips.org/supplements/moghaddasi_source.tgz.
    Bioinformatics 03/2010; 26(8):1082-90. · 5.47 Impact Factor
  • Article: Two-way analysis of high-dimensional collinear data
    [show abstract] [hide abstract]
    ABSTRACT: We present a Bayesian model for two-way ANOVA-type analysis of high-dimensional, small sample-size datasets with highly correlated groups of variables. Modern cellular measurement methods are a main application area; typically the task is differential analysis between diseased and healthy samples, complicated by additional covariates requiring a multi-way analysis. The main complication is the combination of high dimensionality and low sample size, which renders classical multivariate techniques useless. We introduce a hierarchical model which does dimensionality reduction by assuming that the input variables come in similarly-behaving groups, and performs an ANOVA-type decomposition for the set of reduced-dimensional latent variables. We apply the methods to study lipidomic profiles of a recent large-cohort human diabetes study.
    Data Mining and Knowledge Discovery 04/2012; 19(2):261-276. · 1.54 Impact Factor
  • Article: Cross species analysis of microarray expression data.
    [show abstract] [hide abstract]
    ABSTRACT: MOTIVATION: Many biological systems operate in a similar manner across a large number of species or conditions. Cross-species analysis of sequence and interaction data is often applied to determine the function of new genes. In contrast to these static measurements, microarrays measure the dynamic, condition-specific response of complex biological systems. The recent exponential growth in microarray expression datasets allows researchers to combine expression experiments from multiple species to identify genes that are not only conserved in sequence but also operated in a similar way in the different species studied. RESULTS: In this review we discuss the computational and technical challenges associated with these studies, the approaches that have been developed to address these challenges and the advantages of cross-species analysis of microarray data. We show how successful application of these methods lead to insights that cannot be obtained when analyzing data from a single species. We also highlight current open problems and discuss possible ways to address them.
    Bioinformatics 05/2009; 25(12):1476-83. · 5.47 Impact Factor

Full-text (2 Sources)

View
4 Downloads
Available from
8 Nov 2012

Keywords

Bayesian translational model
 
co-occurring samples
 
data sets
 
KeywordsBayesian inference–cross-species modeling–multi-way modeling–translational modeling
 
medical studies
 
model estimates covariate effects
 
model organisms
 
new treatments
 
samples
 
similar experiment design
 
similarities
 
specific
 
variables