Jean-Eudes Dazard

Case Western Reserve University, Cleveland, OH, USA

Publications (8) · 28.74 Total impact

  • Article: Joint Adaptive Mean-Variance Regularization and Variance Stabilization of High Dimensional Data.
    Jean-Eudes Dazard, J Sunil Rao
    ABSTRACT: The paper addresses a common problem in the analysis of high-dimensional high-throughput "omics" data, which is parameter estimation across multiple variables in a set of data where the number of variables is much larger than the sample size. Among the problems posed by this type of data are that variable-specific estimators of variances are not reliable and variable-wise test statistics have low power, both due to a lack of degrees of freedom. In addition, it has been observed in this type of data that the variance increases as a function of the mean. We introduce a non-parametric adaptive regularization procedure that is innovative in that: (i) it employs a novel "similarity statistic"-based clustering technique to generate local-pooled or regularized shrinkage estimators of population parameters, and (ii) the regularization is done jointly on population moments, benefiting from C. Stein's result on inadmissibility, which implies that the usual sample variance estimator is improved by a shrinkage estimator using information contained in the sample mean. From these joint regularized shrinkage estimators, we derive regularized t-like statistics and show in simulation studies that they offer more statistical power in hypothesis testing than their standard sample counterparts, or regular common value-shrinkage estimators, or when the information contained in the sample mean is simply ignored. Finally, we show that these estimators feature interesting properties of variance stabilization and normalization that can be used for preprocessing high-dimensional multivariate data. The method is available as an R package, called 'MVR' ('Mean-Variance Regularization'), downloadable from the CRAN website.
    Computational Statistics & Data Analysis 07/2012; 56(7):2317-2333. · 1.03 Impact Factor
  • Article: Human biomarker discovery and predictive models for disease progression for idiopathic pneumonia syndrome following allogeneic stem cell transplantation.
    ABSTRACT: Allogeneic hematopoietic stem cell transplantation (SCT) is the only curative therapy for many malignant and nonmalignant conditions. Idiopathic pneumonia syndrome (IPS) is a frequently fatal complication that limits successful outcomes. Preclinical models suggest that IPS represents an immune-mediated attack on the lung involving elements of both the adaptive and the innate immune system. However, the etiology of IPS in humans is less well understood. To explore the disease pathway and uncover potential biomarkers of disease, we performed two separate label-free proteomics experiments defining the plasma protein profiles of allogeneic SCT patients with IPS. Samples obtained from SCT recipients without complications served as controls. The initial discovery study, intended to explore the disease pathway in humans, identified a set of 81 IPS-associated proteins. These data revealed similarities between the known IPS pathways in mice and the condition in humans, in particular in the acute phase response. In addition, pattern recognition pathways were judged to be significant as a function of development of IPS, and from this pathway we chose the lipopolysaccharide-binding protein (LBP) as a candidate molecular diagnostic for IPS, and verified its increase as a function of disease using an ELISA. In a separately designed study, we identified protein-based classifiers that could predict, at day 0 of SCT, patients who: 1) progress to IPS and 2) respond to cytokine neutralization therapy. Using cross-validation strategies, we built highly predictive classifier models of both disease progression and therapeutic response. In sum, data generated in this report confirm previous clinical and experimental findings, provide new insights into the pathophysiology of IPS, identify potential molecular classifiers of the condition, and uncover a set of markers potentially of interest for patient stratification as a basis for individualized therapy.
    Molecular & Cellular Proteomics 02/2012; 11(6):M111.015479. · 7.40 Impact Factor
  • Article: Local sparse bump hunting reveals molecular heterogeneity of colon tumors.
    Jean-Eudes Dazard, J Sunil Rao, Sanford Markowitz
    ABSTRACT: The question of molecular heterogeneity and of tumoral phenotype in cancer remains unresolved. To understand the underlying molecular basis of this phenomenon, we analyzed genome-wide expression data of colon cancer metastasis samples, as these tumors are the most advanced and hence would be anticipated to be the group of tumors most likely to be heterogeneous, potentially exhibiting the maximum amount of genetic heterogeneity. Casting a statistical net around such a complex problem proves difficult because of the high dimensionality and multicollinearity of the gene expression space, combined with the fact that genes act in concert with one another and that not all genes surveyed might be involved. We devise a strategy to identify distinct subgroups of samples and determine the genetic/molecular signature that defines them. This involves use of the local sparse bump hunting algorithm, which provides a more suitable and biologically faithful transformed space within which to search for bumps. In addition, thanks to the variable selection feature of the algorithm, we derived a novel sparse gene expression signature, which appears to divide all colon cancer patients into two populations: a population whose expression pattern can be molecularly encompassed within the bump and an outlier population that cannot be. Although all patients within any given stage of the disease, including the metastatic group, appear clinically homogeneous, our procedure revealed two subgroups in each stage with distinct genetic/molecular profiles. We also discuss implications of such a finding in terms of early detection, diagnosis and prognosis.
    Statistics in Medicine 11/2011; 31(11-12):1203-20. · 1.88 Impact Factor
  • Article: A quantitative proteomic approach for detecting protein profiles of activated human myeloid dendritic cells.
    ABSTRACT: Dendritic cells (DC) direct the magnitude, polarity and effector function of the adaptive immune response. DC express toll-like receptors (TLR), antigen capturing and processing machinery, and costimulatory molecules, which facilitate innate sensing and T cell activation. Once activated, DC can efficiently migrate to lymphoid tissue and prime T cell responses. Therefore, DC play an integral role as mediators of the immune response to multiple pathogens. Elucidating the molecular mechanisms involved in DC activation is therefore central in gaining an understanding of host response to infection. Unfortunately, technical constraints have limited system-wide 'omic' analysis of human DC subsets collected ex vivo. Here we have applied novel proteomic approaches to human myeloid dendritic cells (mDCs) purified from 100 mL of peripheral blood to characterize specific molecular networks of cell activation at the individual patient level, and have successfully quantified over 700 proteins from individual samples containing as little as 200,000 mDCs. The proteomic and network readouts after ex vivo stimulation of mDCs with TLR3 agonists are measured and verified using flow cytometry.
    Journal of Immunological Methods 09/2011; 375(1-2):39-45. · 2.35 Impact Factor
  • Article: Local Sparse Bump Hunting.
    Jean-Eudes Dazard, J Sunil Rao
    ABSTRACT: The search for structures in real datasets, e.g. in the form of bumps, components, classes or clusters, is important as these often reveal underlying phenomena leading to scientific discoveries. One of these tasks, known as bump hunting, is to locate domains of a multidimensional input space where the target function assumes local maxima without pre-specifying their total number. A number of related methods already exist, yet are challenged in the context of high dimensional data. We introduce a novel supervised and multivariate bump hunting strategy for exploring modes or classes of a target function of many continuous variables. This addresses the issues of correlation, interpretability, and high-dimensionality (p ≫ n case), while making minimal assumptions. The method is based upon a divide and conquer strategy, combining a tree-based method, a dimension reduction technique, and the Patient Rule Induction Method (PRIM). Important to this task, we show how to estimate the PRIM meta-parameters. Using accuracy evaluation procedures such as cross-validation and ROC analysis, we show empirically how the method outperforms a naive PRIM as well as competitive non-parametric supervised and unsupervised methods in the problem of class discovery. The method has practical application especially in the case of noisy high-throughput data. It is applied to a class discovery problem in a colon cancer micro-array dataset aimed at identifying tumor subtypes in the metastatic stage. Supplemental Materials are available online.
    Journal of Computational and Graphical Statistics 12/2010; 19(4):900-929. · 1.06 Impact Factor
  • Article: Functional interactions between the LRP6 WNT co-receptor and folate supplementation.
    ABSTRACT: Crooked tail (Cd) mice bear a gain-of-function mutation in Lrp6, a co-receptor for canonical WNT signaling, and are a model of neural tube defects (NTDs), preventable with dietary folic acid (FA) supplementation. Whether the FA response reflects a direct influence of FA on LRP6 function was tested with prenatal supplementation in LRP6-deficient embryos. The enriched FA (10 ppm) diet reduced the occurrence of birth defects among all litters compared with the control (2 ppm FA) diet, but did so by increasing early lethality of Lrp6(-/-) embryos while actually increasing NTDs among nulls alive at embryonic days 10-13 (E10-13). Proliferation in cranial neural folds was reduced in homozygous Lrp6(-/-) mutants versus wild-type embryos at E10, and FA supplementation increased proliferation in wild-type but not mutant neuroepithelia. Canonical WNT activity was reduced in LRP6-deficient midbrain-hindbrain at E9.5, demonstrated in vivo by a TCF/LEF-reporter transgene. FA levels in media modulated the canonical WNT response in NIH3T3 cells, suggesting that although FA was required for optimal WNT signaling, even modest FA elevations attenuated LRP5/6-dependent canonical WNT responses. Gene expression analysis in embryos and adults showed striking interactions between targeted Lrp6 deficiency and FA supplementation, especially for mitochondrial function, folate and methionine metabolism, WNT signaling and cytoskeletal regulation that together implicate relevant signaling and metabolic pathways supporting cell proliferation, morphology and differentiation. We propose that FA supplementation rescues Lrp6(Cd/Cd) fetuses by normalizing hyperactive WNT activity, whereas in LRP6-deficient embryos, added FA further attenuates reduced WNT activity, thereby compromising development.
    Human Molecular Genetics 12/2010; 19(23):4560-72. · 7.64 Impact Factor
  • Article: Urinary protein profiles in a rat model for diabetic complications.
    ABSTRACT: Diabetes mellitus is estimated to affect approximately 24 million people in the United States and more than 150 million people worldwide. There are numerous end-organ complications of diabetes, the onset of which can be delayed by early diagnosis and treatment. Although assays for diabetes are well founded, tests for its complications lack sufficient specificity and sensitivity to adequately guide these treatment options. In our study, we employed a streptozotocin-induced rat model of diabetes to determine changes in urinary protein profiles that occur during the initial response to the attendant hyperglycemia (e.g. the first two months) with the goal of developing a reliable and reproducible method of analyzing multiple urine samples as well as providing clues to early markers of disease progression. After filtration and buffer exchange, urinary proteins were digested with a specific protease, and the relative amounts of several thousand peptides were compared across rat urine samples representing various times after administration of drug or sham control. Extensive data analysis, including imputation of missing values and normalization of all data, was followed by ANOVA to discover peptides that were significantly changing as a function of time, treatment and interaction of the two variables. The data demonstrated significant differences in protein abundance in urine before observable pathophysiological changes occur in this animal model and as a function of the measured variables. These included decreases in relative abundance of major urinary protein precursor and increases in pro-alpha collagen, the expression of which is known to be regulated by circulating levels of insulin and/or glucose. Peptides from these proteins represent potential biomarkers, which can be used to stage urogenital complications from diabetes. The expression change of a pro-alpha 1 collagen peptide was also confirmed via selected reaction monitoring.
    Molecular & Cellular Proteomics 07/2009; 8(9):2145-58. · 7.40 Impact Factor
  • Article: Studying genetic determinants of natural variation in human gene expression using Bayesian ANOVA.
    ABSTRACT: Standard genetic mapping techniques scan chromosomal segments for location of genetic linkage and association signals. The majority of these methods consider only correlations at single markers and/or phenotypes with explicit detailing of the genetic structure. These methods tend to be limited by their inability to consider the effect of large numbers of model variables jointly. In contrast, we propose a Bayesian analysis of variance (ANOVA) method to categorize individuals based on similarity of multidimensional profiles and attempt to analyze all variables simultaneously. Using Problem 1 of the Genetic Analysis Workshop 15 data set, we demonstrate the method's utility for joint analysis of gene expression levels and single-nucleotide polymorphism genotypes. We show that the method extracts similar information to that of previous genetic mapping analyses, and suggest extensions of the method for mining unique information not previously found.
    BMC Proceedings 02/2007; 1 Suppl 1:S115.