Conference Proceeding

Morphological signatures and genomic correlates in glioblastoma

Center for Comprehensive Inf., Emory Univ., Atlanta, GA, USA
Proceedings / IEEE International Symposium on Biomedical Imaging: from nano to macro. IEEE International Symposium on Biomedical Imaging 05/2011; DOI:10.1109/ISBI.2011.5872714 pp.1624 - 1627 In proceeding of: Biomedical Imaging: From Nano to Macro, 2011 IEEE International Symposium on
Source: IEEE Xplore

ABSTRACT Large multimodal datasets such as The Cancer Genome Atlas present an opportunity to perform correlative studies of tissue morphology and genomics to explore the morphological phenotypes associated with gene expression and genetic alterations. In this paper we present an investigation of Cancer Genome Atlas data that correlates morphology with recently discovered molecular subtypes of glioblastoma. Using image analysis to segment and extract features from millions of cells, we calculate high-dimensional morphological signatures to describe trends of nuclear morphology and cytoplasmic staining in whole-slide images. We illustrate the similarities between the analysis of these signatures and predictive studies of gene expression, both in terms of limited sample size and high-dimensionality. Our top-down analysis demonstrates the power of morphological signatures to predict clinically-relevant molecular tumor subtypes, with 85.4% recognition of the proneural subtype. A complementary bottom-up analysis shows that self-aggregating clusters have statistically significant associations with tumor subtype and reveals the existence of remarkable structure in the morphological signature space of glioblastomas.

0 0
 · 
0 Bookmarks
 · 
51 Views
  • Article: Quantification of histochemical staining by color deconvolution.
    [show abstract] [hide abstract]
    ABSTRACT: To develop a flexible method of separation and quantification of immunohistochemical staining by means of color image analysis. An algorithm was developed to deconvolve the color information acquired with red-green-blue (RGB) cameras and to calculate the contribution of each of the applied stains based on stain-specific RGB absorption. The algorithm was tested using different combinations of diaminobenzidine, hematoxylin and eosin at different staining levels. Quantification of the different stains was not significantly influenced by the combination of multiple stains in a single sample. The color deconvolution algorithm resulted in comparable quantification independent of the stain combinations as long as the histochemical procedures did not influence the amount of stain in the sample due to bleaching because of stain solubility and saturation of staining was prevented. This image analysis algorithm provides a robust and flexible method for objective immunohistochemical analysis of samples stained with up to three different stains using a laboratory microscope, standard RGB camera setup and the public domain program NIH Image.
    Analytical and quantitative cytology and histology / the International Academy of Cytology [and] American Society of Cytology 09/2001; 23(4):291-9. · 0.41 Impact Factor
  • Source
    Article: Diagnosis of multiple cancer types by shrunken centroids of gene expression.
    [show abstract] [hide abstract]
    ABSTRACT: We have devised an approach to cancer class prediction from gene expression profiling, based on an enhancement of the simple nearest prototype (centroid) classifier. We shrink the prototypes and hence obtain a classifier that is often more accurate than competing methods. Our method of "nearest shrunken centroids" identifies subsets of genes that best characterize each class. The technique is general and can be used in many other classification problems. To demonstrate its effectiveness, we show that the method was highly efficient in finding genes for classifying small round blue cell tumors and leukemias.
    Proceedings of the National Academy of Sciences 06/2002; 99(10):6567-72. · 9.68 Impact Factor
  • Source
    Article: Local-learning-based feature selection for high-dimensional data analysis.
    [show abstract] [hide abstract]
    ABSTRACT: This paper considers feature selection for data classification in the presence of a huge number of irrelevant features. We propose a new feature-selection algorithm that addresses several major issues with prior work, including problems with algorithm implementation, computational complexity, and solution accuracy. The key idea is to decompose an arbitrarily complex nonlinear problem into a set of locally linear ones through local learning, and then learn feature relevance globally within the large margin framework. The proposed algorithm is based on well-established machine learning and numerical analysis techniques, without making any assumptions about the underlying data distribution. It is capable of processing many thousands of features within minutes on a personal computer while maintaining a very high accuracy that is nearly insensitive to a growing number of irrelevant features. Theoretical analyses of the algorithm's sample complexity suggest that the algorithm has a logarithmical sample complexity with respect to the number of features. Experiments on 11 synthetic and real-world data sets demonstrate the viability of our formulation of the feature-selection problem for supervised learning and the effectiveness of our algorithm.
    IEEE Transactions on Software Engineering 09/2010; 32(9):1610-26. · 1.98 Impact Factor

Full-text (2 Sources)

View
3 Downloads
Available from
10 Jan 2013

Keywords

Cancer Genome Atlas data
 
Cancer Genome Atlas present
 
clinically-relevant molecular tumor subtypes
 
complementary bottom-up analysis
 
correlative studies
 
gene expression
 
genetic alterations
 
genomics
 
glioblastomas
 
image analysis
 
Large multimodal datasets
 
limited sample size
 
molecular subtypes
 
nuclear morphology
 
proneural subtype
 
remarkable structure
 
tumor subtype
 
whole-slide images