"For instance, in pathway analysis (Khatri et al., 2012) and gene set enrichment analysis (Subramanian et al., 2005), multiple genes that work together to serve a particular biological function are often analyzed jointly as a gene set. Several pathway repositories, such as the Kyoto Encyclopedia of Genes and Genomes (KEGG) (Kanehisa et al., 2004), PANTHER classification system for protein sequence data (Nikolsky and Bryant, 2009), and Reactome pathways in humans (Matthews et al., 2009) have been established, and are continually being updated. For non-Mendelian diseases and complex traits, identification of isolated genetic variants is insufficient to summarize the complex association with disease. "
[Show abstract][Hide abstract] ABSTRACT: Rapid developments in molecular technology have yielded a large amount of high throughput genetic data to understand the mechanism for complex traits. The increase of genetic variants requires hundreds and thousands of statistical tests to be performed simultaneously in analysis, which poses a challenge to control the overall Type I error rate. Combining p-values from multiple hypothesis testing has shown promise for aggregating effects in high-dimensional genetic data analysis. Several p-value combining methods have been developed and applied to genetic data; see Dai et al. (2012b) for a comprehensive review. However, there is a lack of investigations conducted for dependent genetic data, especially for weighted p-value combining methods. Single nucleotide polymorphisms (SNPs) are often correlated due to linkage disequilibrium (LD). Other genetic data, including variants from next generation sequencing, gene expression levels measured by microarray, protein and DNA methylation data, etc. also contain complex correlation structures. Ignoring correlation structures among genetic variants may lead to severe inflation of Type I error rates for omnibus testing of p-values. In this work, we propose modifications to the Lancaster procedure by taking the correlation structure among p-values into account. The weight function in the Lancaster procedure allows meaningful biological information to be incorporated into the statistical analysis, which can increase the power of the statistical testing and/or remove the bias in the process. Extensive empirical assessments demonstrate that the modified Lancaster procedure largely reduces the Type I error rates due to correlation among p-values, and retains considerable power to detect signals among p-values. We applied our method to reassess published renal transplant data, and identified a novel association between B cell pathways and allograft tolerance.
Frontiers in Genetics 02/2014; 5:32. DOI:10.3389/fgene.2014.00032
[Show abstract][Hide abstract] ABSTRACT: Gene expression signatures of toxicity and clinical response benefit both safety assessment and clinical practice; however, difficulties in connecting signature genes with the predicted end points have limited their application. The Microarray Quality Control Consortium II (MAQCII) project generated 262 signatures for ten clinical and three toxicological end points from six gene expression data sets, an unprecedented collection of diverse signatures that has permitted a wide-ranging analysis on the nature of such predictive models. A comprehensive analysis of the genes of these signatures and their nonredundant unions using ontology enrichment, biological network building and interactome connectivity analyses demonstrated the link between gene signatures and the biological basis of their predictive power. Different signatures for a given end point were more similar at the level of biological properties and transcriptional control than at the gene level. Signatures tended to be enriched in function and pathway in an end point and model-specific manner, and showed a topological bias for incoming interactions. Importantly, the level of biological similarity between different signatures for a given end point correlated positively with the accuracy of the signature predictions. These findings will aid the understanding, and application of predictive genomic signatures, and support their broader application in predictive medicine.
The Pharmacogenomics Journal 08/2010; 10(4):310-23. DOI:10.1038/tpj.2010.35 · 4.23 Impact Factor
[Show abstract][Hide abstract] ABSTRACT: Hepatocellular carcinoma (HCC), the major type of liver cancer, is among the most lethal cancers owing to its aggressive nature and frequently late detection. Therefore, the possibility to identify early diagnostic markers could be of significant benefit. Urine has especially become one of the most attractive body fluids in biomarker discovery as it can be obtained non-invasively in large quantities and is stable as compared with other body fluids. To identify potential protein biomarker for early diagnosis of HCC, we explored protein expression profiles in urine from HCC patients and normal controls (n = 44) by shotgun proteomics using nano-liquid chromatography coupled tandem mass spectrometry (nanoLC–MS/MS) and stable isotope dimethyl labeling. We have systematically mapped 91 proteins with differential expressions (p < 0.05), which included 8 down-regulated microtubule proteins and 83 up-regulated proteins involved in signal and inflammation response. Further integrated proteogenomic approach composed of proteomic, genomic and transcriptomic analysis identified that S100A9 and GRN were co-amplified (p < 0.001) and co-expressed (p < 0.01) in HCC tumors and urine samples. In addition, the amplifications of S100A9 or GRN were found to be associated with poor survival in HCC patients, and their co-amplification was also prognosed worse overall survival than individual ones. Our results suggest that urinary S100A9 and GRN as potential combinatorial biomarkers can be applied to early diagnosis of hepatocellular carcinoma and highlight the utility of onco-proteogenomics for identifying protein markers that can be applied to disease-oriented translational medicine.
Biochimica et Biophysica Acta - Clinical 03/2015; 10. DOI:10.1016/j.bbacli.2015.02.004
Note: This list is based on the publications in our database and might not be exhaustive.
Data provided are for informational purposes only. Although carefully collected, accuracy cannot be guaranteed. The impact factor represents a rough estimation of the journal's impact factor and does not reflect the actual current impact factor. Publisher conditions are provided by RoMEO. Differing provisions from the publisher's actual policy or licence agreement may be applicable.