Jinyoung Byun

Dartmouth–Hitchcock Medical Center, LEB, New Hampshire, United States

Are you Jinyoung Byun?

Claim your profile

Publications (13)58.92 Total impact

  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Background: Comparative analysis of gene expression in human tissues is important for understanding the molecular mechanisms underlying tissue-specific control of gene expression. It can also open an avenue for using gene expression in blood (which is the most easily accessible human tissue) to predict gene expression in other (less accessible) tissues, which would facilitate the development of novel gene expression based models for assessing disease risk and progression. Until recently, direct comparative analysis across different tissues was not possible due to the scarcity of paired tissue samples from the same individuals. Methods: In this study we used paired whole blood/lung gene expression data from the Genotype-Tissue Expression (GTEx) project. We built a generalized linear regression model for each gene using gene expression in lung as the outcome and gene expression in blood, age and gender as predictors. Results: For ~18 % of the genes, gene expression in blood was a significant predictor of gene expression in lung. We found that the number of single nucleotide polymorphisms (SNPs) influencing expression of a given gene in either blood or lung, also known as the number of quantitative trait loci (eQTLs), was positively associated with efficacy of blood-based prediction of that gene's expression in lung. This association was strongest for shared eQTLs: those influencing gene expression in both blood and lung. Conclusions: In conclusion, for a considerable number of human genes, their expression levels in lung can be predicted using observable gene expression in blood. An abundance of shared eQTLs may explain the strong blood/lung correlations in the gene expression.
    Preview · Article · Nov 2015 · BMC Medical Genomics

  • No preview · Conference Paper · Nov 2015
  • [Show abstract] [Hide abstract]
    ABSTRACT: Results from genome-wide association studies (GWAS) have indicated that strong single-gene effects are the exception, not the rule, for most diseases. We assessed the joint effects of germline genetic variations through a pathway-based approach that considers the tissue-specific contexts of GWAS findings. From GWAS meta-analyses of lung cancer (12 160 cases/16 838 controls), breast cancer (15 748 cases/18 084 controls) and prostate cancer (14 160 cases/12 724 controls) in individuals of European ancestry, we determined the tissue-specific interaction networks of proteins expressed from genes that are likely to be affected by disease-associated variants. Reactome pathways exhibiting enrichment of proteins from each network were compared across the cancers. Our results show that pathways associated with all three cancers tend to be broad cellular processes required for growth and survival. Significant examples include the nerve growth factor (P = 7.86 × 10−33), epidermal growth factor (P = 1.18 × 10−31) and fibroblast growth factor (P = 2.47 × 10−31) signaling pathways. However, within these shared pathways, the genes that influence risk largely differ by cancer. Pathways found to be unique for a single cancer focus on more specific cellular functions, such as interleukin signaling in lung cancer (P = 1.69 × 10−15), apoptosis initiation by Bad in breast cancer (P = 3.14 × 10−9) and cellular responses to hypoxia in prostate cancer (P = 2.14 × 10−9). We present the largest comparative cross-cancer pathway analysis of GWAS to date. Our approach can also be applied to the study of inherited mechanisms underlying risk across multiple diseases in general.
    No preview · Article · Oct 2015 · Human Molecular Genetics
  • [Show abstract] [Hide abstract]
    ABSTRACT: Autoimmune muscle diseases (myositis) comprise a group of complex phenotypes influenced by genetic and environmental factors. To identify genetic risk factors in patients of European ancestry, we conducted a genome-wide association study (GWAS) of the major myositis phenotypes in a total of 1710 cases, which included 705 adult dermatomyositis, 473 juvenile dermatomyositis, 532 polymyositis and 202 adult dermatomyositis, juvenile dermatomyositis or polymyositis patients with anti-histidyl-tRNA synthetase (anti-Jo-1) autoantibodies, and compared them with 4724 controls. Single-nucleotide polymorphisms showing strong associations (P<5 × 10(-8)) in GWAS were identified in the major histocompatibility complex (MHC) region for all myositis phenotypes together, as well as for the four clinical and autoantibody phenotypes studied separately. Imputation and regression analyses found that alleles comprising the human leukocyte antigen (HLA) 8.1 ancestral haplotype (AH8.1) defined essentially all the genetic risk in the phenotypes studied. Although the HLA DRB1*03:01 allele showed slightly stronger associations with adult and juvenile dermatomyositis, and HLA B*08:01 with polymyositis and anti-Jo-1 autoantibody-positive myositis, multiple alleles of AH8.1 were required for the full risk effects. Our findings establish that alleles of the AH8.1 comprise the primary genetic risk factors associated with the major myositis phenotypes in geographically diverse Caucasian populations.Genes and Immunity advance online publication, 20 August 2015; doi:10.1038/gene.2015.28.
    No preview · Article · Aug 2015 · Genes and immunity

  • No preview · Article · Aug 2015 · Cancer Research
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: PARK2, a gene associated with Parkinson disease, is a tumor suppressor in human malignancies. Here, we show that c.823C>T (p.Arg275Trp), a germline mutation in PARK2, is present in a family with eight cases of lung cancer. The resulting amino acid change, p.Arg275Trp, is located in the highly conserved RING finger 1 domain of PARK2, which encodes an E3 ubiquitin ligase. Upon further analysis, the c.823C>T mutation was detected in three additional families affected by lung cancer. The effect size for PARK2 c.823C>T (odds ratio = 5.24) in white individuals was larger than those reported for variants from lung cancer genome-wide association studies. These data implicate this PARK2 germline mutation as a genetic susceptibility factor for lung cancer. Our results provide a rationale for further investigations of this specific mutation and gene for evaluation of the possibility of developing targeted therapies against lung cancer in individuals with PARK2 variants by compensating for the loss-of-function effect caused by the associated variation. Copyright © 2015 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
    Full-text · Article · Jan 2015 · The American Journal of Human Genetics
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Whole-genome profiling of gene expression is a powerful tool for identifying cancer-associated genes. Genes differentially expressed between normal and tumorous tissues are usually considered to be cancer associated. We recently demonstrated that the analysis of interindividual variation in gene expression can be useful for identifying cancer associated genes. The goal of this study was to identify the best microarray data-derived predictor of known cancer associated genes. We found that the traditional approach of identifying cancer genes--identifying differentially expressed genes--is not very efficient. The analysis of interindividual variation of gene expression in tumor samples identifies cancer-associated genes more effectively. The results were consistent across 4 major types of cancer: breast, colorectal, lung, and prostate. We used recently reported cancer-associated genes (2011-2012) for validation and found that novel cancer-associated genes can be best identified by elevated variance of the gene expression in tumor samples. The observation that the high interindividual variation of gene expression in tumor tissues is the best predictor of cancer-associated genes is likely a result of tumor heterogeneity on gene level. Computer simulation demonstrates that in the case of heterogeneity, an assessment of variance in tumors provides a better identification of cancer genes than does the comparison of the expression in normal and tumor tissues. Our results thus challenge the current paradigm that comparing the mean expression between normal and tumorous tissues is the best approach to identifying cancer-associated genes; we found that the high interindividual variation in expression is a better approach, and that using variation would improve our chances of identifying cancer-associated genes.
    Preview · Article · Mar 2014 · BMC Genomics
  • [Show abstract] [Hide abstract]
    ABSTRACT: The most common data structures in the biomedical studies have been matched or unmatched designs. Data structures resulting from a hybrid of the two may create challenges for statistical inferences. The question may arise whether to use parametric or nonparametric methods on the hybrid data structure. The Early Treatment for Retinopathy of Prematurity study was a multicenter clinical trial sponsored by the National Eye Institute. The design produced data requiring a statistical method of a hybrid nature. An infant in this multicenter randomized clinical trial had high-risk prethreshold retinopathy of prematurity that was eligible for treatment in one or both eyes at entry into the trial. During follow-up, recognition visual acuity was accessed for both eyes. Data from both eyes (matched) and from only one eye (unmatched) were eligible to be used in the trial. The new hybrid nonparametric method is a meta-analysis based on combining the Hodges-Lehmann estimates of treatment effects from the Wilcoxon signed rank and rank sum tests. To compare the new method, we used the classic meta-analysis with the t-test method to combine estimates of treatment effects from the paired and two sample t-tests. We used simulations to calculate the empirical size and power of the test statistics, as well as the bias, mean square and confidence interval width of the corresponding estimators. The proposed method provides an effective tool to evaluate data from clinical trials and similar comparative studies. Copyright © 2013 John Wiley & Sons, Ltd.
    No preview · Article · Dec 2013 · Statistics in Medicine

  • No preview · Article · Jun 2012 · Cancer Research
  • [Show abstract] [Hide abstract]
    ABSTRACT: Identification of genes that are differently expressed is a common approach used to analyze genetic mechanisms underlying cancer development. However, recent study results suggest that many such genes relate to a small number of biological functions. We hypothesized that analysis of these functions provides a better understanding of tumor biology than does actual identification of these genes. We re-analyzed publicly available gene expression data for paired samples of prostate tumor and adjacent normal tissue from the same patients to identify genes differently expressed in individual tumors and then used them to identify the functions. We found significant interindividual variation in the type and the number of functions. After adjusting for redundancy and nonspecificity of the functional terms, we identified seven functions. Several of them showed a strong association with clinical traits, e.g. age at diagnosis, preoperative prostate-specific antigen concentration, Gleason grade, and biochemical recurrence. Actin cytoskeleton was the function most frequently associated with clinical traits. Of note, the association between function and clinical traits was much stronger than that between the genes differently expressed and those traits. Different prostate tumors differ in their functional profiles. Functions of differently expressed genes are strongly associated with clinical traits. This suggests that analysis of functions of differently expressed genes may provide a better description of tumor biology than does analysis of the respective genes.
    No preview · Article · May 2012
  • [Show abstract] [Hide abstract]
    ABSTRACT: Identifying genes associated with cancer development is typically accomplished by comparing mean expression values in normal and tumor tissues, which identifies differentially expressed (DE) genes. Interindividual variation (IV) in gene expression is indirectly included in DE gene identification because given the same absolute differences in means, genes with lower variance tend to have lower p-values. We explored the direct use of IV in gene expression to identify candidate genes associated with cancer development. We focused on prostate (PCa) and lung (LC) cancers and compared IV in the expression level of genes shown to be cancer related with that in all other genes in the human genome. Compared with all those other genes, cancer-related genes tended to have greater IV in normal tissues and a greater increase in IV during the transition from normal to tumorous tissue. Genes without significantly different mean expression values between tumor and normal tissues but with greater IV in tumor than in normal tissue (note: the DE-based approach completely ignores those genes) had stronger associations with clinically important features like Gleason score in PCa or tumor histology in LC than all other genes were. Our results suggest that analyzing IV in gene expression level is useful in identifying novel candidate genes associated with cancer development.
    No preview · Article · Apr 2012 · Journal of Bioinformatics and Computational Biology
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: Housekeeping (HK) genes are involved in basic cellular functions and tend to be constitutively expressed across various tissues and conditions. A number of studies have analyzed the value of HK genes as an internal standard for assessing gene expression, but the role of HK genes in cancer development has never been specifically addressed. In this study, we sought to evaluate the expression of HK genes during prostate tumorigenesis. We performed a meta-analysis of gene expression during the transition from normal prostate (NP) to localized prostate cancer (LPC) (i.e., NP > LPC) and from localized to metastatic prostate cancer (MPC) (i.e., LPC > MPC). We found that HK genes are more likely to be differentially expressed during prostate tumorigenesis than is the average gene in the human genome, suggesting that prostate tumorigenesis is driven by modulation of the expression of HK genes. Cell-cycle genes and proliferation markers were up-regulated in both NP > LPC and LPC > MPC transitions. We also found that the genes encoding ribosomal proteins were up-regulated in the NP > LPC and down-regulated in the LPC > MPC transition. The expression of heat shock proteins was up-regulated during the LPC > MPC transition, suggesting that in its advanced stages, prostate tumor is under cellular stress. The results of these analyses suggest that during prostate tumorigenesis, there is a period when the tumor is under cellular stress and, therefore, may be the most vulnerable and responsive to treatment.
    Preview · Article · Dec 2009 · International Journal of Cancer
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: The genetic mechanisms of prostate tumorigenesis remain poorly understood, but with the advent of gene expression array capabilities, we can now produce a large amount of data that can be used to explore the molecular and genetic mechanisms of prostate tumorigenesis. We conducted a meta-analysis of gene expression data from 18 gene array datasets targeting transition from normal to localized prostate cancer and from localized to metastatic prostate cancer. We functionally annotated the top 500 differentially expressed genes and identified several candidate pathways associated with prostate tumorigeneses. We found the top differentially expressed genes to be clustered in pathways involving integrin-based cell adhesion: integrin signaling, the actin cytoskeleton, cell death, and cell motility pathways. We also found integrins themselves to be downregulated in the transition from normal prostate tissue to primary localized prostate cancer. Based on the results of this study, we developed a collagen hypothesis of prostate tumorigenesis. According to this hypothesis, the initiating event in prostate tumorigenesis is the age-related decrease in the expression of collagen genes and other genes encoding integrin ligands. This concomitant depletion of integrin ligands leads to the accumulation of ligandless integrin and activation of integrin-associated cell death. To escape integrin-associated death, cells suppress the expression of integrins, which in turn alters the actin cytoskeleton, elevates cell motility and proliferation, and disorganizes prostate histology, contributing to the histologic progression of prostate cancer and its increased metastasizing potential. The results of this study suggest that prostate tumor progression is associated with the suppression of integrin-based cell adhesion. Suppression of integrin expression driven by integrin-mediated cell death leads to increased cell proliferation and motility and increased tumor malignancy.
    Full-text · Article · Sep 2009 · BMC Medical Genomics

Publication Stats

57 Citations
58.92 Total Impact Points


  • 2015
    • Dartmouth–Hitchcock Medical Center
      LEB, New Hampshire, United States
  • 2013-2015
    • Dartmouth College
      • Department of Community and Family Medicine
      Hanover, New Hampshire, United States
  • 2009-2012
    • University of Texas MD Anderson Cancer Center
      • Genitourinary Medical Oncology
      Houston, Texas, United States