Seyoung Park

Seyoung Park
  • Professor at Sungkyunkwan University

About

21
Publications
4,306
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
238
Citations
Current institution
Sungkyunkwan University
Current position
  • Professor

Publications

Publications (21)
Article
Full-text available
Due to the prevalence of complex data, data heterogeneity is often observed in contemporary scientific studies and various applications. Motivated by studies on cancer cell lines, we consider the analysis of heterogeneous subpopulations with binary responses and high-dimensional covariates. In many practical scenarios, it is common to use a single...
Preprint
Health outcomes, such as body mass index and cholesterol levels, are known to be dependent on age and exhibit varying effects with their associated risk factors. In this paper, we propose a novel framework for dynamic modeling of the associations between health outcomes and risk factors using varying-coefficients (VC) regional quantile regression v...
Article
Full-text available
Despite the urgent need for an effective prediction model tailored to individual interests, existing models have mainly been developed for the mean outcome, targeting average people. Additionally, the direction and magnitude of covariates’ effects on the mean outcome may not hold across different quantiles of the outcome distribution. To accommodat...
Article
Full-text available
Health outcomes, such as body mass index and cholesterol levels, are known to be dependent on age and exhibit varying effects with their associated risk factors. In this paper, we propose a novel framework for dynamic modeling of the associations between health outcomes and risk factors using varying-coefficients (VC) regional quantile regression v...
Article
In this article, we study high-dimensional multivariate logistic regression models in which a common set of covariates is used to predict multiple binary outcomes simultaneously. Our work is primarily motivated from many biomedical studies with correlated multiple responses such as the cancer cell-line encyclopedia project. We assume that the under...
Article
Full-text available
A challenge in bulk gene differential expression analysis is to differentiate changes due to cell type-specific gene expression and cell type proportions. SCADIE is an iterative algorithm that simultaneously estimates cell type-specific gene expression profiles and cell type proportions, and performs cell type-specific differential expression analy...
Article
In this study, we propose a multivariate-response regression by imposing structural conditions on the underlying regression coefficient matrix motivated by an analysis of Cancer Cell Line Encyclopedia (CCLE) data consisting of resistance responses to multiple drugs and gene expression of cancer cell lines. It is important to estimate the drug resis...
Article
Full-text available
In nonparametric models, numerous penalization methods using a nonparametric series estimator have been developed for model selection and estimation. However, a penalization has been poorly understood combined with kernel smoothing. This can be attributed to the intrinsic technical and computational difficulties, which leads to different treatments...
Article
Full-text available
Testing the behavior of varying coefficients (VC) over a range of quantiles is important in the field of regression analysis. This study tests whether coefficient functions in varying quantile regression share common structural information across a certain range of quantile levels, even when linear combinations of covariates are unspecified in the...
Article
Full-text available
High-dimensional Poisson reduced-rank models have been considered for statistical inference on low-dimensional locations of the individuals based on the observations of high-dimensional count vectors. In this study, we assume sparsity on a so-called loading matrix to enhance its interpretability. The sparsity assumption leads to the use of \(L_1\)...
Article
Full-text available
Advances in high-throughput genomic technologies coupled with large-scale studies including The Cancer Genome Atlas (TCGA) project have generated rich resources of diverse types of omics data to better understand cancer etiology and treatment responses. Clustering patients into subtypes with similar disease etiologies and/or treatment responses usi...
Article
Full-text available
Motivation: A number of computational methods have been proposed recently to profile tumor microenvironment from bulk RNA data, and they have proved useful for understanding microenvironment differences among therapeutic response groups. However, these methods are not able to account for tumor proportion nor variable mRNA levels across cell types....
Article
Full-text available
We consider the problem of constructing a perturbed portfolio by utilizing a benchmark portfolio. We propose two computationally efficient portfolio optimization models, the mean-absolute deviation risk and the Dantzig-type, which can be solved using linear programing. These portfolio models push the existing benchmark toward the efficient frontier...
Article
Full-text available
Motivation: Single-cell RNA-sequencing (scRNA-seq) technology can generate genome-wide expression data at the single-cell levels. One important objective in scRNA-seq analysis is to cluster cells where each cluster consists of cells belonging to the same cell type based on gene expression patterns. Results: We introduce a novel spectral clusteri...
Article
In this work, we present an additive model for space-time data that splits the data into a temporally correlated component and a spatially correlated component. We model the spatially correlated portion using a time-varying Gaussian graphical model. Under assumptions on the smoothness of changes in covariance matrices, we derive strong single sampl...
Article
Full-text available
We study joint quantile regression at multiple quantile levels with high-dimensional covariates. Variable selection performed at individual quantile levels may lack stability across neighboring quantiles, making it difficult to understand and to interpret the impact of a given covariate on conditional quantile functions. We propose a Dantzig-type p...
Article
We consider the problem of testing significance of predictors in quantile regression, where the sample size n and the number of predictors are allowed to increase together. Unlike the quantile regression analysis for the τth quantile at a given τ∈(0,1), we aim to detect any covariate that is significant for the conditional quantiles at any level of...
Article
Neural encoding studies explore the relationships between measurements of neural activity and measurements of a behavior that is viewed as a response to that activity. The coupling between neural and behavioral measurements is typically imperfect and difficult to measure.To enhance our ability to understand neural encoding relationships, we propose...

Network

Cited By