About
66
Publications
9,593
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
2,886
Citations
Introduction
Skills and Expertise
Current institution
Additional affiliations
August 2014 - present
August 2006 - July 2012
Publications
Publications (66)
Quantile autoregresive model is a useful extension to classical autoregresive models as it can capture the influences of conditioning variables on the location, scale and shape of the response distribution. However, at the extreme tails, standard quantile autoregression estimator is often unstable due to data sparsity. In this paper, assuming quant...
The paper develops a new marginal testing procedure to detect significant predictors that are associated with the conditional quantiles of a scalar response. The idea is to fit the marginal quantile regression on each predictor one at a time, and then to base the test on the t-statistics that are associated with the most predictive predictors. A re...
We propose a semiparametric estimator for single-index models with censored responses due to detection limits. In the presence of left censoring, the mean function cannot be identified without any parametric distributional assumptions, but the quantile function is still identifiable at upper quantile levels. To avoid parametric distributional assum...
Researchers sometimes have a priori information on relative importance of predictors that can be used to screen out covariates. An important question is whether any of the discarded covariates have predictive power when the most relevant predictors are included in the model. We consider testing whether any of the discarded covariates is significant...
The El Niño-Southern Oscillation (ENSO), Indian Ocean Dipole (IOD), North Atlantic Oscillation (NAO), Atlantic Multidecadal Oscillation (AMO) and Pacific decadal oscillation (PDO) are well understood to be major drivers for the variability of precipitation extremes over monsoon regions in China (MRC). However, research on monsoon extremes in China...
Array-based CGH experiments are designed to detect genomic aberrations or regions of DNA copy-number variation that are associated with an outcome, typically a state of disease. Most of the existing statistical methods target on detecting DNA copy number variations in a single sample or array. We focus on the detection of group effect variation, th...
Quantile regression offers a convenient tool to access the relationship between a response and covariates in a comprehensive way and it is appealing especially in applications where interests are on the tails of the response distribution. However, due to data sparsity, the finite sample estimation at tail quantiles often suffers from high variabili...
Quantile functions are important in characterizing the entire probability distribution of a random variable, especially when the tail of a skewed distribution is of interest. This article introduces new quantile function estimators for spatial and temporal data with a fused adaptive Lasso penalty to accommodate the dependence in space and time. Thi...
The bent line quantile regression describes the situation where the conditional quantile function of the response is piecewise linear but still continuous in covariates. In some applications, the change points at which the quantile functions are bent tend to be the same across quantile levels or for quantile levels lying in a certain region. To cap...
The paper discusses the asymptotic validity of posterior inference of pseudo-Bayesian quantile regression methods with complete or censored data when an asymmetric Laplace likelihood is used. The asymmetric Laplace likelihood has a special place in the Bayesian quantile regression framework because the usual quantile regression estimator can be der...
In survival analysis, the accelerated failure time model is a useful alternative to the popular Cox proportional hazards model due to its easy interpretation. Current estimation methods for the accelerated failure time model mostly assume independent and identically distributed random errors, but in many applications the conditional variance of log...
In quantile regression, it is of interest to determine whether a covariate has varying or constant effect across quantiles, since in situations where the quantile coefficients share some common features we can improve the estimation efficiency through joint modeling of multiple quantiles. To automatically perform estimation and detection of the int...
Joint modeling techniques have become a popular strategy for studying the association between a response and one or more longitudinal covariates. Motivated by the GenIMS study, where it is of interest to model the event of survival using censored longitudinal biomarkers, a joint model is proposed for describing the relationship between a binary out...
We develop a method for sequential detection of structural changes in linear quantile regression models. We establish the asymptotic properties of the proposed test statistic, and demonstrate the advantages of the proposed method over existing tests through simulation.
Based on a semiparametric Bayesian framework, a joint-quantile regression method is developed for analyzing clustered data, where random effects are included to accommodate the intra-cluster dependence. Instead of posing any parametric distributional assumptions on the random errors, the proposed method approximates the central density by linearly...
Even though most work on change-point estimation focuses on changes in the mean, changes in the variance or in the tail distribution can lead to more extreme events. In this paper, we develop a new method of detecting and estimating the change-points in the tail of multiple time series data. In addition, we adapt existing tail change-point detectio...
We develop a new procedure for testing change points due to a covariate threshold in regression quantiles. The proposed test is based on the CUSUM of the subgradient of the quantile objective function and requires fitting the model only under the null hypothesis. The critical values can be obtained by simulating the Gaussian process that characteri...
Examination of multiple conditional quantile functions provides a comprehensive view of the relationship between the response and covariates. In situations where quantile slope coefficients share some common features, estimation efficiency and model interpretability can be improved by utilizing such commonality across quantiles. Furthermore, elimin...
Length-biased time-to-event data commonly arise in epidemiological cohort studies and cross-sectional surveys. Ignoring length-biased sampling often leads to severe bias in estimating the survival time in the general population. We propose a flexible quantile regression framework for analysing the covariate effects on the population survival time u...
Models for survival data generally assume that covariates are fully observed. However, in medical studies it is not uncommon for biomarkers to be censored at known detection limits. A computationally-efficient multiple imputation procedure for modeling survival data with covariates subject to detection limits is proposed. This procedure is develope...
In this paper, we propose a two-stage variable selection procedure for high dimensional quantile varying coefficient models. The proposed method is based on basis function approximation and LASSO-type penalties. We show that the first stage penalized estimator with LASSO penalty reduces the model from ultra-high dimensional to a model that has size...
Conventional analysis using quantile regression typically focuses on fitting the regression model at different quantiles separately. However, in situations where the quantile coefficients share some common feature, joint modeling of multiple quantiles to accommodate the commonality often leads to more efficient estimation. One example of common fea...
The estimation of extreme conditional quantiles is an important issue in numerous disciplines. Quantile regression (QR) provides a natural way to capture the covariate effects at different tails of the response distribution. However, without any distributional assumptions, estimation from conventional QR is often unstable at the tails, especially f...
Quantile regression has emerged as a powerful tool in survival analysis as it directly links the quantiles of patients' survival times to their demographic and genomic profiles, facilitating the identification of important prognostic factors. In view of limited work on variable selection in the context, we develop a new adaptive-lasso-based variabl...
Censored observations are a common occurrence in biomedical data sets. Although a large amount of research has been devoted to estimation and inference for data with censored responses, very little research has focused on proper statistical procedures when predictors are censored. In this paper, we consider statistical methods for dealing with mult...
In this paper, we develop a new variable selection procedure for quantile varying coefficient models with longitudinal data. The proposed method is based on basis function approximation and a class of group versions of the adaptive LASSO penalty, which penalizes the LγLγ norm of the within-group coefficients with γ≥1γ≥1. We show that with properly...
Muscular strength, usually quantified through the grip strength, can be used in humans and animals as an indicator of neuromuscular function or to assess hand function in patients with trauma or congenital problems. Because grip strength cannot be accurately measured, several contaminated measurements are often taken on the same subject. A research...
Estimation of conditional quantiles at very high or low tails is of interest in numerous applications. Quantile regression provides a convenient and natural way of quantifying the impact of covariates at different quantiles of a response distribution. However, high tails are often associated with data sparsity, so quantile regression estimation can...
Quantile regression in the presence of fixed censoring has been studied extensively in the literature. However, existing methods either suffer from computational instability or require complex procedures involving trimming and smoothing, which complicates the asymptotic theory of the resulting estimators. In this paper, we propose a simple estimato...
In varying coefficient models, three types of variable selection problems are of practical interests: separation of varying and constant effects, selection of variables with nonzero varying effects, and selection of variables with nonzero con-stant effects. Existing variable selection methods in the literature often focus on only one of the three t...
Quantile regression has emerged as one of the standard tools for regression analysis that enables a proper assessment of the complete conditional distribution of responses even in the presence of heteroscedastic errors. Quantile regression estimates ...
Statistical inference in censored quantile regression is challenging, partly due to the unsmoothness of the quantile score function. A new procedure is developed to estimate the variance of Bang and Tsiatis's inverse-censoring-probability weighted estimator for censored quantile regression by employing the idea of induced smoothing. The proposed va...
This paper considers generalized linear quantile regression for competing risks data when the failure type may be missing. Two estimation procedures for the regression co-efficients, including an inverse probability weighted complete-case estimator and an augmented inverse probability weighted estimator, are discussed under the assumption that the...
We develop a new multiple imputation approach for M-regression models with censored covariates. Instead of specifying parametric likelihoods, our method imputes the censored covariates by their conditional quantiles given the observed data, where the conditional quantiles are estimated through fitting a censored quantile regression process. The res...
Most existing methods for identifying aberrant regions with array CGH data are confined to a single target sample. Focusing on the comparison of multiple samples from two different groups, we develop a new penalized regression approach with a fused adaptive lasso penalty to accommodate the spatial dependence of the clones. The nonrandom aberrant ge...
We develop two empirical likelihood-based inference procedures for longitudinal data under the framework of quantile regression. The proposed methods avoid estimating the unknown error density function and the intra-subject correlation involved in the asymptotic covariance matrix of the quantile estimators. By appropriately smoothing the quantile s...
ABSTRACT Anthraquinone (AQ)-based repellents have been shown to reduce Canada goose (Branta canadensis) use of turfgrass; however, impacts of frequent mowing on efficacy of AQ have not been studied. Our objective was to determine efficacy and longevity of a rain-fast AQ-based avian repellent, FlightControl® PLUS (FCP), as a deterrent of free-rangin...
Since quantile regression curves are estimated individually, the quantile curves can cross, leading to an invalid distribution
for the response. A simple constrained version of quantile regression is proposed to avoid the crossing problem for both linear
and nonparametric quantile curves. A simulation study and a reanalysis of tropical cyclone inte...
We propose a new approach for analyzing skewed and heteroscedastic health care cost data through regression of the conditional quantiles of the transformed cost. Using the appealing equivariance property of quantiles to monotone transformations, we propose a distribution-free estimator of the conditional mean cost on the original scale. The propose...
In this paper, we develop an efficient moments-based permutation test approach to improve the tests computational efficiency by approximating the permutation distribution of the test statistic with Pearson distribution series. This approach involves the calculation of the first four moments of the permutation distribution. We propose a novel recurs...
Semiparametric models are often considered for analyzing longitudinal data for a good balance between flexibility and parsimony. In this paper, we study a class of marginal partially linear quantile models with possibly varying coefficients. The functional coefficients are estimated by basis function approximations. The estimation procedure is easy...
Quantile regression has emerged as a useful supplement to ordinary mean regression. Traditional frequentist quantile regression
makes very minimal assumptions on the form of the error distribution and thus is able to accommodate nonnormal errors, which
are common in many applications. However, inference for these models is challenging, particularly...
Censored quantile regression ofiers a valuable supplement to Cox propor- tional hazards model for survival analysis. Existing work in the literature of- ten requires stringent assumptions, such as unconditional independence of the survival time and the censoring variable or global linearity at all quantile lev- els. Moreover, some of the work use r...
This paper develops two weighted quantile rank score tests for the sig-nificance of fixed effects in a class of mixed models with nonhomogeneous groups. One test is constructed by weighting the residuals to account for heteroscedastic-ity, while the other test is based on asymptotically optimal weights accounting for both heteroscedasticity and cor...
We develop inference procedures for longitudinal data where some of the measurements are censored by fixed constants. We consider a semi-parametric quantile regression model that makes no distributional assumptions. Our research is motivated by the lack of proper inference procedures for data from biomedical studies where measurements are censored...
Recurrent chromosomal aberrations in solid tumors can reveal the genetic pathways involved in the evolution of a malignancy and in some cases predict biological behavior. However, the role of individual genetic backgrounds in shaping karyotypes of sporadic tumors is unknown. The genetic structure of purebred dog breeds, coupled with their susceptib...
Numerous attributes render the domestic dog a highly pertinent model for cancer-associated gene discovery. We performed microarray-based comparative genomic hybridization analysis of 60 spontaneous canine intracranial tumors to examine the degree to which dog and human patients exhibit aberrations of ancestrally related chromosome regions, consiste...
Microarray-based comparative genomic hybridization (aCGH) analysis of 38 cases of canine appendicular osteosarcoma. Each case was evaluated using a genomic microarray comprising a panel of cytogenetically validated dog bacterial artificial chromosome (BAC) clones. The BAC address of each clone is listed in column one. The chromosome to which each c...
In this paper, we develop an efficient moments-based permutation test approach to improve the test's computational efficiency by approximating the permutation distribution of the test statistic with Pearson distribution series. This approach involves the calculation of the first four moments of the permutation distribution. We propose a novel recur...
Due to the small number of replicates in typical gene microarray experiments, the performance of statistical inference is often unsatisfactory without some form of information-sharing across genes. In this article, we propose an enhanced quantile rank score test (EQRS) for detecting differential expression in GeneChip studies by analyzing the quant...
It is a fascinating yet challenging problem to accurately and efficiently localize regionally distinct features between face groups in multi-dimensional signal processing and analysis. Given a data with unknown distribution and small sample size, we propose a new statistical analysis framework using hybrid randomization (i.e., permutation) tests to...
DNA abundance provides important information about cell physiology and proliferation activity. In a typical in vitro cellular assay, the distribution of the DNA content within a sample is comprised of cell debris, G0/G1-, S-, and G2/M-phase cells. In some circumstances, there may be a collection of cells that contain more than two copies of DNA. Th...
We consider testing for differentially expressed genes in GeneChip studies by modeling and analyzing the quantiles of gene expression through probe level measurements. By developing a robust rank score test for linear quantile models with a random effect, we propose a reliable test for detecting differences in certain quantiles of the intensity dis...
In this article we consider testing for differentially expressed genes in GeneChip studies by modeling and analyzing the quantiles of gene expression through probe level measurements. By developing a robust rank score test for linear quantile models with a random effect, we propose a reliable test for detecting differences in certain quantiles of t...
Table 4, Table 5, Table 6, Table 7.
NCI60 cell lines are derived from cancers of 9 tissue origins and have been invaluable in vitro models for cancer research and anti-cancer drug screen. Although extensive studies have been carried out to assess the molecular features of NCI60 cell lines related to cancer and their sensitivities to more than 100,000 chemical compounds, it remains un...
Estrogen regulates thymic development and involution and modulates immune function. Despite its critical role in thymus, as well as in autoimmune disorders, the mechanism by which estrogen affects the thymus is not well understood. We previously reported that the estrogenic soy isoflavone genistein, as well as 17beta-estradiol (E2), could induce th...
As gene expression profile data from DNA microarrays accumulate rapidly, there is a natural need to compare data across labs and platforms. Comparisons of microarray data can be quite challenging due to data complexity and variability. Different labs may adopt different technology platforms. One may ask about the degree of agreement we can expect f...