Bing Li’s research while affiliated with Pennsylvania State University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (82)


On relative universality, regression operator, and conditional independence
  • Preprint
  • File available

April 2025

·

1 Read

Bing Li

·

Ben Jones

·

The notion of relative universality with respect to a {\sigma}-field was introduced to establish the unbiasedness and Fisher consistency of an estimator in nonlinear sufficient dimension reduction. However, there is a gap in the proof of this result in the existing literature. The existing definition of relative universality seems to be too strong for the proof to be valid. In this note we modify the definition of relative universality using the concept of \k{o}-measurability, and rigorously establish the mentioned unbiasedness and Fisher consistency. The significance of this result is beyond its original context of sufficient dimension reduction, because relative universality allows us to use the regression operator to fully characterize conditional independence, a crucially important statistical relation that sits at the core of many areas and methodologies in statistics and machine learning, such as dimension reduction, graphical models, probability embedding, causal inference, and Bayesian estimation.

Download

A Bayesian Variation of Basu’s Theorem and its Ramification in Statistical Inference

December 2023

·

13 Reads

·

1 Citation

Sankhya A

One of the celebrated results of Professor D. Basu is his 1955 paper on ancillary statistics, which established the well known Basu’s Theorem. A Bayesian version of this result, where the parameter Θ\Theta is treated as a random variable, is developed in this note, along with other extensions of the related classical results, such as Rao-Blackwell and Lehmann-Scheffé theorems and the relation between complete sufficiency and minimal sufficiency. These extensions shed new light on these fundamental theorems for frequentist statistical inference in the context Bayesian inference.


Figure 2: Averaged receiver operating characteristic curves for Model V. Left panel: n = 100; right panel: n = 200.
Comparison of sufficient graphical model, Lee et al. (2016b), Na¨ıveNa¨ıve and the champion methods in DREAM 4 Challenge
On Sufficient Graphical Models

July 2023

·

54 Reads

We introduce a sufficient graphical model by applying the recently developed nonlinear sufficient dimension reduction techniques to the evaluation of conditional independence. The graphical model is nonparametric in nature, as it does not make distributional assumptions such as the Gaussian or copula Gaussian assumptions. However, unlike a fully nonparametric graphical model, which relies on the high-dimensional kernel to characterize conditional independence, our graphical model is based on conditional independence given a set of sufficient predictors with a substantially reduced dimension. In this way we avoid the curse of dimensionality that comes with a high-dimensional kernel. We develop the population-level properties, convergence rate, and variable selection consistency of our estimate. By simulation comparisons and an analysis of the DREAM 4 Challenge data set, we demonstrate that our method outperforms the existing methods when the Gaussian or copula Gaussian assumptions are violated, and its performance remains excellent in the high-dimensional setting.



On skewed Gaussian graphical models

November 2022

·

32 Reads

·

1 Citation

Journal of Multivariate Analysis

We introduce a skewed Gaussian graphical model as an extension to the Gaussian graphical model. One of the appealing properties of the Gaussian distribution is that conditional independence can be fully characterized by the sparseness in the precision matrix. The skewed Gaussian distribution adds a shape parameter to the Gaussian distribution to take into account possible skewness in the data; thus it is more flexible than the Gaussian model. Nevertheless, the appealing property of the Gaussian distribution is retained to a large degree: the conditional independence is still characterized by the sparseness in the parameters, which now include a shape parameter in addition to the precision matrix. As a result, the skewed Gaussian graphical model can be efficiently estimated through a penalized likelihood method just as the Gaussian graphical model. We develop an algorithm to maximize the penalized likelihood based on the alternating direction method of multipliers, and establish the asymptotic normality and variable selection consistency for the new estimator. Through simulations, we demonstrate that our method performs better than the Gaussian and Gaussian copula methods when these distributional assumptions are not satisfied. The method is applied to a breast cancer MicroRNA dataset to construct a gene network, which shows better interpretability than the Gaussian graphical model.




Nonparametric Functional Graphical Modeling Through Functional Additive Regression Operator

November 2021

·

88 Reads

·

15 Citations

In this article, we develop a nonparametric graphical model for multivariate random functions. Most existing graphical models are restricted by the assumptions of multivariate Gaussian or copula Gaussian distributions, which also imply linear relations among the random variables or functions on different nodes. We relax those assumptions by building our graphical model based on a new statistical object – the functional additive regression operator. By carrying out regression and neighborhood selection at the operator level, our method can capture nonlinear relations without requiring any distributional assumptions. Moreover, the method is built up using only one-dimensional kernel, thus avoids the curse of dimensionality from which a fully nonparametric approach often suffers, and enables us to work with large-scale networks. We derive error bounds for the estimated regression operator and establish graph estimation consistency, while allowing the number of functions to diverge at the exponential rate of the sample size. We demonstrate the efficacy of our method by both simulations and analysis of an electroencephalography dataset.


B-scaling: A Novel Nonparametric Data Fusion Method

September 2021

·

15 Reads

Very often for the same scientific question, there may exist different techniques or experiments that measure the same numerical quantity. Historically, various methods have been developed to exploit the information within each type of data independently. However, statistical data fusion methods that could effectively integrate multi-source data under a unified framework are lacking. In this paper, we propose a novel data fusion method, called B-scaling, for integrating multi-source data. Consider K measurements that are generated from different sources but measure the same latent variable through some linear or nonlinear ways. We seek to find a representation of the latent variable, named B-mean, which captures the common information contained in the K measurements while takes into account the nonlinear mappings between them and the latent variable. We also establish the asymptotic property of the B-mean and apply the proposed method to integrate multiple histone modifications and DNA methylation levels for characterizing epigenomic landscape. Both numerical and empirical studies show that B-scaling is a powerful data fusion method with broad applications.


Covariance based low‐dimensional registration for function‐on‐function regression

July 2021

·

33 Reads

·

2 Citations

Stat

We propose a new low‐dimensional registration procedure that exploits the relationship between response and predictor in a function‐on‐function regression. In this context, Functional Covariance Components (FCC) provide a flexible and powerful tool to represent the data in a low‐dimensional space, capturing the most meaningful modes of dependency between the two set of curves. Based on this reduced representation, our procedure aligns simultaneously the two sets of curves, in a way that optimizes the subsequent regression analysis. To implement our procedure, we use both the Continuous Registration algorithm (CR) and a novel parallel algorithm coded in R. We then compare it to other common registration approaches via simulations and an application to the AneuRisk data.


Citations (57)


... By leveraging a higher order moment, SAVE is able to uncover the symmetric features that SIR might overlook, although its performance can diminish when response surfaces are asymmetric. Contour regression (CR; [17]) captures relevant directions in predictors by leveraging empirical directions. DR combines aspects of SIR and SAVE, utilizing both first and second conditional moments. ...

Reference:

Multivariate response directional regression: a projective resampling approach
Contour regression: A general approach to dimension reduction
  • Citing Preprint
  • August 2005

... This volume contains sixteen statistics papers by some of the most renowned authors. Seven papers are on topics of Basu's special interest, such as foundational issues (Berger, 2024;Martin, 2024), survey sampling (Di Zio et al., 2024;Banerjee, 2024), and connections among sufficiency, completeness, and ancillary (Babu and Li, 2024;Mukhopadhyay, 2024;Reid, 2024). Other papers cover various technical topics such as valid confidence interval with very limited observations (Dasgupta and Portnoy, 2024), Bayesian analysis (Mueller et al., 2024;Menger et al., 2024;Banerjee, 2024), Bayesian theory (Yan et al., 2024;Catalano et al., 2024;Sethuraman and Ghosh, 2024), prediction (Dustin and Clarke, 2024), exploratory data analysis (Xiang et al., 2024), and probability theory (Catalano et al., 2024;Xiang et al., 2024;Wolpert, 2024). ...

A Bayesian Variation of Basu’s Theorem and its Ramification in Statistical Inference
  • Citing Article
  • December 2023

Sankhya A

... Prior studies by Zareifard et al. (2016), Nghiem et al. (2022), and Sheng et al. (2023) have explored graphical models while incorporating skewness into the sparsity pattern estimation. Zareifard et al. (2016) exploit a multivariate CSN distribution to define the skew Gaussian graphical models which are estimated through a Bayesian approach. ...

On skewed Gaussian graphical models
  • Citing Article
  • November 2022

Journal of Multivariate Analysis

... Functional data refer to observations collected over multiple time points or other continuous domains (Horváth and Kokoszka, 2012;Wang et al., 2016;Kokoszka and Reimherr, 2017), common in fields like biomedical research (Gao et al., 2024), economics, and finance. In the context of functional data analysis (FDA, Yao et al. 2005;Lin et al. 2018;Ye and Hooker 2020;Lee et al. 2020;Li et al. 2021;Lee et al. 2023;Sang et al. 2024), functional censored quantile regression (Jiang et al., 2020) has emerged as a more powerful tool for handling censored data, offering a distinct approach from the functional Cox model (Qu et al., 2016). This model is particularly valuable in medical and biometric applications for exploring the relationship between a functional predictor and various quantiles of survival time. ...

Nonparametric Functional Graphical Modeling Through Functional Additive Regression Operator
  • Citing Article
  • November 2021

... In particular, accurate data on cases and hospitalizations in addition to deaths, and at a resolution much finer than that of Italian regions. Such data would allow a more systematic evaluation of the lags between the temporal patterns of mobility, contagions, illnesses and casualties-an important avenue for future studies, which could again utilize FDA tools (e.g., registration and dimension reduction techniques 36 ). Such data would also be critical to better capture predictive signals in a number of covariates-which may weaken and/or become confounded www.nature.com/scientificreports/ ...

Covariance based low‐dimensional registration for function‐on‐function regression
  • Citing Article
  • July 2021

Stat

... A doubly functional graphical model has been developed to deal with the case where functional data is sparsely observed [21]. A functional copula Gaussian graphical model was proposed to deal with marginal violation of the Gaussian assumption [22]. A conditional functional graphical models was also introduced for the graph structure that is conditioned on and thus varies with the external variables [23]. ...

Copula Gaussian Graphical Models for Functional Data
  • Citing Article
  • October 2020

... A different approach to dimension estimation is taken in a recent proposal known as predictor augmentation (Luo and Li, 2021). The full description of the method is given in Section 2, but on a heuristic level, in predictor augmentation the observed n × p data is augmented into a sample of size n × (p + r), where r is essentially a tuning parameter and the added nr variables are drawn i.i.d. ...

On order determination by predictor augmentation
  • Citing Article
  • October 2020

Biometrika

... In a more recent work, [6] proposed an aggregate inverse mean estimation (AIME) procedure that may substantially improve estimation accuracy compared to the previous methods. It incorporates the cumulative slicing scheme into the aggregate SDR idea proposed by [11] and is much less sensitive to linearity condition violations with the localization step before aggregation. [12] proposed a real-time approach for SDR that uses a principal least squares support vector machines approach to estimate the central subspace more accurately. ...

On aggregate dimension reduction
  • Citing Article
  • January 2020

Statistica Sinica

... Moreover, Weng and Yin [23] integrated the Fourier basis expansion into the minimum discrepancy framework, demonstrating how these two approaches can be combined to enhance estimation accuracy. Finally, post-SDR statistical inference [7,8] have emerged as active areas of research. For a comprehensive review of SDR methods, see Li [12]. ...

On post dimension reduction statistical inference
  • Citing Article
  • June 2020

The Annals of Statistics

... If we instead wanted to explain the most covariate variability, PCA-based methods would have selected far fewer patterns. There has been recent work (Jones et al., 2020) showing that the first PCA components tend to be more correlated with y than later components under certain conditions, if one accepts working with transformations of the predictors that cannot be interpreted on the original X scale. However, we have shown that for our scientific scope, PLS is more reliable at creating initial components that are correlated with the outcome than PCA. ...

On the predictive potential of kernel principal components
  • Citing Article
  • January 2020

Electronic Journal of Statistics