Young-Geun Choi’s research while affiliated with Sungkyunkwan University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (3)


A comparison study of weighting-based estimators for average treatment effects on the treated in case of rare exposure
  • Article

February 2025

Korean Journal of Applied Statistics

Eun-Kyoung Lee

·

Young-Geun Choi

Parametric methods and their nonparametric counterparts in the studies
Parametric analysis and normality assumption in phase 3 trials with small sample sizes
  • Preprint
  • File available

May 2024

·

82 Reads

·

1 Citation

Seong Kyung Kim

·

Soeun Kim

·

Jisu Park

·

[...]

·

Kyungim Kim

Background In studies with small sample sizes, the use of nonparametric methods is generally recommended for statistical analysis. However, various studies continue to employ parametric analysis without verifying the assumption of a normal distribution. Objectives To assess the current utilization of parametric and nonparametric methods for primary outcomes, as well as the reporting of normality assumptions, in phase 3 clinical trials with small sample sizes. Methods All phase 3 trials registered on ClinicalTrials.gov until September 12, 2023, were collected. After undergoing a two-step selection process, only publications with a sample size per group of less than 30, involving two or more groups, and specifying the statistical methods used to compare the means or medians of primary outcomes between groups were selected. Statistical methods were categorized as nonparametric, parametric, and either parametric or nonparametric. The reporting of normality assumptions was also evaluated. Results A total of 317 studies were assessed in this study. Among these studies, 164 (51.7%) studies conducted parametric analysis, and 111 (35.0%) studies employed nonparametric analysis; however, 42 (13.2%) studies conducted parametric or nonparametric analysis without specifying which method was used. In addition, 63.1% of the total studies did not report normality assumptions. Specifically, within the subset of studies with parametric analysis, 70.1% of studies did not report normality assumptions. Conclusions This research demonstrated that most studies with small sample sizes employed parametric analysis without reporting normality assumptions. The findings emphasize the need for increased awareness of and compliance with statistical principles in the analysis of phase 3 clinical trials with limited sample sizes.

Download

Clustering accuracy summarized by mean (Mean) and standard error (S.E.) ×10 3 , and the number of simulation failures (# failures) over 100 replications with µ 2 = 1p/2.
Model-Based Clustering of Mixed Data With Sparse Dependence

January 2023

·

20 Reads

·

3 Citations

IEEE Access

Mixed data refers to a mixture of continuous and categorical variables. The clustering problem with mixed data is a long-standing statistical problem. The latent Gaussian mixture model, a model-based approach for such a problem, has received attention owing to its simplicity and interpretability. However, these approaches are prone to dimensionality problems. Specifically, parameters must be estimated for each group, and the number of covariance parameters is quadratic in the number of variables. To address this, we propose “regClustMD,” a novel model-based clustering method that can address sparse dependence among variables. We consider a sparse latent Gaussian mixture model, assuming that the precision matrix between variables has sparse nonzero elements. We propose maximizing a penalized complete log-likelihood using the Monte Carlo expectation-maximization (MCEM) algorithm. We demonstrate our method through a simulation study and real-world data examples.

Citations (2)


... The values obtained in this study were within the acceptable ranges. Additionally, a sample size exceeding 30 supports the use of parametric tests (Ghasemi and Zahediasl, 2012;Kim, 2024). Parametric tests offer greater statistical power than nonparametric tests (Norman and Streiner, 2008). ...

Reference:

Investigation of the Relationship Between Biophilia Levels and Problem-solving Skills of 60-72 Month-old Children Attending Preschool Education
Parametric analysis and normality assumption in phase 3 trials with small sample sizes

... Le medie e le matrici di covarianza forniscono indicazioni dettagliate sulla natura di ciascun cluster, evidenziando differenze significative nei valori medi e nella variabilità interna. Questo suggerisce che il clustering basato su modelli è stato in grado di identificare gruppi distinti, ma i risultati potrebbero beneficiare di ulteriori ottimizzazioni o verifiche rispetto ad altri metodi di clustering per garantire la robustezza delle conclusioni (Tabella 11) (Zhuang et al., 2023;Choi et al., 2023;. ...

Model-Based Clustering of Mixed Data With Sparse Dependence

IEEE Access