Jonathan G. Goldin’s research while affiliated with University of California, Los Angeles and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (92)


WHISTLE-PF: Study Design of a Phase 2b, Multi-center, Randomized, Double-blind, Controlled Trial of ENV-101 (Taladegib) in Patients With Idiopathic Pulmonary Fibrosis
  • Article

May 2025

American Journal of Respiratory and Critical Care Medicine

·

P.L. Molyneaux

·

J.G. Goldin

·

[...]

·

J. Hood




Overview of the model construction stage and the model evaluation stage. (a) Model construction stage with five‐fold cross validation (previously development with reference condition): we built three deep learning models for the IPF diagnosis task, including one 2D model and two 3D models trained under different hyperparameters. Subjects were divided into five folds, where four folds of subjects were used for model training and one‐fold was used for model testing. (b) Model evaluation (i.e., non‐reference conditions) stage (the focus of this article): at each fold, we obtained CT scans from subjects on the same day in the model testing group, but with varying imaging protocols compared to those used in the testing fold. Model discrepancy was measured by comparing the model prediction results between the reference conditions and evaluation conditions. We further built a statistical model to understand if models are robust to different imaging protocols and analyze which factors lead to the model discrepancy.
A sample CT scan of one patient under a reference condition (a) and two pairs of evaluation conditions (a and b; a and c). Related imaging protocols are provided in the table below. Note: Compared with the reference condition (a), the evaluation condition (b) contained a different exposure at this slice, average exposure for this scan, and patient position. All imaging protocols listed below were different between the evaluation condition (c) and the reference condition.
Evaluating the robustness of deep learning models trained to diagnose idiopathic pulmonary fibrosis using a retrospective study
  • Article
  • Publisher preview available

March 2025

·

13 Reads

Background Deep learning (DL)‐based systems have not yet been broadly implemented in clinical practice, in part due to unknown robustness across multiple imaging protocols. Purpose To this end, we aim to evaluate the performance of several previously developed DL‐based models, which were trained to distinguish idiopathic pulmonary fibrosis (IPF) from non‐IPF among interstitial lung disease (ILD) patients, under standardized reference CT imaging protocols. In this study, we utilized CT scans from non‐IPF ILD subjects, acquired using various imaging protocols, to assess the model performance. Methods Three DL‐based models, including one 2D and two 3D models, have been previously developed to classify ILD patients into IPF or non‐IPF based on chest CT scans. These models were trained on CT image data from 389 IPF and 700 non‐IPF ILD patients, retrospectively, obtained from five multicenter studies. For some patients, multiple CT scans were acquired (e.g., one at inhalation and one at exhalation) and/or reconstructed (e.g., thin slice and/or thick slice). Thus, for each patient, one CT image dataset was selected to be used in the construction of the classification model, so the parameters of that data set serve as the reference conditions. In one non‐IPF ILD study, due to its specific study protocol, many patients had multiple CT image data sets that were acquired under both prone and supine positions and/or reconstructed under different imaging parameters. Therefore, to assess the robustness of the previously developed models under different (e.g., non‐reference) imaging protocols, we identified 343 subjects from this study who had CT data from both the reference condition (used in model construction) and non‐reference conditions (e.g., evaluation conditions), which we used in this model evaluation analysis. We reported the specificities from three model under the non‐reference conditions. Generalized linear mixed effects model (GLMM) was utilized to identify the significant CT technical and clinical parameters that were associated with getting inconsistent diagnostic results between reference and evaluation conditions. Selected parameters include effective tube current‐time product (known as “effective mAs”), reconstruction kernels, slice thickness, patient orientation (prone or supine), CT scanner model, and clinical diagnosis. Limitations include the retrospective nature of this study. Results For all three DL models, the overall specificity of the previously trained IPF diagnosis model decreased (p < 0.05 for two out of three models). GLMM further suggests that for at least one out of three models, mean effective mAs across the scan is the key factor that leads to the decrease in model predictive performance (p < 0.001); the difference of mean effective mAs between the reference and evaluation conditions (p = 0.03) and slice thickness (3 mm; p = 0.03) are flagged as significant factors for one out of three models; other factors are not statistically significant (p > 0.05). Conclusion Preliminary findings demonstrated the lack of robustness of IPF diagnosis model when the DL‐based model is applied to CT series collected under different imaging protocols, which indicated that care should be taken as to the acquisition and reconstruction conditions used when developing and deploying DL models into clinical practice.

View access options


Quantification of Interstitial Lung Diseases, From the AJR Special Series on Quantitative Imaging

November 2024

·

43 Reads

·

1 Citation

American Journal of Roentgenology

High-resolution CT (HRCT) plays an important role in diagnosing and monitoring interstitial lung diseases (ILDs). Despite advances, predicting disease progression and treatment response remains challenging. HRCT enables noninvasive visualization and classification of patterns of lung injury and assessment of disease extent. Visual estimation of CT extent of fibrotic lung disease is an independent predictor of mortality and progression, but is subjective, with only modest interobserver agreement for radiologic interpretation of ILD. Machine learning-based textural analysis of fibrosis extent on baseline and serial HRCT scans shows robust correlations with physiologic measures and strong association with risk of disease progression or mortality across various fibrosing ILDs. In idiopathic pulmonary fibrosis, quantitative CT (QCT) assessment is associated with physiologic impairment and risk of progression and death, and increasing severity of fibrosis on longitudinal evaluation is associated with increased risk of progression and death. Similar results have been noted for fibrotic hypersensitivity pneumonitis and connective tissue disease. This review focuses on QCT techniques for ILDs. We describe the clinical need for quantification of lung disease and illustrate the role of conventional visual evaluation and of QCT approaches in defining disease severity, prognosis, and longitudinal progression, both in established disease and in preclinical interstitial abnormality.


Fig. 2 Non-target progressive disease
Fig. 3 Development of a new lesion
RECIST 1.1 time point response
A call for objectivity: Radiologists’ proposed wishlist for response evaluation in solid tumors (RECIST 1.1)

November 2024

·

37 Reads

·

2 Citations

Cancer Imaging

The Response Evaluation in Solid Tumors (RECIST) 1.1 provides key guidance for performing imaging response assessment and defines image-based outcome metrics in oncology clinical trials, including progression free survival. In this framework, tumors identified on imaging are designated as either target lesions, non-target disease or new lesions and a structured categorical response is assigned at each imaging time point. While RECIST provides definitions for these categories, it specifically and objectively defines only the target disease. Predefined thresholds of size change provide unbiased metrics for determining objective response and disease progression of the target lesions. However, worsening of non-target disease or emergence of new lesions is given the same importance in determining disease progression despite these being qualitatively assessed and less rigorously defined. The subjective assessment of non-target and new disease contributes to reader variability, which can impact the quality of image interpretation and even the determination of progression free survival. The RECIST Working Group has made significant efforts in developing RECIST 1.1 beyond its initial publication, particularly in its application to targeted agents and immunotherapy. A review of the literature highlights that the Working Group has occasionally employed or adopted objective measures for assessing non-target and new lesions in their evaluation of RECIST-based outcome measures. Perhaps a prospective evaluation of these more objective definitions for non-target and new lesions within the framework of RECIST 1.1 might improve reader interpretation. Ideally, these changes could also better align with clinically meaningful outcome measures of patient survival or quality of life.


POS0557 FECAL MICROBIOTA SIGNATURES OF SYSTEMIC SCLEROSIS-ASSOCIATED INTERSTITIAL LUNG DISEASE: AN EXPLORATION OF THE GUT-LUNG AXIS IN AN INTERNATIONAL MULTICENTER STUDY

June 2024

·

21 Reads

·

1 Citation

Annals of the Rheumatic Diseases

Background Systemic sclerosis (SSc) is a complex autoimmune disease characterized by inflammation and fibrosis across different organ systems, including the gastrointestinal tract and the lungs. Studies have demonstrated that gut microbiota modulate pulmonary immune function [1], and alterations of intestinal microbiota communities can influence disease outcomes in distant organs, including the lungs, through gut transfer experiments of dysbiotic microbiota [2]. To our knowledge, no prior studies have investigated the gut-lung axis in SSc-associated interstitial lung disease (ILD) using an international, multicentre study. Objectives We aimed to identify differentially abundant bacterial species in SSc patients with ILD compared to without ILD and to determine whether specific bacterial species are associated with ILD severity based on the quantitative radiological extent of ILD. Methods SSc patients with and without ILD were recruited from 7 international SSc Centres (University of California, Los Angeles [UCLA], USA; Lund University [LU], Sweden; Duke-National University of Singapore; Johns Hopkins University, USA; Ghent University, Belgium; University of Adelaide, Australia; Pontificia Universidad Católica de Chile) and provided a stool sample. Shotgun metagenomics were performed using the Illumina NovaSeq 6000 with a target depth of 10 million 150x2 sequences per sample. Shotgun reads were inputted into MetaPhlAn4 for taxonomic identification of species for compositional analysis and subsequently underwent center log-ratio transformation. Samples were filtered to retain species with at least 10% non-zero counts. High-resolution computed tomography (HRCT) scans of the chest underwent quantitative image analysis to determine the radiological extent of ILD (QILD) in patients from UCLA and LU. General linear models were applied to identify differentially abundant species based on ILD presence and determine associations between QILD scores and species abundance, adjusting for body mass index, current proton pump inhibitor use, current probiotic use, current or prior immunomodulatory therapy, presence of small intestinal bacterial overgrowth and site. We considered p<0.05 at the threshold for reporting and provide 5% false discovery rate corrected p-values (q). We also computed effect size estimates, Cohen’s D for mean differences and standardized Beta for association analyses. Results Among the 261 SSc participants, 220 (84%) were female and 167 (64%) had HRCT-defined ILD. The mean age was 54.6 (SD 13) years; the mean body mass index (BMI) was 24.9 (SD 4.8); and the median disease duration was 6.8 (IQR 3.5, 12.9) years. Among 254 species analyzed, the abundance of 12 bacterial species was altered in patients with ILD compared to without ILD in all study participants, including Anaerotignum faecicola (Cohen’s d= 0.37 [95% CI 0.07-0.68]; p=0.017; q=0.868) and Roseburia hominis (Cohen’s d= 0.42 [95% CI 0.11-0.72]; p=0.007; q=0.868). Among 113 SSc-ILD participants who had an HRCT scan amenable to quantitative image analysis, 16 bacterial species were associated with severity of ILD based on the QILD scores, including Dysosmobacter welbionis (Standardized beta 0.33; p<0.0001; q=0.34 [Figure 1]) and Bifidobacterium adolescentis (Standardized beta -0.24; p=0.02; q=0.56 [Figure 1]). Conclusion Patients with SSc-ILD recruited from several international centres have a unique microbiota signature compared with SSc patients without ILD. The finding that the abundance of certain bacterial species is associated with ILD and its severity supports the hypothesis that intestinal dysbiosis may contribute to ILD pathogenesis. Future studies are needed to identify additional mediators of the gut-lung axis in SSc-ILD, including bacterial metabolites. REFERENCES [1] Sencio V, et al. Mucosal Immunol 2021;14:296–304.[2] Skalaski JH, et al. PLoS Pathog 2018;14:e1007260. • Download figure • Open in new tab • Download powerpoint Figure 1. Association between the abundance of Dysosmobacter welbionis (top) and Bifidobacterium adolescentis (bottom) and QILD score in SSc-ILD patients from LU in Sweden (red) and UCLA in USA (blue) based on generalized linear model estimates. Acknowledgements Funding Sources: Anonymous donation (EV), NHLBI (EV), Boehringer Ingelheim (EV and KA) Disclosure of Interests Kristofer Andréasson Johnson & Johnson Innovative Medicine, Swapna Joshi: None declared, Jen Labus: None declared, Arissa Young: None declared, Andrea Low: None declared, Vanessa Smith Boehringer Ingelheim, Zsuzsanna McMahan Boehringer Ingelheim, Susanna M. Proudman Boehringer Ingelheim, Janssen, Boehringer Ingelheim, Janssen, Antonia Valenzuela Vegara: None declared, Grace Kim: None declared, Jonathan Goldin: None declared, Ezinne Aja: None declared, Jonathan Jacobs: None declared, Elizabeth Volkmann Boehringer Ingelheim, GSK, Horizon, Prometheus, Boehringer Ingelheim.


Quantifying lung fissure integrity using a three-dimensional patch-based convolutional neural network on CT images for emphysema treatment planning

May 2024

·

12 Reads

·

1 Citation

Journal of Medical Imaging

Purpose: Evaluation of lung fissure integrity is required to determine whether emphysema patients have complete fissures and are candidates for endobronchial valve (EBV) therapy. We propose a deep learning (DL) approach to segment fissures using a three-dimensional patch-based convolutional neural network (CNN) and quantitatively assess fissure integrity on CT to evaluate it in subjects with severe emphysema. Approach: From an anonymized image database of patients with severe emphysema, 129 CT scans were used. Lung lobe segmentations were performed to identify lobar regions, and the boundaries among these regions were used to construct approximate interlobar regions of interest (ROIs). The interlobar ROIs were annotated by expert image analysts to identify voxels where the fissure was present and create a reference ROI that excluded non-fissure voxels (where the fissure is incomplete). A CNN configured by nnU-Net was trained using 86 CT scans and their corresponding reference ROIs to segment the ROIs of left oblique fissure (LOF), right oblique fissure (ROF), and right horizontal fissure (RHF). For an independent test set of 43 cases, fissure integrity was quantified by mapping the segmented fissure ROI along the interlobar ROI. A fissure integrity score (FIS) was then calculated as the percentage of labeled fissure voxels divided by total voxels in the interlobar ROI. Predicted FIS (p-FIS) was quantified from the CNN output, and statistical analyses were performed comparing p-FIS and reference FIS (r-FIS). Results: The absolute percent error mean (±SD) between r-FIS and p-FIS for the test set was 4.0% (±4.1%), 6.0% (±9.3%), and 12.2% (±12.5%) for the LOF, ROF, and RHF, respectively. Conclusions: A DL approach was developed to segment lung fissures on CT images and accurately quantify FIS. It has potential to assist in the identification of emphysema patients who would benefit from EBV treatment.


Citations (44)


... Widespread variability in RECIST responses also exists due to the different selection of lesions between radiologists, diminishing the reproducibility of RECIST assessments [41]. In a study conducted by Bucho et al. [42], ML models were compared to radiologists in the selection of measurable and target lesions for RECIST assessment. ...

Reference:

Large Language Models in Cancer Imaging: Applications and Future Perspectives
A call for objectivity: Radiologists’ proposed wishlist for response evaluation in solid tumors (RECIST 1.1)

Cancer Imaging

... Paired-sample t-test of TIW-D values between conventional to STO600, conventional to STO200 and STO600 to STO200 gave p-values of 0.001, 4 × 10 À12 and 3 × 10 À10 respectively. We measured motion model robustness 19 by deformably registering motion compensated STO200 images to the non-motion compensated STO600 images and extracting the motion in the tumor region as per 20 . We observed mean displacements of 0.28 mm, -0.0041 mm and 0.11 mm with standard deviation 0.70 mm, 0.75 mm and 0.68 mm in the superior/inferior, posterior/anterior and left/right axes respectively. ...

Motion compensated cone-beam CT reconstruction using ana Priorimotion model from CT simulation: a pilot study

... Tomographic findings such as the presence of a dispersed bilateral distribution of lesions, a higher number of involved lobes, consolidations, and bronchial distortion, as well as the absence of mixed and reticular patterns, are associated with a poor prognosis in patients admitted to the intensive care unit. However, there is still no standardization of the tomographic findings and predictors of long-term morbidity [17]. Individuals infected with SARS-CoV-2, especially cases classified as severe and of a longer duration, may have chronic lung lesions with architectural distortion and residual abnormalities, as seen on computed tomography, functional impairment, and reduced exercise capacity in the long term [15]. ...

Quantitative Computed Tomography Lung COVID Scores with Laboratory Markers: Utilization to Predict Rapid Progression and Monitor Longitudinal Changes in Patients with Coronavirus 2019 (COVID-19) Pneumonia

... Bexotegrast is currently undergoing phase 2 clinical trials for the treatment of IPF and primary sclerosing cholangitis (PSC). To date, a favorable safety profile and efficacy have been demonstrated in these trials [158][159][160][161][162]. The phase 2a (INTEGRIS-IPF) and 2b (BEACON-IPF) trials for IPF showed dose-dependent effects on forced vital capacity (FVC) and quantitative lung fibrosis imaging, as well as a reduction in biomarkers of fibrosis in lung fluid cells after 7 days of treatment. ...

Late Breaking Abstract - Safety, tolerability and antifibrotic activity of bexotegrast: Phase 2a INTEGRIS-IPF study (NCT04396756)
  • Citing Conference Paper
  • October 2023

... Burnout presents a significant concern within radiology due to the myriad challenges radiologists face. Factors such as demanding knowledge requirements, extended work hours, continuous on-call duties, and the pressure for accurate diagnoses, compounded by acute radiology workforce shortages in some countries, render radiologists highly susceptible to burnout (11,12). Studies utilizing the Maslach Burnout Inventory (MBI), encompassing the syndrome's three subcomponents, have revealed the significant prevalence of this issue within the radiology community (13)(14)(15). ...

Radiologist and Radiology Practice Wellbeing: A Report of the 2023 ARRS Wellness Summit
  • Citing Article
  • September 2023

Academic Radiology

... This effect was observed regardless of whether antifibrotic treatment was used concurrently. BMS-986278 was safe and well tolerated, with the incidences of gastrointestinal adverse events and treatment discontinuation similar to those seen with the placebo [42][43][44] (Table 3). ...

BMS-986278, an Oral Lysophosphatidic Acid Receptor 1 (LPA1) Antagonist, for Patients With Idiopathic Pulmonary Fibrosis: Results From a Phase 2 Randomized Trial
  • Citing Conference Paper
  • May 2023

... Alone, these changes are sufficient to invalidate the use of any model for use in a clinical setting. Incorporating explainability into AI models may facilitate addressing this challenge [105]. Explainability essentially refers to making the decisions and predictions made by AI algorithms more understandable and interpretable to humans and overcoming the so-called "black box" criticism often levelled at ML-based solutions. ...

Translating AI to Clinical Practice: Overcoming Data Shift with Explainability
  • Citing Article
  • May 2023

Radiographics

... However, the associations between QLF score and progression were no longer significant after adjustment for age, sex, FVC % predicted, and oxygen use, implying that among patients known to have PPF, the QLF score does not add to the prognostic information provided by commonly assessed clinical variables. The association that we observed between higher QLF score and worse disease severity is consistent with prior studies in patients with ILDs [20,31,32,37,38] and supports a structure-function relationship between lung fibrosis and lung function. Given these strong associations, it is not surprising that the association between QLF score and the risk of ILD progression was attenuated after adjusting for other measures of disease severity. ...

Quantitative interstitial lung disease scores in idiopathic inflammatory myopathies: longitudinal changes and clinical implications
  • Citing Article
  • March 2023

British Journal of Rheumatology

... Furthermore, there are other aspects concerning these systems that still need validation, especially in terms of reproducibility since lung texture analysis may be affected by patient characteristics (i.e., lung volume, breath hold duration during CT scan acquisition, change in smoking status) or related to the scanner (i.e., calibration, radiation dose, acquisition and reconstruction protocols) [49][50][51]. ...

Quantitative Imaging Metrics for the Assessment of Pulmonary Pathophysiology: An Official American Thoracic Society and Fleischner Society Joint Workshop Report

Annals of the American Thoracic Society