Bram van Ginneken’s research while affiliated with Fraunhofer Institute for Digital Medicine and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (570)


Receiver operating characteristic (ROC) plots for the HIVE model (A) and the HIVE-LAB model (B) in predicting appendicitis versus other causes of AAP using the validation population. The top 10 individual parameter contributions to the models are represented by SHapley Additive exPlanations (SHAP) values scaled and plotted as percentage contributions to the prediction (C, D). Parameters outside the top 10 contribute a combined total of 17.5% to the HIVE model and 32.5% to the HIVE-LAB model. HIVE, intake, medical HIstory, Vital signs, physical Examination; HIVE-LAB, intake, medical HIstory, Vital signs, physical Examination, Laboratory testing; AAP, acute abdominal pain; Temp, Temperature; MAP, Mean Arterial Pressure; MH, Medical History; PE, Physical Examination
Receiver operating characteristic (ROC) plots illustrating the performance of three ED physicians in diagnosing cases of appendicitis versus other causes of AAP within the same validation population. (A) Performance of ED physicians using intake, medical history, vital signs, physical examination information. (B) Performance of ED physicians extended with laboratory test results. ED, emergency department; AAP, acute abdominal pain
(A) Box plots displaying the Alvarado score distributions for cases with appendicitis (n = 34) and other causes of AAP (n = 34). Sensitivity and specificity thresholds are highlighted for ruling out appendicitis at a score of ≤ 4 (56% specificity, 88% sensitivity) and for identifying appendicitis at a score of ≥ 7 (27% sensitivity, 100% specificity). (B) Receiver operating characteristic (ROC) plot using the Alvarado scoring system to predict the risk of appendicitis in the validation population. AAP, acute abdominal pain
Statistical comparison of AUROC values among ML models, ED physicians, and the Alvarado scoring system using DeLong's Test
Machine-learning based prediction of appendicitis for patients presenting with acute abdominal pain at the emergency department
  • Article
  • Full-text available

December 2024

·

26 Reads

·

Peter Belgers

·

Rory O’Connor

·

[...]

·

Background Acute abdominal pain (AAP) constitutes 5–10% of all emergency department (ED) visits, with appendicitis being a prevalent AAP etiology often necessitating surgical intervention. The variability in AAP symptoms and causes, combined with the challenge of identifying appendicitis, complicate timely intervention. To estimate the risk of appendicitis, scoring systems such as the Alvarado score have been developed. However, diagnostic errors and delays remain common. Although various machine learning (ML) models have been proposed to enhance appendicitis detection, none have been seamlessly integrated into the ED workflows for AAP or are specifically designed to diagnose appendicitis as early as possible within the clinical decision-making process. To mimic daily clinical practice, this proof-of-concept study aims to develop ML models that support decision-making using comprehensive clinical data up to key decision points in the ED workflow to detect appendicitis in patients presenting with AAP. Methods Data from the Dutch triage system at the ED, vital signs, complete medical history and physical examination findings and routine laboratory test results were retrospectively extracted from 350 AAP patients presenting to the ED of a Dutch teaching hospital from 2016 to 2023. Two eXtreme Gradient Boosting ML models were developed to differentiate cases with appendicitis from other AAP causes: one model used all data up to and including physical examination, and the other was extended with routine laboratory test results. The performance of both models was evaluated on a validation set (n = 68) and compared to the Alvarado scoring system as well as three ED physicians in a reader study. Results The ML models achieved AUROCs of 0.919 without laboratory test results and 0.923 with the addition of laboratory test results. The Alvarado scoring system attained an AUROC of 0.824. ED physicians achieved AUROCs of 0.894, 0.826, and 0.791 without laboratory test results, increasing to AUROCs of 0.923, 0.892, and 0.859 with laboratory test results. Conclusions Both ML models demonstrated comparable high accuracy in predicting appendicitis in patients with AAP, outperforming the Alvarado scoring system. The ML models matched or surpassed ED physician performance in detecting appendicitis, with the largest potential performance gain observed in absence of laboratory test results. Integration could assist ED physicians in early and accurate diagnosis of appendicitis. Graphical abstract

Download

Figure 2
Figure 3
. Using SHAP values, the top 10 contributing parameters for each model were identified. For the HIVE model, these parameters in descending order were: 1.
Statistical comparison of AUROC values among ML models, ED physicians, and the Alvarado scoring system using DeLong's Test
Machine-learning based prediction of appendicitis for patients presenting with acute abdominal pain at the emergency department.

December 2024

·

10 Reads

Background : Acute abdominal pain (AAP) constitutes 5-10% of all emergency department (ED) visits, with appendicitis being a prevalent AAP etiology often necessitating surgical intervention. The variability in AAP symptoms and causes, combined with the challenge of identifying appendicitis, complicate timely intervention. To estimate the risk of appendicitis, scoring systems such as the Alvarado score have been developed. However, diagnostic errors and delays remain common. Although various machine learning (ML) models have been proposed to enhance appendicitis detection, none have been seamlessly integrated into the ED workflows for AAP or are specifically designed to diagnose appendicitis as early as possible within the clinical decision-making process. To mimic daily clinical practice, this proof-of-concept study aims to develop ML models that support decision-making using comprehensive clinical data up to key decision points in the ED workflow to detect appendicitis in patients presenting with AAP. Methods : Data from the Dutch triage system at the ED, vital signs, complete medical history and physical examination findings and routine laboratory test results were retrospectively extracted from 350 AAP patients presenting to the ED of a Dutch teaching hospital from 2016 to 2023. Two eXtreme Gradient Boosting ML models were developed to differentiate cases with appendicitis from other AAP causes: one model used all data up to and including physical examination, and the other was extended with routine laboratory test results. The performance of both models was evaluated on a validation set ( n =68) and compared to the Alvarado scoring system as well as three ED physicians in a reader study. Results: The ML models achieved AUROCs of 0.919 without laboratory test results and 0.923 with the addition of laboratory test results. The Alvarado scoring system attained an AUROC of 0.824. ED physicians achieved AUROCs of 0.894, 0.826, and 0.791 without laboratory test results, increasing to AUROCs of 0.923, 0.892, and 0.859 with laboratory test results. Conclusions: Both ML models demonstrated comparable high accuracy in predicting appendicitis in patients with AAP, outperforming the Alvarado scoring system. The ML models matched or surpassed ED physician performance in detecting appendicitis, with the largest potential performance gain observed in absence of laboratory test results. Integration could assist ED physicians in early and accurate diagnosis of appendicitis.


Automated Tooth Segmentation in Magnetic Resonance Scans Using Deep Learning

November 2024

·

36 Reads

·

1 Citation

Dentomaxillofacial Radiology

Objectives The main objective was to develop and evaluate an artificial intelligence (AI) model for tooth segmentation in magnetic resonance (MR) scans. Material and Methods MR scans of 20 patients performed with a commercial 64-channel head coil with a T1-weighted 3D-SPACE (Sampling Perfection with Application Optimized Contrasts using different flip angle Evolution) sequence were included. 16 datasets were used for model training and 4 for accuracy evaluation. Two clinicians segmented and annotated the teeth in each dataset. A segmentation model was trained using the nnU-Net framework. The manual reference tooth segmentation and the inferred tooth segmentation were superimposed and compared by computing precision, sensitivity, and Dice-Sørensen coefficient. Surface meshes were extracted from the segmentations, and the distances between points on each mesh and their closest counterparts on the other mesh were computed, of which the mean (average symmetric surface distance, ASSD) and 95th percentile (Hausdorff distance 95%, HD95) were reported. Results The model achieved an overall precision of 0.867, a sensitivity of 0.926, a Dice-Sørensen coefficient of 0.895, and a 95% Hausdorff distance of 0.91 mm. The model predictions were less accurate for datasets containing dental restorations due to image artefacts. Conclusions The current study developed an automated method for tooth segmentation in MR scans with moderate to high effectiveness for scans with respectively without artefacts.


Figure 1 A portion of an MR scan with the segmentation and surface mesh of the center tooth. The low signal intensity of dental hard tissues and low spatial resolution make a detailed
Figure 2 Tooth annotations of two MR scans without (a) and with (b) a magnetic susceptibility artefact. The susceptibility artefact in the left region (blue asterisks) obstructs the molars, making the annotation inconsistent. As such, teeth severely obstructed by a susceptibility artefact were not included in the tooth segmentation ((b), bottom row).
Automated Tooth Segmentation in Magnetic Resonance Scans Using Deep Learning

November 2024

·

97 Reads

Dentomaxillofacial Radiology

Objectives The main objective was to develop and evaluate an artificial intelligence (AI) model for tooth segmentation in magnetic resonance (MR) scans. Material and Methods MR scans of 20 patients performed with a commercial 64-channel head coil with a T1-weighted 3D-SPACE (Sampling Perfection with Application Optimized Contrasts using different flip angle Evolution) sequence were included. 16 datasets were used for model training and 4 for accuracy evaluation. Two clinicians segmented and annotated the teeth in each dataset. A segmentation model was trained using the nnU-Net framework. The manual reference tooth segmentation and the inferred tooth segmentation were superimposed and compared by computing precision, sensitivity, and Dice-Sørensen coefficient. Surface meshes were extracted from the segmentations, and the distances between points on each mesh and their closest counterparts on the other mesh were computed, of which the mean (average symmetric surface distance, ASSD) and 95th percentile (Hausdorff distance 95%, HD95) were reported. Results The model achieved an overall precision of 0.867, a sensitivity of 0.926, a Dice-Sørensen coefficient of 0.895, and a 95% Hausdorff distance of 0.91 mm. The model predictions were less accurate for datasets containing dental restorations due to image artefacts. Conclusions The current study developed an automated method for tooth segmentation in MR scans with moderate to high effectiveness for scans with respectively without artefacts.


Deep Learning-Based Algorithm for Staging Secondary Caries in Bitewings

October 2024

·

49 Reads

Caries Research

Introduction: Despite the notable progress in developing artificial intelligence-based tools for caries detection in bitewings, limited research has addressed the detection and staging of secondary caries. Therefore, we aimed to develop a convolutional neural network (CNN)-based algorithm for these purposes using a novel approach for determining lesion severity. Methods: We used a dataset from a Dutch dental practice-based research network containing 2,612 restored teeth in 413 bitewings from 383 patients aged 15-88 years and trained the Mask R-CNN architecture with a Swin Transformer backbone. Two-stage training fine-tuned caries detection accuracy and severity assessment. Annotations of caries around restorations were made by two evaluators and checked by two other experts. Aggregated accuracy metrics (mean ± standard deviation - SD) in detecting teeth with secondary caries were calculated considering two thresholds: detecting all lesions and dentine lesions. The correlation between the lesion severity scores obtained with the algorithm and the annotators' consensus was determined using the Pearson correlation coefficient and Bland-Altman plots. Results: Our refined algorithm showed high specificity in detecting all lesions (0.966 ± 0.025) and dentine lesions (0.964 ± 0.019). Sensitivity values were lower: 0.737 ± 0.079 for all lesions and 0.808 ± 0.083 for dentine lesions. The areas under ROC curves (SD) were 0.940 (0.025) for all lesions and 0.946 (0.023) for dentine lesions. The correlation coefficient for severity scores was 0.802. Conclusion: We developed an improved algorithm to support clinicians in detecting and staging secondary caries in bitewing, incorporating an innovative approach for annotation, considering the lesion severity as a continuous outcome.



AI-based lumbar central canal stenosis classification on sagittal MR images is comparable to experienced radiologists using axial images

September 2024

·

80 Reads

European Radiology

Objectives The assessment of lumbar central canal stenosis (LCCS) is crucial for diagnosing and planning treatment for patients with low back pain and neurogenic pain. However, manual assessment methods are time-consuming, variable, and require axial MRIs. The aim of this study is to develop and validate an AI-based model that automatically classifies LCCS using sagittal T2-weighted MRIs. Methods A pre-existing 3D AI algorithm was utilized to segment the spinal canal and intervertebral discs (IVDs), enabling quantitative measurements at each IVD level. Four musculoskeletal radiologists graded 683 IVD levels from 186 LCCS patients using the 4-class Lee grading system. A second consensus reading was conducted by readers 1 and 2, which, along with automatic measurements, formed the training dataset for a multiclass (grade 0–3) and binary (grade 0–1 vs. 2–3) random forest classifier with tenfold cross-validation. Results The multiclass model achieved a Cohen’s weighted kappa of 0.86 (95% CI: 0.82–0.90), comparable to readers 3 and 4 with 0.85 (95% CI: 0.80–0.89) and 0.73 (95% CI: 0.68–0.79) respectively. The binary model demonstrated an AUC of 0.98 (95% CI: 0.97–0.99), sensitivity of 93% (95% CI: 91–96%), and specificity of 91% (95% CI: 87–95%). In comparison, readers 3 and 4 achieved a specificity of 98 and 99% and sensitivity of 74 and 54%, respectively. Conclusion Both the multiclass and binary models, while only using sagittal MR images, perform on par with experienced radiologists who also had access to axial sequences. This underscores the potential of this novel algorithm in enhancing diagnostic accuracy and efficiency in medical imaging. Key Points Question How can the classification of lumbar central canal stenosis (LCCS) be made more efficient? Findings Multiclass and binary AI models, using only sagittal MR images, performed on par with experienced radiologists who also had access to axial sequences. Clinical relevance Our AI algorithm accurately classifies LCCS from sagittal MRI, matching experienced radiologists. This study offers a promising tool for automated LCCS assessment from sagittal T2 MRI, potentially reducing the reliance on additional axial imaging .


Evaluation of C-Reactive Protein and Computer-Aided Analysis of Chest X-rays as Tuberculosis Triage Tests at Health Facilities in Lesotho and South Africa

August 2024

·

21 Reads

Clinical Infectious Diseases

Background To improve tuberculosis case-finding, rapid, non-sputum triage tests need to be developed according to the World Health Organization target product profile (TPP) (>90% sensitivity, >70% specificity). We prospectively evaluated and compared artificial intelligence–based, computer-aided detection software, CAD4TBv7, and C-reactive protein assay (CRP) as triage tests at health facilities in Lesotho and South Africa. Methods Adults (≥18 years) presenting with ≥1 of the 4 cardinal tuberculosis symptoms were consecutively recruited between February 2021 and April 2022. After informed consent, each participant underwent a digital chest X-ray for CAD4TBv7 and a CRP test. Participants provided 1 sputum sample for Xpert MTB/RIF Ultra and Xpert MTB/RIF and 1 for liquid culture. Additionally, an expert radiologist read the chest X-rays via teleradiology. For primary analysis, a composite microbiological reference standard (ie, positive culture or Xpert Ultra) was used. Results We enrolled 1392 participants, 48% were people with HIV and 24% had previously tuberculosis. The receiver operating characteristic curve for CAD4TBv7 and CRP showed an area under the curve of .87 (95% CI: .84–.91) and .80 (95% CI: .76–.84), respectively. At thresholds corresponding to 90% sensitivity, specificity was 68.2% (95% CI: 65.4–71.0%) and 38.2% (95% CI: 35.3–41.1%) for CAD4TBv7 and CRP, respectively. CAD4TBv7 detected tuberculosis as well as an expert radiologist. CAD4TBv7 almost met the TPP criteria for tuberculosis triage. Conclusions CAD4TBv7 is accurate as a triage test for patients with tuberculosis symptoms from areas with a high tuberculosis and HIV burden. The role of CRP in tuberculosis triage requires further research. Clinical Trials Registration Clinicaltrials.gov identifier: NCT04666311.



Fig. 1. Inclusions and exclusions in the model development and external validation data set. (A), The model development data set consisted of laboratories: Amphia Hospital (n = 1559), Isala Hospital (n = 1321), Jeroen Bosch Hospital (n = 927), Maasstad Hospital (n = 894), Máxima Medical Center (n = 719), Meander Medical Center (n = 984), and Zuyderland Medical Center (n = 758); (B), Medlon (n = 3160) was reserved for external validation. After exclusions, the development data set included 5908 cases, and the external validation data set comprised 2656 cases. Color figure available at https://academic.oup.com/clinchem.
Fig. 2. Receiver operating characteristic plots of (A) the XGBoost (XGB) and (D) logistic regression (LR) model of the independent external validation set. Contributions of each individual parameter to the XGB model are SHapley Additive exPlanations (SHAP) values (B) and logistic regression coefficients (E) scaled and plotted as percentage contributions to the prediction. Plot with different probability thresholds based on prioritizing specificity or sensitivity for the (C) XGB and (F) logistic regression model. Abbreviations: MCV, mean corpuscular volume; RDWCV, red cell distribution width -coefficient of variation; MCH, mean corpuscular hemoglobin; Hb, hemoglobin; MCHC, mean corpuscular hemoglobin concentration; RBC, red blood cell count; Plt, platelet count. Color figure available at https:// academic.oup.com/clinchem.
Fig. 3. Receiver operating characteristic plots for of the XGBoost (XGB) model for the subclasses of the external independent validation set. (A), α 0 -heterozygote (n = 46), homozygote (including compound heterozygotes) (n = 68), and α + -heterozygote (n = 304); (B), β-thalassemia (n = 306); (C), Hb E (hemoglobin E), Hb C (hemoglobin C), Hb D (hemoglobin D) (n = 35), sickle cell anemia (n = 44), and Hb S (Hemoglobin S) heterozygote (n = 79); and (D), combinations (combinations between thalassemia, and concomitant structural hemoglobin variants) (n = 118). Color figure available at https://academic.oup.com/clinchem.
Fig. 4. Receiver operating characteristic plot of (A) the XGBoost (XGB) model and (D) logistic regression (LR) of the Spanish external validation data set differentiating thalassemia from IDA (iron deficiency anemia). Receiver operating characteristic plot of (B) the XGB model and (E) the logistic regression, differentiating α-thalassemia and β-thalassemia from IDA (n α-thalassemia = 429, n β-thalassemia = 941, n IDA = 1259). (C), XGB model predicted probabilities for IDA (median = 0.54, IQR: 0.36 to 0.75), α-thalassemia (median = 0.93, IQR: 0.85 to 0.96), and β-thalassemia (median = 0.98, IQR: 0.96 to 0.99); (F), Logistic regression predicted probabilities for IDA (median = 0.56, IQR: 0.45 to 0.65), α-thalassemia (median = 0.83, IQR: 0.74 to 0.88), and β-thalassemia (median = 0.94, IQR: 0.89 to 0.98). Color figure available at https://academic.oup.com/clinchem.
Fig. 5. Receiver operation characteristic plot of (A) the XGBoost (XGB) and (D) logistic regression (LR) models for distinguishing cases of the general population of Jeroen Bosch Hospital, considered negative, from known positive hemoglobinopathy cases. (B), XGB model predicted probabilities for positive cases (median = 0.85, IQR: 0.72 to 0.95) and negative cases (median = 0.10, IQR: 0.05 to 0.22); (E), LR model predicted probabilities for positive cases (median = 0.82, IQR: 0.64 to 0.92) and negative cases (median = 0.17, IQR: 0.10 to 0.27). The area under the precision-recall curve of (C) the XGB model and (F) the LR model. Color figure available at https://academic.oup.com/clinchem.
Machine Learning-Based Prediction of Hemoglobinopathies Using Complete Blood Count Data

July 2024

·

303 Reads

·

2 Citations

Clinical Chemistry

Background Hemoglobinopathies, the most common inherited blood disorder, are frequently underdiagnosed. Early identification of carriers is important for genetic counseling of couples at risk. The aim of this study was to develop and validate a novel machine learning model on a multicenter data set, covering a wide spectrum of hemoglobinopathies based on routine complete blood count (CBC) testing. Methods Hemoglobinopathy test results from 10 322 adults were extracted retrospectively from 8 Dutch laboratories. eXtreme Gradient Boosting (XGB) and logistic regression models were developed to differentiate negative from positive hemoglobinopathy cases, using 7 routine CBC parameters. External validation was conducted on a data set from an independent Dutch laboratory, with an additional external validation on a Spanish data set (n = 2629) specifically for differentiating thalassemia from iron deficiency anemia (IDA). Results The XGB and logistic regression models achieved an area under the receiver operating characteristic (AUROC) of 0.88 and 0.84, respectively, in distinguishing negative from positive hemoglobinopathy cases in the independent external validation set. Subclass analysis showed that the XGB model reached an AUROC of 0.97 for β-thalassemia, 0.98 for α0-thalassemia, 0.95 for homozygous α+-thalassemia, 0.78 for heterozygous α+-thalassemia, and 0.94 for the structural hemoglobin variants Hemoglobin C, Hemoglobin D, Hemoglobin E. Both models attained AUROCs of 0.95 in differentiating IDA from thalassemia. Conclusions Both the XGB and logistic regression model demonstrate high accuracy in predicting a broad range of hemoglobinopathies and are effective in differentiating hemoglobinopathies from IDA. Integration of these models into the laboratory information system facilitates automated hemoglobinopathy detection using routine CBC parameters.


Citations (64)


... A CT intensity-based FG algorithm and a deep learning-based version are developed [125]. Graph neural network was also employed for extracting airways from chest CT data [126,127]. A coarse-to-fine framework was proposed for addressing challenges in small airway branch segmentation [59,128,129]. ...

Reference:

Artificial intelligence in COPD CT images: identification, staging, and quantitation
Structure and position-aware graph neural network for airway labeling
  • Citing Article
  • August 2024

Medical Image Analysis

... To add the shadow artifact within a specific B-scan, we adhere to the procedure outlined by (de Vente et al., 2023). For any given B-scan, each constituent A-scan a x undergoes an individualized modification process. ...

Uncertainty-aware multiple-instance learning for reliable classification: Application to optical coherence tomography
  • Citing Article
  • June 2024

Medical Image Analysis

... Nevertheless, CBCT images show subtle changes in bone density and often mark fracture lines, making it difficult to identify fractures compared to CT. The evaluation of deep learning algorithms based on AI enables us to reach a precision of 97.8% and a sensitivity of 95.6% in detecting mandibular fractures based on CBCT scans [37]. In terms of open fractures of the mandible, the AS produced a sensitivity of 96% and NPV of 93.3%, which, in comparison to closed fractures cases (with a sensitivity of 88.9% and NPV of 57.1%, respectively), proved to be more useful in identifying the patients who did not have a fracture. ...

Detecting Mandible Fractures in CBCT Scans Using a 3-Stage Neural Network

Journal of Dental Research

... [2] Overview of deep learning techniques Deep learning frameworks Provided taxonomy of deep learning, emphasizing future applications in healthcare. [3] AI in prostate cancer detection Radiomics, ML Validated AI's diagnostic performance on MRI, proving noninferiority to radiologists. [4] AI in urological diagnostics and treatment Various AI tools Discussed AI's potential in personalized urology care and early diagnosis. ...

Artificial Intelligence and Radiologists in Prostate Cancer Detection on MRI (PI-CAI): An International, Paired, Non-inferiority, Confirmatory Study
  • Citing Article
  • June 2024

The Lancet Oncology

... Although the results shown in Table 3 of participants were computed on different testing dataset which is not public yet (see Fig. 2) unlike our proposed methods 3D-CNN and 3D-EffiBOT which are evaluated on public data set after stratified 5-fold sampling. As T3 approach [30] is implemented during this STOIC challenge, therefore the Qualification Leader-board scores were collected and computed after Last submission round of the challenge in qualification phase. This means that these results of all participants were reported after training on public dataset and testing on private dataset Test set A2 (see Fig. 2). ...

The STOIC2021 COVID-19 AI challenge: Applying reusable training methodologies to private data
  • Citing Article
  • June 2024

Medical Image Analysis

... Therefore, early identification of carpal instability is crucial to avoid deterioration of this condition. Nevertheless, signs of carpal instability are often unnoticed on conventional radiographs [86] . In response to this diagnostic challenge, Hendrix et al. developed an AI model to identify and assess signs of carpal instability on X-rays [87] . ...

Artificial intelligence for automated detection and measurements of carpal instability signs on conventional radiographs

European Radiology

... We used the publicly available NODE21 [68] dataset, which is sourced from several well-known publicly available datasets, including JSRT [65], PadChest [5], ChestX-ray14 [80], and Open-I [14]. The NODE21 dataset consists of as '1' or positive class, and non-nodules were labelled as '0' or negative class. ...

Nodule Detection and Generation on Chest X-Rays: NODE21 Challenge
  • Citing Article
  • March 2024

IEEE Transactions on Medical Imaging

... Moreover, the two distinct TME subtypes were significantly predictive of OS and PFS in the patients treated with first-line immune checkpoint inhibitors, going beyond TIL estimates and PD-L1 scores. While numerous deep learning studies have emerged for predicting ICI responses in NSCLC from H&E images, they are primarily focused on refining PD-L1 quantification [56][57][58] . In contrast to previous studies, our approach aims to offer a more comprehensive overview of the tumor microenvironment by predicting the TME cell type and molecular composition. ...

Comparing deep learning and pathologist quantification of cell-level PD-L1 expression in non-small cell lung cancer whole-slide images

... Az is bizonyított, hogy a leletezési diszkrepanciák gyakoribbak a nagyobb leletezendő vizsgálati számok és hosszabb műszakok esetén [9]. Bár a köznyelvben nagy figyelmet kap, ugyanakkor a mesterséges intelligencián alapuló megoldások alkalmazása az eddigi tapasztalatok szerint csak jelzetten csökkenti, sőt sok esetben növeli a radiológiai munkaterhelést [10][11][12], továbbá a betegellátásra gyakorolt hatásukról is kevés az adat. ...

The emperor has few clothes: a realistic appraisal of current AI in radiology
  • Citing Article
  • March 2024

European Radiology

... Training a neural network model for medical image segmentation requires a large amount of image data with corresponding semantic segmentation masks. A large, publicly available dataset was used to train the BRAU-Net++ network for vertebral and spinal canal segmentation from sagittal MR images of the lumbar spine [21]. To evaluate the statistical significance of the different VBQ determination methods, 166 image series from 83 studies were randomly selected from the database of the resident hospital. ...

Lumbar spine segmentation in MR images: a dataset and a public benchmark

Scientific Data