Marcus R. Makowski’s research while affiliated with Charité Universitätsmedizin Berlin and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (409)


Leveraging Phase Information of 3D Isotropic Ultrashort Echo Time Pulmonary MRI for the Detection of Thoracic Lymphadenopathy: Toward an All-in-One Scan Solution
  • Article

November 2024

·

17 Reads

Investigative Radiology

·

·

·

[...]

·

Markus Graf

Background Ultrashort echo time (UTE) allows imaging of tissues with short relaxation times, but it comes with the expense of long scan times. Magnitude images of UTE magnetic resonance imaging (MRI) are widely used in pulmonary imaging due to excellent parenchymal signal, but have insufficient contrast for other anatomical regions of the thorax. Our work investigates the value of UTE phase images (UTE-Ps)—generated simultaneously from the acquired UTE signal used for the magnitude images—for the detection of thoracic lymph nodes based on water-fat contrast. It employs an advanced imaging sequence and reconstruction allowing isotropic 3D UTE MRI in clinically acceptable scan times. Methods In our prospective study, 42 patients with 136 lymph nodes had undergone venous computed tomography and pulmonary MRI scans with UTE within a 14-day interval. 3D isotropic UTE images were acquired using FLORET (fermat looped, orthogonally encoded trajectories). Background-corrected phase images (UTE-P) and magnitude images were reconstructed simultaneously from the UTE-Signal. Three radiologists performed a blinded reading in which all lymph nodes with a short-axis diameter (SAD) of at least 0.5 cm were detected. Detection rates and performance metrics of UTE-P for all lymph node regions and for pathologic (SAD ≥10 mm) and nonpathologic lymph nodes (SAD <10 mm) were calculated using computed tomography as a reference. The interreader agreement defined as the presence or absence of lymph nodes based on patient and region was calculated using Fleiss kappa (κ). Findings In the phase images, pathologic lymph nodes in the mediastinal and hilar region were detected with a high diagnostic confidence due to the achieved water-fat contrast (average sensitivity, specificity, positive predictive value, and negative predictive value of 95.83% [confidence interval (CI), 92.76%–98.91%], 100%, 100%, and 99.32% [CI, 98.08%–100%]). Stepwise inclusion of all lymph node regions and nonpathologic lymph nodes was associated with a moderate decrease resulting in an average sensitivity, specificity, positive predictive value, and negative predictive value of 77.9% (CI, 70.9%–84.7%), 99.4% (CI, 98.7%–99.9%), 97.0% (CI, 93.4%–99.7%), and 94.7% (CI, 92.8%–96.4%) for the inclusion of all lymph nodes sizes and regions. Interreader agreement was almost perfect (κ = 0.92). Conclusions Pathological lymph nodes in the mediastinal and hilar region can be detected in phase-images with high diagnostic confidence, thanks to the ability of the phase images to depict water-fat contrast in combination with high spatial 3D resolution, extending the clinical applicability of UTE into the simultaneous assessment of lung parenchyma and lymph nodes.


Extracellular Matrix–Specific Molecular MR Imaging Probes for the Assessment of Aortic Aneurysms

November 2024

·

3 Reads

All tissues in the human body are composed of cells that are embedded in the extracellular matrix. The extracellular matrix has, besides its structural role, several important functions. These functions include important regulatory mechanisms for signal transduction and matrix cell interactions. If pathological processes, e.g., in atherosclerosis or aortic aneurysms occur, the extracellular matrix changes in response. This includes alterations in the structural and functional components of the extracellular matrix. While traditional imaging technologies, such as X-ray or computed tomography (CT), are mainly aimed at imaging morphological changes, molecular magnetic resonance (MR) imaging is a technique that enables the visualization and quantification of pathological changes on a molecular scale. Different techniques can be used for molecular MR imaging. The most commonly employed techniques include the use of specific molecular magnetic resonance probes. These probes are, in most cases, either based on iron oxide particles or gadolinium chelates for signal generation. Aortic abdominal aneurysms represent an irreversible dilation of the aortic wall which could cause severe consequences, including wall rupture with a mortality rate >90%. Due to the absence of symptoms during the development of aortic aneurysms, early diagnosis remains challenging. In the following chapter, we will outline major developments regarding extracellular matrix–specific molecular magnetic resonance imaging for the assessment of aortic aneurysms.


Fig. 1 Flowchart of the Inclusion and exclusion criteria of the study cohort
Fig. 2 Spine segmentation and extraction of microstructural parameters. a-c Micro-CT-scan of an osteoporotic spine specimen of a 77-year-old female (bone mineral density = 65.75 mg/dL) with an illustration of the manually sampled segmentation masks (b). Derived from these masks, the trabeculae were automatically segmented (depicted as colorful dots and lines), and bone microstructural parameters were extracted using the Python library Scikitimage and BoneJ (c)
Fig. 4 Lateral conventional attenuation (a, b, e, f) and co-registered dark-field (c, d, g, h) images of two spine specimens. Vertical (a, c, e, g) and horizontal (b, d, f, h) scans of the spine specimen of a 77-year-old female with osteoporosis (BMD = 65.75 mg/dL) (a-d) and a non-osteoporotic spine specimen (e-h) of a 61-year-old female (BMD = 169.38 mg/dL). BMD, Bone mineral density
Fig. 5 Primary analysis of correlations between the dark-field signal and bone quality parameters. Correlations of the microstructural parameters, degree of anisotropy (DA) and hydroxyapatite (HA) density with the dark-field signal from vertical and horizontal position (a-e) and with the calculated ratio from vertical/horizontal dark-field signal (f, g, h, i, k)
Demographic characteristics of the study sample

+1

Dark-field radiography for the detection of bone microstructure changes in osteoporotic human lumbar spine specimens
  • Article
  • Full-text available

November 2024

·

40 Reads

European Radiology Experimental

Background Dark-field radiography imaging exploits the wave character of x-rays to measure small-angle scattering on material interfaces, providing structural information with low radiation exposure. We explored the potential of dark-field imaging of bone microstructure to improve the assessment of bone strength in osteoporosis. Methods We prospectively examined 14 osteoporotic/osteopenic and 21 non-osteoporotic/osteopenic human cadaveric vertebrae (L2–L4) with a clinical dark-field radiography system, micro-computed tomography (CT), and spectral CT. Dark-field images were obtained in both vertical and horizontal sample positions. Bone microstructural parameters (trabecular number, Tb.N; trabecular thickness, Tb.Th; bone volume fraction, BV/TV; degree of anisotropy, DA) were measured using standard ex vivo micro-CT, while hydroxyapatite density was measured using spectral CT. Correlations were assessed using Spearman rank correlation coefficients. Results The measured dark-field signal was lower in osteoporotic/osteopenic vertebrae (vertical position, 0.23 ± 0.05 versus 0.29 ± 0.04, p < 0.001; horizontal position, 0.28 ± 0.06 versus 0.34 ± 0.04, p = 0.003). The dark-field signal from the vertical position correlated significantly with Tb.N ( ρ = 0.46, p = 0.005), BV/TV ( ρ = 0.45, p = 0.007), DA ( ρ = -0.43, p = 0.010), and hydroxyapatite density ( ρ = 0.53, p = 0.010). The calculated ratio of vertical/horizontal dark-field signal correlated significantly with Tb.N ( ρ = 0.43, p = 0.011), BV/TV ( ρ = 0.36, p = 0.032), DA ( ρ = -0.51, p = 0.002), and hydroxyapatite density ( ρ = 0.42, p = 0.049). Conclusion Dark-field radiography is a feasible modality for drawing conclusions on bone microarchitecture in human cadaveric vertebral bone. Relevance statement Gaining knowledge of the microarchitecture of bone contributes crucially to predicting bone strength in osteoporosis. This novel radiographic approach based on dark-field x-rays provides insights into bone microstructure at a lower radiation exposure than that of CT modalities. Key Points Dark-field radiography can give information on bone microstructure with low radiation exposure. The dark-field signal correlated positively with bone microstructure parameters. Dark-field signal correlated negatively with the degree of anisotropy. Dark-field radiography helps to determine the directionality of trabecular loss. Graphical Abstract

Download


Fig. 2 Schematic illustration of the present and future role of LLMs in the domain of radiology report processing, categorized across four main areas: documentation, translation and summarization, clinical evaluation, and data mining.
Fig. 1 Workflow example for the integration of LLMs to structure radiology reports in clinical practice.
Large language models for structured reporting in radiology: past, present, and future

European Radiology

Structured reporting (SR) has long been a goal in radiology to standardize and improve the quality of radiology reports. Despite evidence that SR reduces errors, enhances comprehensiveness, and increases adherence to guidelines, its widespread adoption has been limited. Recently, large language models (LLMs) have emerged as a promising solution to automate and facilitate SR. Therefore, this narrative review aims to provide an overview of LLMs for SR in radiology and beyond. We found that the current literature on LLMs for SR is limited, comprising ten studies on the generative pre-trained transformer (GPT)-3.5 ( n = 5) and/or GPT-4 ( n = 8), while two studies additionally examined the performance of Perplexity and Bing Chat or IT5. All studies reported promising results and acknowledged the potential of LLMs for SR, with six out of ten studies demonstrating the feasibility of multilingual applications. Building upon these findings, we discuss limitations, regulatory challenges, and further applications of LLMs in radiology report processing, encompassing four main areas: documentation, translation and summarization, clinical evaluation, and data mining. In conclusion, this review underscores the transformative potential of LLMs to improve efficiency and accuracy in SR and radiology report processing. Key Points Question How can LLMs help make SR in radiology more ubiquitous ? Findings Current literature leveraging LLMs for SR is sparse but shows promising results, including the feasibility of multilingual applications . Clinical relevance LLMs have the potential to transform radiology report processing and enable the widespread adoption of SR. However, their future role in clinical practice depends on overcoming current limitations and regulatory challenges, including opaque algorithms and training data .


Figure 1. Violin plots comparing readability metrics across different large language models (LLMs). The metrics shown include Flesch Reading Ease, Flesch-Kincaid Grade Level, SMOG Index, and Dale-Chall Readability Score for models GPT-4, Gemini-Ultra, GPT-3.5-Turbo with few-shot prompting, Mistral-8×7b, GPT-3.5-Turbo, Claude-3-Opus, and Mistral-7b. The image displays four violin plots, each representing the distribution of readability scores for the seven investigated LLMs.
Figure 2. Comparative evaluation of language models on various qualitative metrics. Panels (a) to (e) illustrate the distribution of ratings across five key metrics: (a) Accuracy and consistency, (b) Clarity and organization, (c) Clinical relevance and actionability, (d) completeness, and (e) naturalness of language. Panel (f) presents a scatter plot showing coherence and content errors for each model. 5-point Likert scale rating included: 'very poor' to 'very good' for each language model across different metrics. Panel F features a scatter plot that maps coherence errors against content errors for the language models, providing a visual comparison of their performance in maintaining coherence and accuracy.
Figure 3. Comparison of error counts and percentages by large language model (LLM). Panel (a) illustrates the counts of coherence and content errors for each language model, while Panel (b) shows the corresponding error percentages (errors per number of sentences).
Figure 4. Comparison of trust-breaking errors and errors leading to posttherapy misconduct across different large language models (LLMs). The stacked bar charts illustrate the distribution of error ratings for LLMs across two categories of errors: Trust-breaking errors and posttherapy misconduct errors. Each bar represents a LLM, segmented by the count of reports assigned each error severity level from 1 to 5. For the trust-breaking errors the Likert scale was defined as follows: 1 = no trust-breaking errors; entirely accurate and reliable; 2 = minor trust-breaking errors; minor inaccuracies that may cause slight concern; 3 = moderate trust-breaking errors; noticeable errors that may moderately impact trust; 4 = significant trust-breaking errors; significant errors causing substantial concern; 5 = severe trust-breaking errors; severe inaccuracies causing serious loss of trust. For the posttherapy misconduct errors, the Likert scale was defined as follows: 1 = completely accurate with no risk of misconduct; 2 = minor errors unlikely to lead to significant misconduct; 3 = errors that could moderately affect posttherapy actions; 4 = errors causing major deviations from appropriate practices; 5 = severe errors highly likely to result in misconduct.
Original Investigation Large Language Models for Simplified Interventional Radiology Reports: A Comparative Analysis Acad Radiol, September 30, 2024

September 2024

·

73 Reads

Academic Radiology

Purpose: To quantitatively and qualitatively evaluate and compare the performance of leading large language models (LLMs), including proprietary models (GPT-4, GPT-3.5 Turbo, Claude-3-Opus, and Gemini Ultra) and open-source models (Mistral-7b and Mistral-8×7b), in simplifying 109 interventional radiology reports. Methods: Qualitative performance was assessed using a five-point Likert scale for accuracy, completeness, clarity, clinical relevance, naturalness, and error rates, including trust-breaking and post-therapy misconduct errors. Quantitative readability was assessed using Flesch Reading Ease (FRE), Flesch-Kincaid Grade Level (FKGL), SMOG Index, and Dale-Chall Readability Score (DCRS). Paired t-tests and Bonferroni-corrected p-values were used for statistical analysis. Results: Qualitative evaluation showed no significant differences between GPT-4 and Claude-3-Opus for any metrics evaluated (all Bonferroni-corrected p-values: p = 1), while they outperformed other assessed models across five qualitative metrics (p < 0.001). GPT-4 had the fewest content and trust-breaking errors, with Claude-3-Opus second. However, all models exhibited some level of trust-breaking and post-therapy misconduct errors, with GPT-4-Turbo and GPT-3.5-Turbo with few-shot prompting showing the lowest error rates, and Mistral-7B and Mistral-8×7B showing the highest. Quantitatively, GPT-4 surpassed Claude-3-Opus in all readability metrics (all p < 0.001), with a median FRE score of 69.01 (IQR: 64.88-73.14) versus 59.74 (IQR: 55.47-64.01) for Claude-3-Opus. GPT-4 also outperformed GPT-3.5-Turbo and Gemini Ultra (both p < 0.001). Inter-rater reliability was strong (κ = 0.77-0.84). Conclusions: GPT-4 and Claude-3-Opus demonstrated superior performance in generating simplified IR reports, but the presence of errors across all models, including trust-breaking errors, highlights the need for further refinement and validation before clinical implementation. Clinical relevance/applications: With the increasing complexity of interventional radiology (IR) procedures and the growing availability of electronic health records, simplifying IR reports is critical to improving patient understanding and clinical decision-making. This study provides insights into the performance of various LLMs in rewriting IR reports, which can help in selecting the most suitable model for clinical patient-centered applications.


Global cross-sectional student survey on AI in medical, dental, and veterinary education and practice at 192 faculties

September 2024

·

215 Reads

·

1 Citation

Background The successful integration of artificial intelligence (AI) in healthcare depends on the global perspectives of all stakeholders. This study aims to answer the research question: What are the attitudes of medical, dental, and veterinary students towards AI in education and practice, and what are the regional differences in these perceptions? Methods An anonymous online survey was developed based on a literature review and expert panel discussions. The survey assessed students' AI knowledge, attitudes towards AI in healthcare, current state of AI education, and preferences for AI teaching. It consisted of 16 multiple-choice items, eight demographic queries, and one free-field comment section. Medical, dental, and veterinary students from various countries were invited to participate via faculty newsletters and courses. The survey measured technological literacy, AI knowledge, current state of AI education, preferences for AI teaching, and attitudes towards AI in healthcare using Likert scales. Data were analyzed using descriptive statistics, Mann–Whitney U-test, Kruskal–Wallis test, and Dunn-Bonferroni post hoc test. Results The survey included 4313 medical, 205 dentistry, and 78 veterinary students from 192 faculties and 48 countries. Most participants were from Europe (51.1%), followed by North/South America (23.3%) and Asia (21.3%). Students reported positive attitudes towards AI in healthcare (median: 4, IQR: 3–4) and a desire for more AI teaching (median: 4, IQR: 4–5). However, they had limited AI knowledge (median: 2, IQR: 2–2), lack of AI courses (76.3%), and felt unprepared to use AI in their careers (median: 2, IQR: 1–3). Subgroup analyses revealed significant differences between the Global North and South ( r = 0.025 to 0.185, all P < .001) and across continents ( r = 0.301 to 0.531, all P < .001), with generally small effect sizes. Conclusions This large-scale international survey highlights medical, dental, and veterinary students' positive perceptions of AI in healthcare, their strong desire for AI education, and the current lack of AI teaching in medical curricula worldwide. The study identifies a need for integrating AI education into medical curricula, considering regional differences in perceptions and educational needs. Trial registration Not applicable (no clinical trial).


Study design. Two groups of male apolipoprotein E-knockout mice received a high-fat diet. The 01BSUR treatment group (n = 10) received a first subcutaneous injection of the anti-interleukin-1β antibody 01BSUR at the start of the high-fat diet and a second injection 4 weeks later. The control group (n = 10) received no therapy. MR imaging was performed 8 weeks after the start of the high-fat diet. Following the imaging session, animals were sacrificed, and arterial tissue was excised for ex vivo analysis. Created with BioRender.com.
Molecular MRI using an elastin-specific probe for visualisation of atherosclerotic plaque burden. Unenhanced T1-weighted MR images showed no relevant signal of the atherosclerotic plaque area. After the intravenous injection of the elastin-specific probe, a clear signal enhancement (with arrows) was observed in the vessel wall corresponding to the atherosclerotic plaque area on T1-weighted images. Ex vivo histological staining confirmed atherosclerotic plaque formation including the expression of elastic fibers within the plaque area. MRI magnetic resonance imaging; EvG Elastica van Gieson; HE hematoxylin–eosin staining; *indicates the arterial lumen; #indicates the plaque area; Scale bars represent 100 µm.
Characterization of the plaque-burden by molecular MRI and ex vivo histology in response to anti-inflammatory therapy. (A) A significant increase of contrast-to-noise-ratio was observed after administration of the elastin-specific probe 8 weeks after start of the high-fat diet in both groups (p < 0.05). The control group showed a significantly stronger enhancement compared to the 01BSUR group. (B, C) Mice treated two times with the anti-interleukin-1β antibody 01BSUR showed a significant lower relative plaque size (p < 0.05) as well as lower elastin fiber content within the atherosclerotic plaque area (p < 0.05) compared to mice of the control group. (D) In vivo MRI measurements of the contrast-to-noise area were in good correlation with ex vivo histological analyses (y = 3.25x + 20.80, R² = 0.77). All data are presented as mean values ± SD.
Ex vivo histological analyses of inflammatory markers. (A, B) Mice treated two times with 01BSUR showed a significant lower expression of CD68 + macrophages (p < 0.05) and interleukin-1β (p < 0.05) in atherosclerotic plaque area after 8 weeks of high-fat diet compared to mice of the control group. (C) Immunofluorescence analyses: Macrophages (CD68 +) are shown in green, interleukin-1β in red and cell nuclei (DAPI) in blue. The stacked image shows a co-localization of interleukin-1β with macrophages. (D) The amount of CD68 + macrophages and IL-1β-expression within the atherosclerotic plaque area were in good correlation (y = 1.22x + 0.35; R² = 0.87). *indicates the arterial lumen; #indicates the plaque area; Scale bars represent 100 µm.
Laser-ablation inductively coupled plasma mass spectrometry (LA-ICP-MS) for spatial localization of gadolinium in the atherosclerotic plaque area. (A) Unenhanced light microscopy image of an animal (n = 1) from the control group. (B) Corresponding image of the LA-ICP-MS visualizing the gadolinium-based elastin-specific probe within atherosclerotic plaque by the gadolinium signal. (C) Elastica van Gieson staining revealed a clear colocalization of gadolinium-accumulation with elastic fibers in the atherosclerotic plaque. *indicates the arterial lumen; #indicates the plaque area; Scale bars represent 100 µm.
Elastin-specific MR probe for visualization and evaluation of an interleukin-1β targeted therapy for atherosclerosis

September 2024

·

46 Reads

Atherosclerosis is a chronic inflammatory condition of the arteries and represents the primary cause of various cardiovascular diseases. Despite ongoing progress, finding effective anti-inflammatory therapeutic strategies for atherosclerosis remains a challenge. Here, we assessed the potential of molecular magnetic resonance imaging (MRI) to visualize the effects of 01BSUR, an anti-interleukin-1β monoclonal antibody, for treating atherosclerosis in a murine model. Male apolipoprotein E-deficient mice were divided into a therapy group (01BSUR, 2 × 0.3 mg/kg subcutaneously, n = 10) and control group (no treatment, n = 10) and received a high-fat diet for eight weeks. The plaque burden was assessed using an elastin-targeted gadolinium-based contrast probe (0.2 mmol/kg intravenously) on a 3 T MRI scanner. T1-weighted imaging showed a significantly lower contrast-to-noise (CNR) ratio in the 01BSUR group (pre: 3.93042664; post: 8.4007067) compared to the control group (pre: 3.70679168; post: 13.2982156) following administration of the elastin-specific MRI probe (p < 0.05). Histological examinations demonstrated a significant reduction in plaque size (p < 0.05) and a significant decrease in plaque elastin content (p < 0.05) in the treatment group compared to control animals. This study demonstrated that 01BSUR hinders the progression of atherosclerosis in a mouse model. Using an elastin-targeted MRI probe, we could quantify these therapeutic effects in MRI.


Fig. 1. Geographical distribution of participating institutions (blue dots) on a world map.
Legend: The size of the blue dots refers to the proportion of respondents per institution relative to the total number of respondents. Countries with at least one participating institution are highlighted in dark green. A, North America; B, Europe; C, Africa; D, South America; E, Asia; F, Australia.
Fig. 2. Gantt diagrams depicting the results for each item for the total study cohort. 
Legend: Colors represent the response options indicated below each Gantt chart, including the corresponding numerator and denominator.
Absolute survey results and regional breakdowns for each item.
Multinational attitudes towards AI in healthcare and diagnostics among hospital patients

September 2024

·

177 Reads

The successful implementation of artificial intelligence (AI) in healthcare is dependent upon the acceptance of this technology by key stakeholders, particularly patients, who are the primary beneficiaries of AI-driven outcomes. This international, multicenter, cross-sectional study assessed the attitudes of hospital patients towards AI in healthcare across 43 countries. A total of 13806 patients at 74 hospitals were surveyed between February and November 2023, with 64.8% from the Global North and 35.2% from the Global South. The findings indicate a predominantly favorable general view of AI in healthcare, with 57.6% of respondents expressing a positive attitude. However, attitudes exhibited notable variation based on demographic characteristics, health status, and technological literacy. Female respondents and those with poorer health status exhibited fewer positive attitudes towards AI use in medicine. Conversely, higher levels of AI knowledge and frequent use of technology devices were associated with more positive attitudes. It is noteworthy that less than half of the participants expressed positive attitudes regarding all items pertaining to trust in AI. The lowest level of trust was observed for the accuracy of AI in providing information regarding treatment responses. Patients exhibited a strong preference for explainable AI and physician-led decision-making, even if it meant slightly compromised accuracy. This large-scale, multinational study provides a comprehensive perspective on patient attitudes towards AI in healthcare across six continents. Findings suggest a need for tailored AI implementation strategies that consider patient demographics, health status, and preferences for explainable AI and physician oversight. All study data has been made publicly available to encourage replication and further investigation.



Citations (57)


... weterynarii i zootechników) pracujących w ośrodkach amerykańskich czy w Europe Zachodniej. Należy podkreślić, iż e-zdrowie i telemedycyna weszły już na stałe do programów medycyny ludzkiej oraz pomocniczych zawodów medycznych w Polsce 59 już wiele lat temu 60 . W klasycznym curriculum weterynaryjnym w Polsce brakuje przedmiotów kompleksowych typu cyfrowa medycyna weterynaryjna (autor współprowadzi pod wodzą prof. ...

Reference:

Możliwości wykorzystywania sztucznej inteligencji (AI) w kontroli stanu zdrowotnego bydła i świń w Polsce oraz regionie
Global cross-sectional student survey on AI in medical, dental, and veterinary education and practice at 192 faculties

... This discrepancy likely stems from the prioritization of domain-specific knowledge during finetuning. Contrary to expectations, fine-tuning LLMs on biomedical data does not consistently lead to improved performance and may even result in reduced effectiveness on unseen medical tasks 41 . Additionally, further fine-tuning can cause the model to forget previous safety alignments 42,43 , and in some cases, these models may lack the safety alignments that are present in general models 30 . ...

Biomedical Large Languages Models Seem not to be Superior to Generalist Models on Unseen Medical Data

... For instance, a recent study demonstrated the effectiveness of AI in recognizing breast cancer from fine-needle aspirated tissue samples by utilizing nuclear features, which significantly improves early detection and diagnosis [131]. Similarly, AI-aided imaging systems were demonstrated to enhance pulmonary nodule detection, which is crucial for early-stage lung cancer diagnosis [132]. The integration of AI into the identification of treatment-resistant cancer cells has been used with machine learning to unveil distinct RNA methylation regulators in pan-cancer neoadjuvant therapy, potentially leading to more effective personalized treatment plans [133]. ...

Speed and efficiency: evaluating pulmonary nodule detection with AI-enhanced 3D gradient echo imaging

European Radiology

... LLM-based frameworks enable the creation of healthcare agents that integrate data sources, knowledge bases, and analytical models into their LLM-driven solutions (Abbasian et al., 2024). Open-source models like LLaMA (Touvron et al., 2023) have demonstrated performance comparable to proprietary models, particularly in radiology examination questions (Adams et al., 2024). LLMs show potential in generating accurate, concise, and comprehensive responses to patient queries with minimal risk of harm (Yalamanchili et al., 2024). ...

Llama 3 Challenges Proprietary State-of-the-Art Large Language Models in Radiology Board-style Examination Questions
  • Citing Article
  • August 2024

Radiology

... While studies have shown promise for LLMs in generating differential diagnoses, Hager et al. caution against relying on LLM-based diagnostics for abdominal pathology. They found that both generalist and specialist LLMs performed significantly worse than radiologists, particularly for conditions such as cholecystitis (84% accuracy for radiologists vs. 13-62% for LLMs) and diverticulitis (86% for radiologists vs. 36-59% for LLMs) [80]. ...

Evaluation and mitigation of the limitations of large language models in clinical decision-making

Nature Medicine

... In der PROBASE-Studie hatten in der ersten Screeningrunde bei 45-Jährigen von 186 PSA-Screeningtest-auffälligen Probanden 120 eine Biopsie und letztlich 48 der Biopsierten (40 %) ein Prostatakarzinom [18]. Noch wichtiger erscheint aber in diesem Zusammenhang die mögliche Identifikation einer Gruppe niedrigen Risikos, bei der keine höherfrequenten Screeninguntersuchungen stattfinden müssen [19]. Beide Parameter variieren je nach Qualität des Screeningtests und der Folgeuntersuchungen und stellen damit eine der Schlüssel-"Stellschrauben" für die Effektivität und die Kosten des Screeningprogramms dar. ...

Risk-adjusted Screening for Prostate Cancer-Defining the Low-risk Group by Data from the PROBASE Trial
  • Citing Article
  • May 2024

European Urology

... In the research article by McTavish et al., the effects of respiratory motion on signal loss in prostate DWI using echoplanar imaging (EPI) is evaluated [15]. At the same time, the potential advantages and trade-offs of partial Fourier imaging for single-shot and multi-shot EPI-based prostate DWI are characterized. ...

Partial Fourier in the presence of respiratory motion in prostate diffusion-weighted echo planar imaging

MAGMA Magnetic Resonance Materials in Physics Biology and Medicine

... Stiffer tumor tissues enable waves to propagate faster hence emphasizing a mechanical contrast with the surrounding softer tissues. Using MR elastography Kader et al. [97] analyzed tissue samples from two prostate cancer xenograft mouse models, PC3 and LNCaP, and found increased prostate tumor stiffness and decreased water mobility in the former compared to the latter and the increase was associated with higher collagen and elastin levels. ...

Sensitivity of magnetic resonance elastography to extracellular matrix and cell motility in human prostate cancer cell line-derived xenograft models
  • Citing Article
  • April 2024

Biomaterials Advances

... In our study, we compared the performance of the deep learning model with that of experienced nuclear medicine physicians. Previous studies have compared the deep learning model to radiologist or nuclear medicine physicians in several circumstances including identifying metastatic bone foci on whole-body bone scan (Ibrahim et al. 2023;Wang et al. 2024;Liu et al. 2022), differentiating malignant to benign bone fracture on multidetector CT (Foreman et al. 2024), that demonstrated the diagnostic performance of deep learning model is superior to physicians with lower experience with a shorter best-performing deep learning model in external test set (AUC: 0.75 vs. 0.77 vs. 0.84). ...

Deep Learning to Differentiate Benign and Malignant Vertebral Fractures at Multidetector CT
  • Citing Article
  • March 2024

Radiology

... Studies found that while DP methods could reduce models' privacy risks, they suffered from challenging hyperparameter tuning, significant model utility drop, and extremely long training time 42,43 . Recent studies on the use of DP in medical image analysis indicate that the utility decrease is more significant on smaller training datasets and more complicated tasks 44,45 . These findings are consistent with our experimental results that DP-trained models exhibit notably lower utility compared to those trained with DiffGuard, given that mediastinal neoplasm diagnosis has limited training data and features complicated segmentation and subtype classification tasks. ...

Preserving fairness and diagnostic accuracy in private large-scale AI models for medical imaging

Communications Medicine