Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Foundation models for X-ray interpretation: a narrative review of current techniques and future perspectives in diagnostic imaging
0
Zitationen
11
Autoren
2026
Jahr
Abstract
Background and Objective: The scarcity of high-quality annotated data is a primary bottleneck in developing artificial intelligence (AI) for chest X-ray (CXR) interpretation. Foundation models (FMs), trained on broad datasets via self-supervision, present a transformative solution. This narrative review analyzes the current state of vision and vision-language foundation models (VLFMs) specifically for CXR, evaluating their potential to bridge research and clinical practice through a novel analytical framework. Methods: Following PRISMA 2020 guidelines, we conducted a narrative review of literature from 2021–2025. We introduced and applied a novel four-pillar analytical taxonomy to structure the field: (I) core model architecture; (II) training framework; (III) clinical application adaptation; and (IV) quality assurance (QA). This framework guided the synthesis of evidence from over 150 studies, enabling a structured analysis of model designs, training paradigms, adaptation strategies, and evaluation metrics. Key Content and Findings: The four-pillar taxonomy provides a critical lens to deconstruct the FM development pipeline. Our analysis reveals distinct maturity levels across pillars: while architectural innovation and training strategies are advanced, rigorous QA and adaptation for real-world heterogeneity (e.g., portable vs. stationary imaging, device/site shifts) are underdeveloped. We identify domain-specific challenges, including bias propagation from limited public datasets, label noise in report-mined supervision, and a significant gap between retrospective benchmark performance and prospective clinical utility. Evidence-based recommendations are provided for model selection, efficient adaptation via parameter-efficient fine-tuning (PEFT), and the implementation of stratified evaluation. Conclusions: CXR-FMs offer a path toward more generalizable and data-efficient diagnostic AI. However, successful clinical translation is contingent upon coordinated advances across all four pillars of the proposed framework, moving beyond architectural scaling to prioritize data diversity, robustness auditing, and seamless workflow integration. This taxonomy provides researchers and clinicians with an actionable framework to develop and evaluate models whose impact will be determined by demonstrated fairness, reliability, and utility in diverse clinical environments.
Ähnliche Arbeiten
Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study
2020 · 22.631 Zit.
La certeza de lo impredecible: Cultura Educación y Sociedad en tiempos de COVID19
2020 · 19.284 Zit.
A Multi-Modal Distributed Real-Time IoT System for Urban Traffic Control (Invited Paper)
2024 · 14.276 Zit.
UNet++: A Nested U-Net Architecture for Medical Image Segmentation
2018 · 8.626 Zit.
Review of deep learning: concepts, CNN architectures, challenges, applications, future directions
2021 · 7.244 Zit.