Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
DEMT: Diffusion-Enhanced Multi-Modal Transformer for Bone Fracture Detection
0
Zitationen
3
Autoren
2025
Jahr
Abstract
Early and accurate detection of bone fractures is essential for improving clinical outcomes and reducing diagnostic workload in radiology. This study proposes a Diffusion-Enhanced Multi-Modal Transformer (DEMT) designed to jointly analyze X-ray images and patient metadata for robust fracture classification. The model is evaluated using the publicly available Kaggle Human Bone Fractures Image Dataset, which includes multiple anatomical regions. A comprehensive preprocessing pipeline is implemented, incorporating conventional augmentation, diffusion-based augmentation for rare cases, and synthetic metadata generation to enhance data diversity. The dataset is stratified and partitioned into training (85%), validation (10%), and testing (5%) subsets. DEMT integrates a Vision Transformer backbone for image representation with a dedicated metadata encoder, fusing both modalities through a cross-attention mechanism. The framework further incorporates uncertainty quantification via specialized loss functions and produces calibrated confidence intervals to support clinical decision-making. The model is trained in PyTorch using the AdamW optimizer, cosine annealing learning rate scheduling, and early stopping. Experimental results show that DEMT outperforms baseline CNN and ResNet architectures, achieving higher classification accuracy and maintaining low calibration error (ECE ¡ 0.1). DEMT also demonstrates strong performance on rare fracture categories, supported by interpretable attention maps and a clinically oriented decision-support dashboard. These findings highlight the potential of DEMT as a reliable and interpretable tool for real-world radiological assessment.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.245 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.100 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.466 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.429 Zit.