Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Mapping and Quality Appraisal of Artificial Intelligence Preferential Reporting Checklists, Items, Guidelines, and Consensus in Healthcare: An Altmetric, Bibliometric, and Systematic Review
0
Zitationen
6
Autoren
2026
Jahr
Abstract
Introduction: The integration of artificial intelligence (AI) in healthcare has garnered significant scholarly attention, particularly in areas such as medical image analysis, prognosis, and treatment. Despite its potential, concerns regarding AI's reliability and application persist, prompting the development of guidelines aimed at standardizing its use in medicine. This study aims to evaluate the current content and quality of AI guidelines in healthcare, focusing on identifying gaps and providing a critical appraisal of existing checklists. Methodology: Comprehensive bibliometric analysis, Altmetric analysis, and systematic review were conducted, utilizing the AGREE II Tool for quality appraisal. The systematic search spanned Scopus, PubMed, and Dimension AI databases, focusing on English-language, open-access articles related to AI reporting guidelines. Two reviewers independently evaluated the data, with manual extraction performed in Microsoft Excel. The AGREE II Tool assessed six domains of guideline quality. Results: The search yielded 2477 articles, ultimately identifying 27 AI-specific reporting guidelines published between 2020 and 2025. The analysis revealed significant variations in quality across the AGREE II domains. Among the very first and most impactful were SPIRIT-AI, CONSORT-AI, MINIMAR, and CLAIM, all prioritized structured reporting but were hampered by their timing-based when there were few AI trials-resulting in limited applicability and risk of missing older AI terminologies. However, STAR-machine learning (ML), APPRAISE-AI, and CLEAR exhibited more general, domain-specific frameworks, whereas checklists such as CHEERS-AI, CREMLS, and MI-CLEAR-large language model (LLM) showcased limited author diversity in contribution. However, many guidelines exhibited weaknesses in methodological rigor and stakeholder involvement, limiting their practical applicability. Conclusion: The findings emphasize the need for evidence-based updates to AI reporting guidelines to ensure methodological integrity amid rapid advancements. Increased expert involvement and stakeholder engagement are crucial for enhancing the guidelines' applicability and rigor, addressing AI complexities in health research, and adapting reporting frameworks to evolving AI technologies in healthcare.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.561 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.452 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.948 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.797 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.