Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
A comparative analysis of AI scribes versus human documentation in simulated general practice consultations
0
Zitationen
5
Autoren
2026
Jahr
Abstract
BACKGROUND AND OBJECTIVES: Artificial intelligence (AI) scribes are emerging as transformative tools in healthcare to automatically generate clinical documentation from patient-clinician encounters. The aim of this study was to compare documentation quality between AI scribes and human-generated notes in simulated general practice consultations. METHOD: This was a cross-sectional study using The Royal Australian College of General Practitioners' clinical exam cases with four professional patient actors, two experienced general practitioners (GPs) and three blinded GP raters. Documentation quality was assessed using a modified Physician Documentation Quality Instrument (PDQI-9). RESULTS: AI scribes demonstrated comparable or superior performance to human documentation using the modified PDQI-9, although the difference was not statistically significant (P = 0.071). Significant differences were found in the domains of accuracy (P = 0.022), thoroughness (P <0.001), succinctness (P <0.001) and freedom from hallucination (P = 0.025). DISCUSSION: Commercially available AI scribes can potentially produce clinical documentation of comparable or superior quality to human documentation in simulated settings, particularly regarding accuracy, thoroughness and succinctness. The finding that both AI and human documentation contain 'hallucinations' challenges the assumption that human‑generated documentation represents the gold standard of clinical documentation quality. Further research is needed to evaluate performance in real-world settings.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.697 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.602 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.127 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.872 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.