Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Towards physician-centered oversight of conversational diagnostic AI
0
Zitationen
35
Autoren
2025
Jahr
Abstract
Recent work has demonstrated the promise of conversational AI systems for diagnostic dialogue. However, real-world assurance of patient safety means that providing individual diagnoses and treatment plans is considered a regulated activity by licensed professionals. Furthermore, physicians commonly oversee other team members in such activities, including nurse practitioners (NPs) or physician assistants/associates (PAs). Inspired by this, we propose a framework for effective, asynchronous oversight of the Articulate Medical Intelligence Explorer (AMIE) AI system. We propose guardrailed-AMIE (g-AMIE), a multi-agent system that performs history taking within guardrails, abstaining from individualized medical advice. Afterwards, g-AMIE conveys assessments to an overseeing primary care physician (PCP) in a clinician cockpit interface. The PCP provides oversight and retains accountability of the clinical decision. This effectively decouples oversight from intake and can thus happen asynchronously. In a randomized, blinded virtual Objective Structured Clinical Examination (OSCE) of text consultations with asynchronous oversight, we compared g-AMIE to NPs/PAs or a group of PCPs under the same guardrails. Across 60 scenarios, g-AMIE outperformed both groups in performing high-quality intake, summarizing cases, and proposing diagnoses and management plans for the overseeing PCP to review. This resulted in higher quality composite decisions. PCP oversight of g-AMIE was also more time-efficient than standalone PCP consultations in prior work. While our study does not replicate existing clinical practices and likely underestimates clinicians' capabilities, our results demonstrate the promise of asynchronous oversight as a feasible paradigm for diagnostic AI systems to operate under expert human oversight for enhancing real-world care.
Ähnliche Arbeiten
The Strengths and Difficulties Questionnaire: A Research Note
1997 · 14.516 Zit.
Making sense of Cronbach's alpha
2011 · 13.646 Zit.
QUADAS-2: A Revised Tool for the Quality Assessment of Diagnostic Accuracy Studies
2011 · 13.521 Zit.
A method for estimating the probability of adverse drug reactions
1981 · 11.446 Zit.
Evidence-Based Medicine
1992 · 4.133 Zit.
Autoren
- Elahe Vedadi
- David Barrett
- Natalie Harris
- Ellery Wulczyn
- S. G. Eswara Reddy
- Roma Ruparel
- Mike Schaekermann
- Tim Strother
- Ryutaro Tanno
- Yash Sharma
- Jin Young Lee
- Cían Hughes
- Dylan Slack
- Anil Palepu
- Jan Freyberg
- Khaled Saab
- Valentin Liévin
- Wei‐Hung Weng
- Tao Tu
- Yun Liu
- Nenad Tomašev
- Kavita Kulkarni
- S. Sara Mahdavi
- Kelvin Guu
- Joëlle Barral
- Dale R. Webster
- James Manyika
- Avinatan Hassidim
- Katherine Chou
- Yossi Matias
- Pushmeet Kohli
- Adam Rodman
- Vivek Natarajan
- Alan Karthikesalingam
- David Stutz