Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
First, do NOHARM: towards clinically safe large language models
1
Zitationen
51
Autoren
2025
Jahr
Abstract
Large language models (LLMs) are routinely used by physicians and patients for medical advice, yet their clinical safety profiles remain poorly characterized. We present NOHARM (Numerous Options Harm Assessment for Risk in Medicine), a benchmark using 100 real primary care-to-specialist consultation cases to measure frequency and severity of harm from LLM-generated medical recommendations. NOHARM covers 10 specialties, with 12,747 expert annotations for 4,249 clinical management options. Across 31 LLMs, potential for severe harm from LLM recommendations occurs in up to 22.2% (95% CI 21.6-22.8%) of cases, with harm of omission accounting for 76.6% (95% CI 76.4-76.8%) of errors. Safety performance is only moderately correlated (r = 0.61-0.64) with existing AI and medical knowledge benchmarks. The best models outperform generalist physicians on safety (mean difference 9.7%, 95% CI 7.0-12.5%), and a diverse multi-agent approach improves safety compared to solo models (mean difference 8.0%, 95% CI 4.0-12.1%). Therefore, despite strong performance on existing evaluations, widely used AI models can produce severely harmful medical advice at nontrivial rates, underscoring clinical safety as a distinct performance dimension necessitating explicit measurement.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.200 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.051 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.416 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.410 Zit.
Autoren
- David Wu
- Fateme Nateghi Haredasht
- Saloni Kumar Maharaj
- Prachi Jain
- Jessica Tran
- Matthew Gwiazdon
- Arjun Rustagi
- Jenelle Jindal
- Jacob M. Koshy
- Vinay B. Kadiyala
- Anup Agarwal
- Bassman Tappuni
- Benjamin French
- Sirus Jesudasen
- Christopher V. Cosgriff
- Rebanta Chakraborty
- Jillian Caldwell
- Susan Ziolkowski
- David J. Iberri
- Robert Diep
- Rahul S. Dalal
- Kira L. Newman
- Kristin Galetta
- J. Carl Pallais
- Nancy Wei
- Kathleen M. Buchheit
- David I. Hong
- Ernest Y. Lee
- Allen Shih
- Vartan Pahalyants
- Tamara B. Kaplan
- Vishnu Ravi
- Sarita Khemani
- April S. Liang
- Daniel Shirvani
- Advait Patil
- N. J. Marshall
- K.K. Chopra
- Joel Koh
- Adi Badhwar
- Liam G. McCoy
- David Wu
- Yingjie Weng
- Sumant Ranji
- Kevin A. Schulman
- Nigam H. Shah
- Jason Hom
- Arnold Milstein
- Adam Rodman
- Jonathan H. Chen
- Ethan Goh