Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Current validation practice undermines surgical AI development
1
Zitationen
97
Autoren
2025
Jahr
Abstract
Surgical data science (SDS) is rapidly advancing, yet clinical adoption of artificial intelligence (AI) in surgery remains severely limited, with inadequate validation emerging as a key obstacle. In fact, existing validation practices often neglect the temporal and hierarchical structure of intraoperative videos, producing misleading, unstable, or clinically irrelevant results. In a pioneering, consensus-driven effort, we introduce the first comprehensive catalog of validation pitfalls in AI-based surgical video analysis that was derived from a multi-stage Delphi process with 91 international experts. The collected pitfalls span three categories: (1) data (e.g., incomplete annotation, spurious correlations), (2) metric selection and configuration (e.g., neglect of temporal stability, mismatch with clinical needs), and (3) aggregation and reporting (e.g., clinically uninformative aggregation, failure to account for frame dependencies in hierarchical data structures). A systematic review of surgical AI papers reveals that these pitfalls are widespread in current practice, with the majority of studies failing to account for temporal dynamics or hierarchical data structure, or relying on clinically uninformative metrics. Experiments on real surgical video datasets provide the first empirical evidence that ignoring temporal and hierarchical data structures can lead to drastic understatement of uncertainty, obscure critical failure modes, and even alter algorithm rankings. This work establishes a framework for the rigorous validation of surgical video analysis algorithms, providing a foundation for safe clinical translation, benchmarking, regulatory review, and future reporting standards in the field.
Ähnliche Arbeiten
The SCARE 2020 Guideline: Updating Consensus Surgical CAse REport (SCARE) Guidelines
2020 · 5.571 Zit.
Virtual Reality Training Improves Operating Room Performance
2002 · 2.782 Zit.
An estimation of the global volume of surgery: a modelling strategy based on available data
2008 · 2.503 Zit.
Objective structured assessment of technical skill (OSATS) for surgical residents
1997 · 2.256 Zit.
Does Simulation-Based Medical Education With Deliberate Practice Yield Better Results Than Traditional Clinical Education? A Meta-Analytic Comparative Review of the Evidence
2011 · 1.701 Zit.
Autoren
- Annika Reinke
- Z. Li
- Minu D. Tizabi
- Paulo André
- Marcel Knopp
- M. Rother
- Inês Machado
- Maria S. Altieri
- Deepak Alapatt
- Sophia Bano
- Sebastian Bodenstedt
- Oliver Burgert
- Elvis C. S. Chen
- Justin Collins
- Olivier Colliot
- Evangelia Christodoulou
- Tobias Czempiel
- Adrito Das
- Reuben Docea
- Daniel A. Donoho
- Qi Dou
- Jennifer A. Eckhoff
- Sandy Engelhardt
- Gábor Fichtinger
- Philipp Fürnstahl
- Pablo García Kilroy
- Stamatia Giannarou
- Stephen Gilbert
- Ines Gockel
- Patrick Godau
- Jan Gödeke
- Teodor Grantcharov
- Tamás Haidegger
- Alexander Hann
- Makoto Hashizume
- Charles Heitz
- Rebecca Hisey
- Hanna Hoffmann
- Arnaud Huaulmé
- Paul F. Jäger
- Pierre Jannin
- Anthony Jarc
- R. Jena
- Yueming Jin
- Leo Joskowicz
- Luc Joyeux
- Max Kirchner
- Axel Krieger
- Gernot Kronreif
- Kyle Lam
- Stefan Laufer
- Joël L. Lavanchy
- G Lee
- Robert H. Lim
- Peng Liu
- Hani J. Marcus
- Pietro Mascagni
- Ozanan R. Meireles
- Beat P. Mueller
- Lars Mündermann
- Hirenkumar Nakawala
- Nassir Navab
- Abdourahmane Ndong
- Juliane Neumann
- Felix Nickel
- Marco Nolden
- Chinedu Innocent Nwoye
- Noeun Oh
- Nicolas Padoy
- Thomas Pausch
- Micha Pfeiffer
- Tim Rädsch
- Hongliang Ren
- Nicola Rieke
- Dominik Rivoir
- Duygu Sarikaya
- Samuel Schmidgall
- Matthias Seibold
- Silvia Seidlitz
- Lalith Sharan
- Jeffrey H. Siewerdsen
- Vinkle Srivastav
- Raphael Sznitman
- Russell Taylor
- Thuy Nuong Tran
- Mathias Unberath
- Fons van der Sommen
- Martin Wagner
- Amine Yamlahi
- S. Kevin Zhou
- Aneeq Zia
- Amin Madani
- Danail Stoyanov
- Stefanie Speidel
- D. Hashimoto
- Fiona R. Kolbinger
- Lena Maier‐Hein