Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Surgeon, Trainee, or <scp>GPT</scp> ? A Blinded Multicentric Study of <scp>AI</scp> ‐Augmented Operative Notes
5
Zitationen
15
Autoren
2025
Jahr
Abstract
OBJECTIVES: Clear, complete operative documentation is essential for surgical safety, continuity of care, and medico-legal standards. Large language models such as ChatGPT offer promise for automating clinical documentation; however, their performance in operative note generation, particularly in surgical subspecialties, remains underexplored. This study aimed to compare the quality, accuracy, and efficiency of operative notes authored by a surgical resident, attending surgeon, GPT alone, and an attending surgeon using GPT as a writing aid. METHODS: Five publicly available otolaryngologic procedures were selected. For each procedure, four operative notes were generated, one by a resident, one by an attending, one by GPT alone, and one by a hybrid of attending plus GPT. Ten blinded otolaryngologists (five residents, five attendings) independently reviewed all 20 notes. Reviewers scored each note across eight domains using a five-point scale, assigned a final approval rating, and provided qualitative feedback. Writing time was recorded to assess documentation efficiency. RESULTS: Hybrid notes written by an attending surgeon with GPT assistance received the highest average domain scores and the highest "as is" approval rate (79%), outperforming all other groups. GPT-only notes were the fastest to generate but had the lowest approval rate (23%) and the highest incidence of both omissions and overdocumentation. Writing time was significantly reduced in both AI-assisted groups compared to human-only authorship. Inter-rater reliability among reviewers was moderate to high across most domains. CONCLUSION: In this limited dataset, hybrid human-AI collaboration outperformed both human-only and AI-only authorship in operative documentation. These findings support GPT-assisted documentation to improve operative note efficiency and consistency. LEVEL OF EVIDENCE: N/A.
Ähnliche Arbeiten
The SCARE 2018 statement: Updating consensus Surgical CAse REport (SCARE) guidelines
2018 · 2.335 Zit.
The Levels of Evidence and Their Role in Evidence-Based Medicine
2011 · 2.115 Zit.
CARE guidelines for case reports: explanation and elaboration document
2017 · 2.020 Zit.
The SCARE Statement: Consensus-based surgical case report guidelines
2016 · 1.681 Zit.
Dermoscopy of pigmented skin lesions: Results of a consensus meeting via the Internet
2003 · 1.201 Zit.
Autoren
Institutionen
- Sheba Medical Center(IL)
- AOL (United States)(US)
- Ospedale San Paolo(IT)
- Royal Hallamshire Hospital(GB)
- Sheffield Teaching Hospitals NHS Foundation Trust(GB)
- Università degli Studi di Enna Kore(IT)
- University of Catania(IT)
- University of Antwerp(BE)
- Antwerp University Hospital(BE)
- Hospital General Universitario De Valencia(ES)
- Biogipuzkoa Health Research Institute(ES)
- Osakidetza(ES)
- Children's National(US)