Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Vision-language models for automated video analysis and documentation in laparoscopic surgery: a proof-of-concept study
6
Zitationen
13
Autoren
2025
Jahr
Abstract
GPT-4o and Gemini-1.5-pro performed reliably in object detection and procedure classification but showed limitations in grading pathology and accurately describing procedural steps, which could be enhanced through in-context learning. This shows that domain-agnostic VLMs can be applied to surgical video analysis. In the future, VLMs with domain knowledge can be envisioned to enhance the operating room in the form of companions.
Ähnliche Arbeiten
The SCARE 2020 Guideline: Updating Consensus Surgical CAse REport (SCARE) Guidelines
2020 · 5.579 Zit.
Virtual Reality Training Improves Operating Room Performance
2002 · 2.805 Zit.
An estimation of the global volume of surgery: a modelling strategy based on available data
2008 · 2.513 Zit.
Objective structured assessment of technical skill (OSATS) for surgical residents
1997 · 2.260 Zit.
Does Simulation-Based Medical Education With Deliberate Practice Yield Better Results Than Traditional Clinical Education? A Meta-Analytic Comparative Review of the Evidence
2011 · 1.742 Zit.
Autoren
Institutionen
- Fresenius (Germany)(DE)
- Purdue University West Lafayette(US)
- University Hospital Carl Gustav Carus(DE)
- Indiana University Indianapolis(US)
- Indiana University – Purdue University Indianapolis(US)
- German Cancer Research Center(DE)
- Heidelberg University(DE)
- Helmholtz-Zentrum Dresden-Rossendorf(DE)
- Cicor (Germany)(DE)
- University Hospital Heidelberg(DE)
- National Center for Tumor Diseases(DE)