This is an overview page with metadata for this scientific paper. The full article is available from the publisher.

Vision-language models for automated video analysis and documentation in laparoscopic surgery: a proof-of-concept study

2025 · 6 citations · International Journal of Surgery · Open Access

Abstract

GPT-4o and Gemini-1.5-pro performed reliably in object detection and procedure classification but showed limitations in grading pathology and in accurately describing procedural steps. Performance on these tasks could be enhanced through in-context learning. This shows that domain-agnostic vision-language models (VLMs) can be applied to surgical video analysis. In the future, VLMs with domain knowledge could serve as companions in the operating room.

Topics

Surgical Simulation and Training · Colorectal Cancer Screening and Detection · Artificial Intelligence in Healthcare and Education
