OpenAlex · Aktualisierung stündlich · Letzte Aktualisierung: 22.03.2026, 05:57

Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

DispatchMAS: fusing taxonomy and artificial intelligence agents for emergency medical services

2026·0 Zitationen·BMC Emergency MedicineOpen Access
Volltext beim Verlag öffnen

0

Zitationen

20

Autoren

2026

Jahr

Abstract

Emergency medical dispatch is a critical, high-stakes process where dispatcher decisions directly impact patient outcomes. While standardized protocols exist, they are challenged by factors like caller distress, ambiguous symptom descriptions, and high cognitive load. The convergence of Large Language Models (LLMs) and Multi-Agent Systems (MAS) offers a novel opportunity to augment human dispatchers. This study aimed to develop and evaluate a taxonomy-grounded, LLM-powered multi-agent system for simulating realistic clinician’s medical dispatch scenarios. We first constructed a clinically curated taxonomy and fact commons for emergency dispatch, defining 32 Chief Complaints based on national standards, six distinct caller identities derived from real-world electronic health records (Medical Information Mart for Intensive Care III [MIMIC-III]), and a standardized six-phase call protocol. Using this framework, we developed a multi-agent simulation system featuring a Caller Agent and a Dispatcher Agent. The system, built on the AutoGen multi-agent framework for large language models (AutoGen), grounds agent interactions in the fact commons to ensure clinical plausibility and mitigate misinformation. We designed a hybrid evaluation combining expert clinical assessment, automated linguistic analysis, and operational performance dynamics auditing. Four physicians evaluated 100 simulated dispatch cases for “Guidance Efficacy” and “Dispatch Effectiveness” using a structured questionnaire. Automated metrics assessed sentiment, emotion, readability, and politeness of agent-generated dialogue. Operational performance dynamics analyses showed phase-dependent efficiency peaking during Assessment and faster pacing in life-critical events. Human evaluation, with substantial inter-rater agreement (Gwet’s AC1 $$ > \,\text{0.70}$$ , confirmed the system’s high performance. It demonstrated excellent Dispatch Effectiveness (e.g., 94% contacting the correct potential other agents) and Guidance Efficacy (advice provided in 91% of cases), both rated highly by physicians. Algorithmic metrics corroborated these findings, indicating a predominantly neutral affective profile (73.7% neutral sentiment; 90.4% neutral emotion), high readability (Flesch 80.9), and a consistently polite style (60.0% polite; 0% impolite). Operational performance evaluation further showed urgency-adaptive pacing: for life-critical events, information completeness rose faster and plateaued earlier while converging to comparable end-of-call completeness across complaint types. The agent also responded more rapidly in life-critical scenarios (1.8 s per dispatcher turn vs 2.1–2.4 s), indicating accelerated early questioning without loss of overall coverage. Our LLM-based MAS simulates diverse, clinically plausible dispatch scenarios with high fidelity. The resulting platform provides a controlled environment for analyzing dispatcher–caller interactions, stress-testing protocol variants, and deriving structured design patterns that may inform future real-time decision support. Our simulation-based tools could serve as an intermediate step between offline method development and eventual integration into emergency response workflows.

Ähnliche Arbeiten