Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Iterative Multidisciplinary Development and Evaluation a Patient-Facing SDoH Chatbot Using Synthetic Data Simulation (Preprint)
0
Zitationen
10
Autoren
2025
Jahr
Abstract
<sec> <title>BACKGROUND</title> Systematic collection of social determinants of health (SDoH) data remains inconsistent across healthcare settings, despite its critical impact on patient outcomes. Large language model (LLM)-powered chatbots offer promise for scalable SDoH data collection, but rigorous, feasible evaluation methods for patient-facing applications are lacking. </sec> <sec> <title>OBJECTIVE</title> To describe an efficient, iterative, multidisciplinary approach for developing and evaluating a patient-facing SDoH chatbot using synthetic data and case simulation, with the goal of optimizing both chatbot performance and the evaluation rubric prior to clinical deployment. </sec> <sec> <title>METHODS</title> A 10-criterion evaluation rubric was adapted from established healthcare artificial intelligence (AI) frameworks and applied to 27 synthetic clinical scenarios representing diverse SDoH profiles. Scenarios were role-played by a licensed clinical social worker, and chatbot-patient interactions were independently rated by three multidisciplinary experts (social worker, nurse practitioner, physician). Quantitative analysis used descriptive statistics and percent agreement to characterize chatbot performance and rater consensus, with percent agreement selected due to high prevalence of ceiling effects in several domains. Qualitative analysis synthesized rater feedback to guide iterative refinement of both chatbot prompts and rubric domains. </sec> <sec> <title>RESULTS</title> The chatbot demonstrated robust performance in domains such as accurate interpretation (mean = 0.98, SD = 0.09), communication quality and cultural sensitivity (mean = 0.99, SD = 0.06), and adaptive questioning (mean = 0.99, SD = 0.06), with near-perfect rater agreement. Lower scores and greater variability were observed in completeness of data collection, systematic domain exploration, and safety, prompting targeted adaptations. Qualitative feedback highlighted the importance of distinguishing screening from clinical interviewing capabilities and informed the refinement of the rubric, including clarifying the definition of safety to focus on recognition of physical and mental health emergencies. </sec> <sec> <title>CONCLUSIONS</title> This study provides a practical, replicable blueprint for pre-deployment evaluation of patient-facing SDoH chatbots, balancing rigor with feasibility. The iterative, multidisciplinary approach enabled rapid identification and remediation of performance gaps, supporting responsible integration of AI into SDoH data collection. Explicit performance thresholds and rubric refinement are essential for protecting patient trust and safety, particularly in vulnerable populations. Future work will validate findings with real patient interactions and expand stakeholder involvement. </sec> <sec> <title>CLINICALTRIAL</title> N/A </sec>
Ähnliche Arbeiten
Amazon's Mechanical Turk
2011 · 10.029 Zit.
The Transtheoretical Model of Health Behavior Change
1997 · 7.687 Zit.
COVID-19 and mental health: A review of the existing literature
2020 · 3.707 Zit.
Cognitive Therapy and the Emotional Disorders
1977 · 2.931 Zit.
Mental health problems and social media exposure during COVID-19 outbreak
2020 · 2.790 Zit.