Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Retrieval-Augmented Large Language Models for a Chronic Kidney Disease Patient Education Chatbot
0
Zitationen
3
Autoren
2025
Jahr
Abstract
Chronic Kidney Disease (CKD) is rising globally, underscoring the need for scalable and accurate patient education. This paper presents the design and evaluation of a CKD education chatbot specifically for hemodialysis cases in Indonesia that runs on-premises and employs a Retrieval-Augmented Generation (RAG) framework. We conducted a comparative study of a general-purpose LLM (Llama 3.1) and a language adapted LLM (Sahabat AI). Both models were used in identical RAG pipelines and paired with several retrieval embeddings. Using 70 unique questions evaluated across 10 runs per configuration (n = 700), we assessed semantic similarity (METEOR), factual correctness, and RAG-specific metrics (faithfulness, answer relevancy, context precision) with a Gemini 2.0 Flash evaluator. RAG consistently improved performance over non-RAG baselines, with statistically significant gains across metrics. Sahabat AI outperformed Llama 3.1, posting the highest METEOR (0.4116, mContriever) and correctness (4.0293 with BGE-M3). It also exhibits higher, more stable faithfulness (∼0.95), indicating closer adherence to the source evidence. These findings suggest that, for healthcare education, localized LLMs combined with carefully chosen embeddings can yield more effective and trustworthy systems.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.687 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.591 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.114 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.867 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.