Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Towards the Reliability of Agentic Artificial Intelligence Models in Healthcare
0
Zitationen
3
Autoren
2026
Jahr
Abstract
SUMMARY & CONCLUSIONSRecent advancements in Artificial Intelligence (AI) have revealed promising opportunities across industries. Continuous learning systems such as agentic AI models are designed to work collaboratively, adapt decision-making strategies, and solve complex problems. In healthcare, current practice has shown that it is challenging to certify the reliability of systems that are powered by self-learning AI models. This paper presents a methodology for evaluating and optimizing the reliability of agentic models, based on a multi-step approach.The case study presented in this work involves agentic models comprising a Generative Pre-trained Transformer (GPT) model, agents, and tools. The model configurations took event data and users’ prompt as input. The prompts were decomposed and orchestrated to appropriate tools to generate general and specific recommendations as output through tool learning. The model configurations were evaluated against performance metrics such as task adherence, tool call accuracy and intent resolution. A reliability index was introduced, based on the metrics obtained.A key strength of the proposed approach is its extended reach to knowledge resources and tools, beyond the specialty of foundation models. The proposed reliability evaluation approach can be applied to situations where scalability is required to monitor and optimize AI systems.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.336 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.207 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.607 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.476 Zit.