Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
A Framework for Evaluating AI-Powered Virtual Assistants to Support Older Adults’ Information-Seeking Needs
0
Zitationen
7
Autoren
2025
Jahr
Abstract
Abstract Older adults often face the challenge of searching for critical health, financial, and resource-related information to make complex decisions, a process further complicated by age-related cognitive changes that impact information processing and decision-making. Artificial intelligence (AI)-powered virtual assistants may help by providing concise, easy-to-understand information, yet their accuracy and effectiveness remain unclear. This presentation will introduce a general framework for evaluating AI’s potential to support important decisions of older adults and provide a case example illustrating this approach. To examine the accuracy and utility of AI-powered virtual assistants, we assessed the responses of Alexa, Google Assistant, Bard, and ChatGPT-4 to queries related to Medicare, long-term care insurance, and resource access. Findings showed that Large Language Model (LLM)-based assistants (Bard, ChatGPT-4) were more accurate than non-LLM systems, with Bard producing 6% inaccurate responses compared to Alexa’s 60%. They also provided more supplemental details, with Bard offering high levels of additional information in 79% of responses, compared to 37% for ChatGPT-4 and under 20% for others. However, response variability was observed over time. While LLM-powered virtual assistants may be useful tools for older adults seeking health and financial information, potential inaccuracies, response complexity, and variability must be considered. We will outline key challenges in conducting this research and implementing AI solutions, emphasizing the need for further refinement and user training to enhance reliability and usability for older users.
Ähnliche Arbeiten
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
An Experiment in Linguistic Synthesis with a Fuzzy Logic Controller
1999 · 5.632 Zit.
An experiment in linguistic synthesis with a fuzzy logic controller
1975 · 5.552 Zit.
A FRAMEWORK FOR REPRESENTING KNOWLEDGE
1988 · 4.548 Zit.
Opinion Paper: “So what if ChatGPT wrote it?” Multidisciplinary perspectives on opportunities, challenges and implications of generative conversational AI for research, practice and policy
2023 · 3.317 Zit.