Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Evaluating ChatGPT’s Accuracy and Helpfulness in Gastroesophageal Reflux Disease Queries Compared to Traditional Search Engines: A New Era of Health Information Retrieval
0
Zitationen
4
Autoren
2025
Jahr
Abstract
Abstract The use of Artificial Intelligence (AI) is exponentially growing across medical specialties, offering numerous advantages over traditional search engines. This manuscript aims to fill this gap by evaluating the performance of the AI system ChatGPT in the context of Gastroesophageal Reflux Disease (GERD) related queries compared to traditional search engine responses. We compared the AI system ChatGPT 3.5 with the traditional search engine Google. Six standardized queries on GERD topics (definition, risk factors, diagnosis, symptoms, and management) were generated using Google’s auto-completions. Responses from ChatGPT 3.5 and Google were evaluated for accuracy and helpfulness using a five-point Likert scale by three independent gastroenterology experts. Statistical analysis was performed using Python. For accuracy, ChatGPT received average ratings of 5, 4.67, and 4.67, while traditional search engines scored 4.83, 4.33, and 3.33 ( p -values 0.363, 0.105, 0.001). For helpfulness, ChatGPT was rated 5, 4.67, and 4.83, compared to 4.33, 4, and 3.33 for traditional search engines ( p -values 0.012, 0.025, 0.004). Evaluator 3 found ChatGPT significantly more accurate, while evaluators 1 and 2 saw no significant difference. All three evaluators rated ChatGPT as significantly more helpful. Overall, ChatGPT received higher ratings for both accuracy and helpfulness. This study demonstrates that AI systems like ChatGPT 3.5 provide more accurate and useful information on GERD than traditional search engines like Google. AI can enhance patient education and decision-making, though further research is needed to confirm its reliability across other topics.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.239 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.095 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.463 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.428 Zit.