Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Evaluation of ChatGPT‐4, Gemini, Claude, and Copilot in Generating Nursing Diagnoses Based on NANDA‐I Taxonomy II: A Comparative Cross‐Sectional Study
0
Zitationen
2
Autoren
2025
Jahr
Abstract
AIM: To evaluate the capability of large language models to generate nursing diagnoses based on NANDA-I Taxonomy II and assess their performance across domains and overall. BACKGROUND: Large language models are emerging tools in nursing, showing potential to aid in diagnosis generation and education. However, their accuracy and applicability in clinical and educational settings remain underexplored. METHODS: This cross-sectional comparative study used 10 realistic patient scenarios based on NANDA-I Taxonomy II, covering 12 domains. The study aimed to evaluate the capability of four models to generate nursing diagnoses based on patient scenarios. The responses were assessed by five nursing experts for accuracy and alignment with NANDA-I Taxonomy II in a single-blind evaluation process. RESULTS: All models demonstrated similar performance across different domains and overall, with Claude attaining the highest overall performance score. Expert evaluations indicated moderate interrater reliability. DISCUSSION: Small variations between models and occasional omissions suggest that expert review is still required before clinical use. CONCLUSIONS: Large language models are not yet sufficiently reliable for independent use in clinical settings and nursing education. Their application as supportive tools necessitates a cautious approach. Moreover, the development of specialized models designed to address the unique demands of the nursing field would be advantageous. IMPLICATIONS FOR NURSING: When large language models are used in nursing practice, their limitations should be considered, and the outputs they produce should be verified by nurses. IMPLICATIONS FOR NURSING POLICY: Ensuring the safe integration of artificial intelligence tools into nursing necessitates the establishment of robust regulatory policies to safeguard patient safety, the deployment of effective systems to monitor models' performance, and the development of comprehensive guidelines and training programs.
Ähnliche Arbeiten
Three Approaches to Qualitative Content Analysis
2005 · 43.392 Zit.
Qualitative content analysis in nursing research: concepts, procedures and measures to achieve trustworthiness
2003 · 20.574 Zit.
Nursing Research - Generating And Assessing Evidence For Nursing Practice
2016 · 8.521 Zit.
Nursing Research: Principles and Methods
1987 · 6.970 Zit.
Nursing Research Generating and Assessing Evidence for Nursing Practice
2013 · 5.594 Zit.