OpenAlex · Aktualisierung stündlich · Letzte Aktualisierung: 13.03.2026, 02:47

Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Performance of ChatGPT-4o and Four Open-Source Large Language Models in Generating Diagnoses Based on China’s Rare Disease Catalog: Comparative Study

2025·6 Zitationen·Journal of Medical Internet ResearchOpen Access
Volltext beim Verlag öffnen

6

Zitationen

9

Autoren

2025

Jahr

Abstract

ChatGPT-4o demonstrated superior diagnostic performance for rare diseases. While Llama3.1:8b demonstrates viability for localized deployment in resource-constrained English diagnostic workflows, Chinese applications require larger models to achieve comparable diagnostic accuracy. This urgency is heightened by the release of open-source models like DeepSeek-R1, which may see rapid adoption without thorough validation. Successful clinical implementation of LLMs requires 3 core elements: model parameterization, user language, and pretraining data. The integration of RAG significantly enhanced open-source LLM accuracy for rare disease diagnosis, although caution remains warranted for low-parameter reasoning models showing substantial performance limitations. We recommend hospital IT departments and policymakers prioritize language relevance in model selection and consider integrating RAG with curated knowledge bases to enhance diagnostic utility in constrained settings, while exercising caution with low-parameter models.

Ähnliche Arbeiten

Autoren

Institutionen

Themen

Genomics and Rare DiseasesArtificial Intelligence in Healthcare and EducationTopic Modeling
Volltext beim Verlag öffnen