OpenAlex · Aktualisierung stündlich · Letzte Aktualisierung: 01.04.2026, 10:54

Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Can ChatGPT Diagnose Hematological Diseases from Peripheral Blood Smear Images?

2025·0 Zitationen·Turkish Journal of HematologyOpen Access
Volltext beim Verlag öffnen

0

Zitationen

2

Autoren

2025

Jahr

Abstract

Recent advancements in artificial intelligence (AI), especially in natural language processing, have led to the growing interest in integrating large language models (LLMs) into clinical workflows.ChatGPT, a widely used LLM, has demonstrated potential in generating coherent responses across a wide array of domains.However, its applicability in medical image interpretation, particularly hematological image analysis, remains a matter of scientific inquiry.In this pilot study, we assessed ChatGPT's ability to analyze 42 peripheral blood smear (PBS) images representing common hematological conditions, including both malignant (n=34) and benign (n=8) cases.Images were selected from Anderson's Atlas of Hematology [1].For each image, ChatGPT (GPT-4, OpenAI, April 2024 version) was prompted three times with identical instructions: Identify if there are any pathological cells, specify the cell type (RBC, WBC, PLT), suggest a preliminary diagnosis, and recommend further confirmatory tests.ChatGPT consistently identified abnormal cells and correctly classified the cell type in 97.6% of cases.However, its diagnostic accuracy was limited: the correct diagnosis was achieved in 23.8% of cases overall, with better performance in acute leukemias (33.3%) and benign conditions (25%).Notably, ChatGPT failed to correctly diagnose any of the chronic leukemia cases.The model performed better in recommending appropriate confirmatory tests (e.g., bone marrow biopsy, flow cytometry) in 69% of cases.We acknowledge several methodological limitations: (i) the use of standardized textbook images may introduce bias if similar data were part of ChatGPT's pretraining set, (ii) real-world clinical smear images may differ significantly in resolution, staining quality, and morphological complexity, and (iii) ChatGPT's output can vary due to prompt randomness and temperature parameters if not fixed.Moreover, the version used (GPT-4) is now surpassed by GPT-4o (May 2024), which may exhibit different capabilities [2].Although our findings suggest that ChatGPT may assist in preliminary morphological recognition, its accuracy in disease diagnosis remains insufficient for clinical use.This is not unexpected, considering that hematological diagnoses, particularly for chronic leukemias, often rely on integrated morphological, immunophenotypic, cytogenetic, and molecular data [3,4].Diagnostic accuracy is further hindered in the presence of low-resolution or poorly illuminated images, where subtle cytological clues are difficult to discern limitations inherent to LLMs without dedicated vision processing [5,6].Several recent studies have evaluated LLMs in this domain.For example, Cai et al. demonstrated GPT-4o's capacity to identify abnormal cells with moderate accuracy in simulated blood morphology settings [5].Negrini et al. and Pighi et al. further explored ChatGPT's application in classifying urinary and hematological particles, with mixed results and significant variability [6,7].Our work aligns with these findings and underscores the need for controlled validation in real-world clinical datasets.To clarify further, our study focused primarily on morphological identification and did not incorporate molecular or immunophenotypic data into the diagnostic process.A detailed breakdown of evaluated cases has been added to the supplementary table, including AML-M1, AML-M2, AML-M4, reactive lymphocytosis, iron deficiency anemia, and inflammatory thrombocytosis.u n c o r r e c t e d p r o o fIn conclusion, while ChatGPT holds promise as an adjunctive tool for hematology education or initial triage, it cannot replace expert interpretation of peripheral blood smears.Future directions should focus on hybrid models that integrate image processing and contextual language understanding, alongside rigorous validation on institution-specific datasets.

Ähnliche Arbeiten

Autoren

Institutionen

Themen

Artificial Intelligence in Healthcare and EducationRadiomics and Machine Learning in Medical ImagingCOVID-19 diagnosis using AI
Volltext beim Verlag öffnen