Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Development of a Synthetic Oncology Pathology Dataset for Large Language Model Evaluation in Medical Text Classification
0
Zitationen
11
Autoren
2025
Jahr
Abstract
BACKGROUND: Large Language Models (LLMs) offer promising applications in oncology pathology report classification, improving efficiency, accuracy, and automation. However, the use of real patient data is restricted due to legal and ethical concerns, necessitating privacy-compliant alternatives. OBJECTIVES: This study aimed to develop a synthetic oncology pathology dataset to serve as a benchmark for LLM evaluation, enabling reproducible and privacy-preserving AI research. METHODS: A total of 227 synthetic pathology reports were generated using Microsoft Copilot, ChatGPT Plus, and Perplexity Pro to ensure structural and linguistic diversity. The dataset included cases of prostate (n=75), lung (n=78), and breast (n=74) cancer, evenly distributed between malignant (n=113) and benign (n=114) findings. Reports were reviewed and classified by three independent cancer registrars using a consensus-based validation process. RESULTS & CONCLUSION: The dataset provides a structured, clinically relevant benchmark for evaluating LLM performance in pathology text classification. It enables AI model assessment without compromising data privacy, paving the way for scalable and ethical AI-driven oncology documentation.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.644 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.550 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.061 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.850 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.