Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Llama 3.1 405B Is Comparable to GPT-4 for Extraction of Data from Thrombectomy Reports—A Step Towards Secure Data Extraction
4
Zitationen
10
Autoren
2025
Jahr
Abstract
PURPOSE: GPT‑4 has been shown to correctly extract procedural details from free-text reports on mechanical thrombectomy. However, GPT may not be suitable for analyzing reports containing personal data. The purpose of this study was to evaluate the ability of the large language models (LLM) Llama3.1 405B, Llama3 70B, Llama3 8B, and Mixtral 8X7B, that can be operated offline, to extract procedural details from free-text reports on mechanical thrombectomies. METHODS: Free-text reports on mechanical thrombectomy from two institutions were included. A detailed prompt was used in German and English languages. The ability of the LLMs to extract procedural data was compared to GPT‑4 using McNemar's test. The manual data entries made by an interventional neuroradiologist served as the reference standard. RESULTS: 100 reports from institution 1 (mean age 74.7 ± 13.2 years; 53 females) and 30 reports from institution 2 (mean age 72.7 ± 13.5 years; 18 males) were included. Llama 3.1 405B extracted 2619 of 2800 data points correctly (93.5% [95%CI: 92.6%, 94.4%], p = 0.39 vs. GPT-4). Llama3 70B with the English prompt extracted 2537 data points correctly (90.6% [95%CI: 89.5%, 91.7%], p < 0.001 vs. GPT-4), and 2471 (88.2% [95%CI: 87.0%, 89.4%], p < 0.001 vs. GPT-4) with the German prompt. Llama 3 8B extracted 2314 data points correctly (86.1% [95%CI: 84.8%, 87.4%], p < 0.001 vs. GPT-4), and Mixtral 8X7B extracted 2411 (86.1% [95%CI: 84.8%, 87.4%], p < 0.001 vs. GPT-4) correctly. CONCLUSION: Llama 3.1 405B was equal to GPT‑4 for data extraction from free-text reports on mechanical thrombectomies and may represent a data secure alternative, when operated locally.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.764 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.674 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.234 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.898 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.