Most-Cited Publications in Health & MedTech
Do Large Language Models have Shared Weaknesses in Medical Question Answering?
2023 · 1 citation · arXiv (Cornell University)
Measuring what Matters: Construct Validity in Large Language Model Benchmarks
2025 · 0 citations · ArXiv.org