Yanjun Gao
Relevant Works
Most-cited publications in Health & MedTech
Evaluating clinical AI summaries with large language models as judges
2025 · 12 citations · npj Digital Medicine
Automating Evaluation of AI Text Generation in Healthcare with a Large Language Model (LLM)-as-a-Judge
2025 · 12 citations · medRxiv
LCD benchmark: long clinical document benchmark on mortality prediction for language models
2024 · 6 citations · Journal of the American Medical Informatics Association
WITHDRAWN: Prompt Engineering GPT-4 to Answer Patient Inquiries: A Real-Time Implementation in the Electronic Health Record across Provider Clinics
2024 · 3 citations · medRxiv
LCD Benchmark: Long Clinical Document Benchmark on Mortality Prediction for Language Models
2024 · 2 citations · medRxiv
Rx-LLM: a benchmarking suite to evaluate safe large language model performance for medication-related tasks
2025 · 1 citation · medRxiv
Simple Yet Effective: An Information-Theoretic Approach to Multi-LLM Uncertainty Quantification
2025 · 1 citation
Large Language Models with Temporal Reasoning for Longitudinal Clinical Summarization and Prediction
2025 · 1 citation
Race, Ethnicity and Their Implication on Bias in Large Language Models
2026 · 0 citations · medRxiv
A Scoping Review of Publicly Available Language Tasks in Clinical Natural Language Processing
2021 · 0 citations · arXiv (Cornell University)
Toward Digital Twins in the Intensive Care Unit: A Medication Management Case Study
2024 · 0 citations · medRxiv
Brittleness and Promise: Knowledge Graph Based Reward Modeling for Diagnostic Reasoning
2025 · 0 citations · ArXiv.org
Multi-Task Training with In-Domain Language Models for Diagnostic Reasoning
2023 · 0 citations · arXiv (Cornell University)