Michael R. Lyu
Relevante Arbeiten
Meistzitierte Publikationen im Bereich Gesundheit & MedTech
Revisiting the Reliability of Psychological Scales on Large Language Models
2023 · 10 Zit. · arXiv (Cornell University)
Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench
2023 · 6 Zit. · arXiv (Cornell University)
All Languages Matter: On the Multilingual Safety of Large Language Models
2023 · 4 Zit. · arXiv (Cornell University)
How Well Can LLMs Echo Us? Evaluating AI Chatbots' Role-Play Ability with ECHO
2024 · 3 Zit. · arXiv (Cornell University)
The Earth is Flat? Unveiling Factual Errors in Large Language Models
2024 · 3 Zit. · arXiv (Cornell University)
LogicAsker: Evaluating and Improving the Logical Reasoning Ability of Large Language Models
2024 · 2 Zit. · arXiv (Cornell University)
Rigor, Reliability, and Reproducibility Matter: A Decade-Scale Survey of 572 Code Benchmarks
2025 · 0 Zit. · ArXiv.org
New Job, New Gender? Measuring the Social Bias in Image Generation Models
2024 · 0 Zit. · arXiv (Cornell University)