Michael R. Lyu

995 Arbeiten42.020 Zitationen

Relevante Arbeiten

Meistzitierte Publikationen im Bereich Gesundheit & MedTech

Revisiting the Reliability of Psychological Scales on Large Language Models

2023 · 10 Zit. · arXiv (Cornell University)

Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench

2023 · 6 Zit. · arXiv (Cornell University)

How Well Can LLMs Echo Us? Evaluating AI Chatbots' Role-Play Ability with ECHO

2024 · 4 Zit. · arXiv (Cornell University)

All Languages Matter: On the Multilingual Safety of Large Language Models

2023 · 4 Zit. · arXiv (Cornell University)

Identifying the Achilles' Heel: An Iterative Method for Dynamically Uncovering Factual Errors in Large Language Models

2024 · 3 Zit. · arXiv (Cornell University)

LogicAsker: Evaluating and Improving the Logical Reasoning Ability of Large Language Models

2024 · 2 Zit. · arXiv (Cornell University)

Rigor, Reliability, and Reproducibility Matter: A Decade-Scale Survey of 572 Code Benchmarks

2025 · 0 Zit. · ArXiv.org

New Job, New Gender? Measuring the Social Bias in Image Generation Models

2024 · 0 Zit. · arXiv (Cornell University)