Wenxuan Wang
80 Arbeiten1.259 Zitationen
Relevante Arbeiten
Meistzitierte Publikationen im Bereich Gesundheit & MedTech
Revisiting the Reliability of Psychological Scales on Large Language Models
2023 · 10 Zit. · arXiv (Cornell University)
Identifying the Achilles' Heel: An Iterative Method for Dynamically Uncovering Factual Errors in Large Language Models
2024 · 3 Zit. · arXiv (Cornell University)
Rigor, Reliability, and Reproducibility Matter: A Decade-Scale Survey of 572 Code Benchmarks
2025 · 0 Zit. · ArXiv.org
Knowledge-to-Jailbreak: Investigating Knowledge-driven Jailbreaking Attacks for Large Language Models
2025 · 0 Zit.
New Job, New Gender? Measuring the Social Bias in Image Generation Models
2024 · 0 Zit. · arXiv (Cornell University)