Xing Xie
Relevant Works
Most-cited publications in Health & MedTech
A Survey on Evaluation of Large Language Models
2024 · 2,184 citations · ACM Transactions on Intelligent Systems and Technology
Defending ChatGPT against jailbreak attack via self-reminders
2023 · 96 citations · Nature Machine Intelligence
On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective
2023 · 90 citations · arXiv (Cornell University)
TrustLLM: Trustworthiness in Large Language Models
2024 · 50 citations · arXiv (Cornell University)
Trustworthy Machine Learning: Robustness, Generalization, and Interpretability
2023 · 7 citations
Unpacking the Ethical Value Alignment in Big Models
2023 · 4 citations · arXiv (Cornell University)
Unintended Harms of Value-Aligned LLMs: Psychological and Empirical Insights
2025 · 0 citations · ArXiv.org