Xing Xie

710 Arbeiten46.244 Zitationen

Relevante Arbeiten

Meistzitierte Publikationen im Bereich Gesundheit & MedTech

A Survey on Evaluation of Large Language Models

2024 · 2.302 Zit. · ACM Transactions on Intelligent Systems and Technology

Defending ChatGPT against jailbreak attack via self-reminders

2023 · 109 Zit. · Nature Machine Intelligence

On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective

2023 · 90 Zit. · arXiv (Cornell University)

TrustLLM: Trustworthiness in Large Language Models

2024 · 52 Zit. · arXiv (Cornell University)

Trustworthy Machine Learning: Robustness, Generalization, and Interpretability

2023 · 7 Zit.

Unpacking the Ethical Value Alignment in Big Models

2023 · 4 Zit. · arXiv (Cornell University)

Unintended Harms of Value-Aligned LLMs: Psychological and Empirical Insights

2025 · 0 Zit. · ArXiv.org