This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Challenges and Choices when Evaluating Alignment in Human-AI Systems
Citations: 0
Authors: 2
Year: 2025
Abstract
Aligning AI to human values is a current research endeavor where much focus goes to training AI systems to align with values, goals and tasks. But evaluating whether those aligned systems are actually better and more trusted by human users is an essential part of improving such systems. We present three challenges encountered in the evaluation of aligned AI systems. We present possible solutions to these challenges, discuss our own and alternative design choices, and outline next steps for AI alignment research to flourish.
Similar Works
The global landscape of AI ethics guidelines
2019 · 4,514 citations
The Limitations of Deep Learning in Adversarial Settings
2016 · 3,859 citations
Trust in Automation: Designing for Appropriate Reliance
2004 · 3,386 citations
Fairness through awareness
2012 · 3,269 citations
Mind over Machine: The Power of Human Intuition and Expertise in the Era of the Computer
1987 · 3,183 citations