This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
The Singapore Consensus on Global AI Safety Research Priorities
Citations: 0
Authors: 88
Year: 2025
Abstract
Rapidly improving AI capabilities and autonomy hold significant promise of transformation, but are also driving vigorous debate on how to ensure that AI is safe, i.e., trustworthy, reliable, and secure. Building a trusted ecosystem is therefore essential -- it helps people embrace AI with confidence and gives maximal space for innovation while avoiding backlash. The "2025 Singapore Conference on AI (SCAI): International Scientific Exchange on AI Safety" aimed to support research in this space by bringing together AI scientists across geographies to identify and synthesise research priorities in AI safety. This resulting report builds on the International AI Safety Report chaired by Yoshua Bengio and backed by 33 governments. By adopting a defence-in-depth model, this report organises AI safety research domains into three types: challenges with creating trustworthy AI systems (Development), challenges with evaluating their risks (Assessment), and challenges with monitoring and intervening after deployment (Control).
Related works
The global landscape of AI ethics guidelines
2019 · 4,482 citations
The Limitations of Deep Learning in Adversarial Settings
2016 · 3,853 citations
Trust in Automation: Designing for Appropriate Reliance
2004 · 3,362 citations
Fairness through awareness
2012 · 3,258 citations
Mind over Machine: The Power of Human Intuition and Expertise in the Era of the Computer
1987 · 3,182 citations
Authors
- Yoshua Bengio
- Tegan Maharaj
- C.-H. Luke Ong
- Stuart D. Russell
- Dawn Song
- Max Tegmark
- Xue Lan
- Ya-Qin Zhang
- Stephen Casper
- Wan Sie Lee
- Sören Mindermann
- Vanessa Wilfred
- Vidhisha Balachandran
- Fazl Barez
- Michael Belinsky
- Imane Bello
- Malo Bourgon
- Mark Brakel
- Siméon Campos
- Duncan Cass-Beggs
- Jiahao Chen
- Rumman Chowdhury
- Kuan Chua Seah
- Jeff Clune
- Jie Dai
- Agnes Delaborde
- Nouha Dziri
- Francisco Eiras
- Joshua Engels
- Jinyu Fan
- Adam Gleave
- Noah D. Goodman
- Fynn Heide
- Johannes Heidecke
- Dan Hendrycks
- Cyrus Hodes
- Bryan Low Kian Hsiang
- Minlie Huang
- Sami Jawhar
- Jingyu Wang
- Adam Tauman Kalai
- Meindert Kamphuis
- Mohan Kankanhalli
- Subhash Kantamneni
- M. Kirk
- Thomas Kwa
- Jeffrey Ladish
- Kwok-Yan Lam
- Wan Lee Sie
- Taewhi Lee
- Xiaopeng Li
- Jiajun Liu
- Ching‐Cheng Lu
- Yifan Mai
- Richard Mallah
- Julian Michael
- Nick Moës
- Simon Geir Møller
- K. H. Nam
- TP Ng
- Mark Nitzberg
- Besmira Nushi
- Seán Ó hÉigeartaigh
- Alejandro Ortega
- Pierre Peigné
- J. Howard Petrie
- Benjamin Prud'homme
- Reihaneh Rabbany
- Nayat Sanchez-Pi
- Sarah Schwettmann
- Buck Shlegeris
- Saad Siddiqui
- Ashish Sinha
- Martín Soto
- Cheston Tan
- Ting Dong
- William Tjhi
- Robert Trager
- Brian Tse
- Anthony Tung K. H.
- Vanessa Wilfred
- John Willes
- David Wong
- Wei Xu
- Rong Xu
- Yi Zeng
- Hao Zhang
- Djordje Žikelić