Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Bias in Large Language Models: Methods, Evaluation, and Prospects
0
Zitationen
1
Autoren
2026
Jahr
Abstract
With the in-depth penetration of large language models (LLMs) such as ChatGPT and DeepSeek into critical domains including recruitment, healthcare, and finance, the issue of bias in their outputs has become a core bottleneck restricting the credible application of the technology. This paper systematically reviews the latest advances in the field of LLM bias research, classifies mainstream debiasing methods into three categories—data-level, model-level, and application-level—based on their intervention stages, elaborates on the technical logic of each category of methods, analyzes their performance in various aspects, combs through common evaluation datasets and indicator systems, and finally conducts an in-depth analysis of current research limitations and proposes targeted solutions. This paper meticulously classifies mainstream methods in recent years according to their action stages and principles, and from a practical perspective, selects factors such as interpretability, cost, and closed-source adaptability for evaluation, which are visualized as radar charts. This facilitates the analysis of the applicable scenarios of the three categories of methods, aims to analyze the advantages and disadvantages of mainstream methods, clearly presents the current research status in this field, and provides ideas for future research.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.687 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.591 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 8.114 Zit.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
2019 · 6.867 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.781 Zit.