Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Understanding Large Language Models
4
Zitationen
3
Autoren
2024
Jahr
Abstract
Large language models (LLMs) are a revolutionary development that allows machines to comprehend and produce text, similar to that of humans on a never-before-seen scale. This chapter examines the basic ideas underlying LLMs with an emphasis on their applications, training approaches, and architecture. Deep neural networks with billions of parameters are used by LLMs, such as the GPT-3 model, to capture complex linguistic patterns and contextual subtleties. Massive datasets, frequently drawn from a variety of online sources, are used in the training process to impart a thorough understanding of language. Consequently, LLMs show remarkable abilities in tasks like question answering, language translation, and text generation. Issues like bias, ethical issues, and interpretability thus become important concerns. So, this chapter outlines the main elements of LLMs, discusses their advantages, reviews current research, and addresses the ethical issues surrounding their application.
Ähnliche Arbeiten
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8.260 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.116 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.493 Zit.
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5.776 Zit.
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5.438 Zit.