Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Understanding Large Language Models

2024·4 Zitationen·Advances in computational intelligence and robotics book series

Volltext beim Verlag öffnen

Zitationen

Autoren

2024

Jahr

Abstract

Large language models (LLMs) are a revolutionary development that allows machines to comprehend and produce text, similar to that of humans on a never-before-seen scale. This chapter examines the basic ideas underlying LLMs with an emphasis on their applications, training approaches, and architecture. Deep neural networks with billions of parameters are used by LLMs, such as the GPT-3 model, to capture complex linguistic patterns and contextual subtleties. Massive datasets, frequently drawn from a variety of online sources, are used in the training process to impart a thorough understanding of language. Consequently, LLMs show remarkable abilities in tasks like question answering, language translation, and text generation. Issues like bias, ethical issues, and interpretability thus become important concerns. So, this chapter outlines the main elements of LLMs, discusses their advantages, reviews current research, and addresses the ethical issues surrounding their application.

Autoren

Institutionen

Themen

Artificial Intelligence in Healthcare and EducationTopic ModelingExplainable Artificial Intelligence (XAI)

Volltext beim Verlag öffnen

Understanding Large Language Models

Abstract

Ähnliche Arbeiten

Autoren

Institutionen

Themen