This is an overview page with metadata for this scholarly work. The full article is available from the publisher.
A Cross-Domain Performance Report of Open AI ChatGPT o1 Model
Citations: 3
Authors: 2
Year: 2024
Abstract
Large language models (LLMs) represent a leap in the capabilities of artificial intelligence (AI) in natural language understanding, problem-solving, and domain-specific reasoning. Comparative and cross-domain evaluations of LLMs can help us understand their versatility and limitations, including their real-world applicability. The o1 model developed by OpenAI is a notable milestone in state-of-the-art language processing and task execution. This report evaluates the o1 (o1-preview) model on a range of tasks, including mathematics, clinical knowledge, professional ethics, and the humanities. The results show that o1 excels in fields requiring specialized knowledge, such as college biology (98%) and clinical knowledge (93%). By comparison, it performs less well in areas such as professional law (54%) and business ethics (81%).
Similar works
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8,292 citations
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8,143 citations
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7,539 citations
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5,776 citations
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5,452 citations