This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Comparative Evaluation of ChatGPT, Gemini, and DeepSeek in Educational Problem Solving
Citations: 0
Authors: 4
Year: 2025
Abstract
This study compares the performance of three large language models (ChatGPT, Gemini, and DeepSeek) on a set of programming-related educational problems from the Aizu Online Judge (AOJ) platform. The evaluation focuses on problem-solving accuracy and code characteristics, with additional comparisons to human Java submissions to contextualize model performance. Metrics include CPU time, memory usage, and code size, enabling a detailed analysis of solution quality and efficiency. Results indicate that ChatGPT consistently achieves the most efficient solutions while maintaining high accuracy, often matching the fastest human submissions. Gemini and DeepSeek also demonstrate strong accuracy but tend to produce less optimized code in computationally demanding cases. These findings contribute to understanding how current LLMs can address structured problem-solving tasks within educational environments.
Related Work
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 · 8,436 citations
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8,311 citations
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7,753 citations
Proceedings of the 19th International Joint Conference on Artificial Intelligence
2005 · 5,781 citations
Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
2018 · 5,523 citations