OpenAlex · Aktualisierung stündlich · Letzte Aktualisierung: 12.04.2026, 01:48

Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Comparative Evaluation of ChatGPT, Gemini, and DeepSeek in Educational Problem Solving

2025·0 Zitationen
Volltext beim Verlag öffnen

0

Zitationen

4

Autoren

2025

Jahr

Abstract

This study compares the performance of three large language models; ChatGPT, Gemini, and DeepSeek, on a set of programming-related educational problems from the Aizu Online Judge (AOJ) platform. The evaluation focuses on problem-solving accuracy and code characteristics, with additional comparisons to human Java submissions to contextualize model performance. Metrics include CPU time, memory usage, and code size, enabling a detailed analysis of solution quality and efficiency. Results indicate that ChatGPT consistently achieves the most efficient solutions while maintaining high accuracy, often matching the fastest human submissions. Gemini and DeepSeek also demonstrate strong accuracy but tend to produce less optimized code in computationally demanding cases. These findings contribute to understanding how current LLMs can address structured problem-solving tasks within educational environments.

Ähnliche Arbeiten

Autoren

Institutionen

Themen

Artificial Intelligence in Healthcare and EducationIntelligent Tutoring Systems and Adaptive LearningTeaching and Learning Programming
Volltext beim Verlag öffnen