Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
DeepSeek vs. ChatGPT: prospects and challenges
25
Zitationen
10
Autoren
2025
Jahr
Abstract
DeepSeek has introduced its recent model DeepSeek-R1, showing divergence from OpenAI's ChatGPT, suggesting an open-source alternative to users. This paper analyzes the architecture of DeepSeek-R1, mainly adopting rule-based reinforcement learning (RL) without preliminary supervised fine-tuning (SFT), which has shown better efficiency. By integrating multi-stage training along with cold-start data usage before RL, the model can achieve meaningful performance in reasoning tasks along with reward modeling optimizing training process. DeepSeek shows its strength in technical, reasoning tasks, able to show its decision-making process through open source whereas ChatGPT shows its strength on general tasks and areas requiring creativeness. Despite the groundbreaking developments of both models, there is room for improvement in AI landscape and matters to be handled such as quality of data, black box problems, privacy management, and job displacement. This paper suggests the future of AI, expecting better performance in multi-modal tasks, enhancing its effectiveness in handling larger data sets, enabling users with improved AI landscapes and utility.
Ähnliche Arbeiten
Autoren
Institutionen
- Yeungnam University College
- Yeungnam University(KR)
- Barrow Neurological Institute(US)
- St. Joseph's Hospital and Medical Center(US)
- Chulabhorn Hospital(TH)
- Massachusetts Institute of Technology(US)
- University Hospitals of Cleveland(US)
- Mayo Clinic in Arizona(US)
- Essen University Hospital(DE)
- New York University(US)
- Icahn School of Medicine at Mount Sinai(US)
- University Hospital Münster(DE)
- Witten/Herdecke University(DE)