This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Playing Atari with Deep Reinforcement Learning
5,113
Citations
7
Authors
2013
Year
Abstract
We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no adjustment of the architecture or learning algorithm. We find that it outperforms all previous approaches on six of the games and surpasses a human expert on three of them.
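The abstract says only that the network is trained with "a variant of Q-learning" and outputs a value function estimating future rewards. As a rough illustration of that idea, the sketch below computes standard one-step Q-learning targets, which are the values such a network could be regressed toward. It is not the paper's exact variant; the function name `q_learning_targets`, its parameters, and the default discount factor `gamma` are assumptions made for this example.

```python
def q_learning_targets(rewards, next_q_values, terminal, gamma=0.99):
    """Compute one-step Q-learning targets: r + gamma * max_a' Q(s', a').

    rewards:       list of rewards, one per transition
    next_q_values: list of lists, estimated action values for each next state
    terminal:      list of bools, True where the episode ended
    gamma:         discount factor (hypothetical default, not from the source)
    """
    targets = []
    for r, q_next, done in zip(rewards, next_q_values, terminal):
        # Do not bootstrap from the next state if the episode has ended.
        bootstrap = 0.0 if done else gamma * max(q_next)
        targets.append(r + bootstrap)
    return targets


# Two transitions: the first continues, the second ends its episode.
example = q_learning_targets(
    rewards=[1.0, 0.0],
    next_q_values=[[0.5, 2.0], [1.0, 3.0]],
    terminal=[False, True],
    gamma=0.5,
)
```

In a deep variant, `next_q_values` would come from a forward pass of the convolutional network on the next frames, and the network's predictions for the actions actually taken would be fit to these targets.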
Related works
Adaptation in Natural and Artificial Systems
1992 · 35,517 citations
Reinforcement Learning: An Introduction
1998 · 26,784 citations
Reinforcement Learning: An Introduction
2005 · 25,701 citations
Deep learning in neural networks: An overview
2014 · 17,722 citations
Diagnosing Non-Intermittent Anomalies in Reinforcement Learning Policy Executions (Short Paper)
2017 · 11,235 citations