Oliver Richter: Catalogue data in Spring Semester 2020
|Name||Dr. Oliver Richter|
Inst. f. Techn. Informatik u. K.
ETH Zürich, ETZ G 93
|Department||Information Technology and Electrical Engineering|
|227-0559-00L||Seminar in Deep Reinforcement Learning|
Number of participants limited to 25.
|2 credits||2S||R. Wattenhofer, O. Richter|
|Abstract||In this seminar, participating students present and discuss recent research papers in the area of deep reinforcement learning. The seminar starts with two introductory lessons covering the basic concepts. Alongside the seminar, a programming challenge is posed in which students can take part to improve their grade.|
|Learning objective||Since Google DeepMind presented the Deep Q-Network (DQN) algorithm in 2015, which could play Atari 2600 games at a superhuman level, the field of deep reinforcement learning has gained a lot of traction. It sparked media attention with AlphaGo and AlphaZero and has become one of the most prominent research areas. Yet many research papers in the area come from one of two sources: Google DeepMind or OpenAI. In this seminar we aim to give students an in-depth view of current advances in the area by discussing recent papers as well as current issues and difficulties surrounding deep reinforcement learning.|
|Content||Two introductory lessons on Q-learning and policy gradient methods. Afterwards, participating students present recent papers. For details, see: www.disco.ethz.ch/courses.html|
|Lecture notes||Slides of the presentations will be made available.|
|Literature||OpenAI Spinning Up course (https://spinningup.openai.com/en/latest/) plus selected papers.
The paper selection can be found at www.disco.ethz.ch/courses.html.
|Prerequisites / Notice||Students are expected to have prior knowledge of and interest in machine learning and deep learning, for instance from having attended appropriate courses.|