Show/Hide Menu
Hide/Show Apps
anonymousUser
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Frequently Asked Questions
Frequently Asked Questions
Communities & Collections
Communities & Collections
TD-gammon revisited: integrating invalid actions and dice factor in continuous action and observation space
Download
index.pdf
Date
2018
Author
Usta, Engin Deniz
Metadata
Show full item record
Item Usage Stats
8
views
20
downloads
After TD-Gammon's success in 1991, the interest in game-playing agents has risen significantly. With the developments in Deep Learning and emulations for older games have been created, human-level control for Atari games has been achieved and Deep Reinforcement Learning has proven itself to be a success. However, the ancestor of DRL, TD-Gammon, and its game Backgammon got out of sight, because of the fact that Backgammon's actions are much more complex than other games (most of the Atari games has 2 or 4 different actions), the huge action space has much invalid actions, and there is a dice factor which involves stochasticity. Last but not least, the professional level in Backgammon has been achieved a long time ago. In this thesis, the latest methods in DRL will be tested against its ancestor game, Backgammon, while trying to teach how to select valid moves and considering the dice factor.
Subject Keywords
Backgammon.
,
Computer games.
,
Machine learning.
,
Artificial intelligence.
URI
http://etd.lib.metu.edu.tr/upload/12622619/index.pdf
https://hdl.handle.net/11511/27576
Collections
Graduate School of Natural and Applied Sciences, Thesis