TD-gammon revisited: integrating invalid actions and dice factor in continuous action and observation space

Usta, Engin Deniz
After TD-Gammon's success in 1991, the interest in game-playing agents has risen significantly. With the developments in Deep Learning and emulations for older games have been created, human-level control for Atari games has been achieved and Deep Reinforcement Learning has proven itself to be a success. However, the ancestor of DRL, TD-Gammon, and its game Backgammon got out of sight, because of the fact that Backgammon's actions are much more complex than other games (most of the Atari games has 2 or 4 different actions), the huge action space has much invalid actions, and there is a dice factor which involves stochasticity. Last but not least, the professional level in Backgammon has been achieved a long time ago. In this thesis, the latest methods in DRL will be tested against its ancestor game, Backgammon, while trying to teach how to select valid moves and considering the dice factor.


Transformative Impacts of High-Speed Railway (HSR) Stations on Urban Space: The Case of Ankara HSR Station
Songulen, Nazli; Memluk, Nihan Oya (2014-05-11)
Urban transformation became one of the premises on the urban agenda in Turkey, which was accelerated with the introduction of the neo-liberal policies in the 1980s and became apparent in the 2000s as rapid and massive urban transformations mainly held by macro-scaled public investments. In this regard, high-speed railway (HSR) systems, newly introduced to the Turkish case, appear as one of these macro-scale public investments. In this context, the European literature on HSR development projects reveals that...
Cross-cultural Perspectives of Successful Aging: Young Turks and Europeans
Cosco, Theodore D.; Brehme, David; Grigoruta, Nora; Kaufmann, Lisa-Katrin; Lemsalu, Liis; Meex, Ruth; Schuurmans, Angela A. T.; Sener, Neslihan; Stephan, Blossom C. M.; Brayne, Carol (2015-11-02)
Successful aging (SA) has been conceptualized in a number of ways. Despite increasing research into how laypersons define SA, few studies capturing lay perspectives of SA in younger cohorts and in non-English speaking countries have been undertaken. The current study examines cross-cultural perspectives of SA in young (aged 18-35), lay adults from a variety of continental European countries and Turkey. Participants were recruited via snowball sampling from social network sites and invited to participate in ...
Analyzing RD activities of foreign enterprises in emerging economies. Lessons from Turkey
Erdil, Erkan; Pamukcu, Mehmet Teoman (Science And Technology Policies Research Center, Middle East Technical University (Ankara, Turkey), 2011-01-01)
Emerging economies have played an important role in the internationalization of R&D activities at least since the 1990s. Turkey, an emerging economy and at same time an accession country to the European Union which signed a Customs Union Agreement with the EU already in 1995, is no exception. In-depth face-to-face semi-structured interviews were conducted with R&D directors of 26 multinational companies operating in Turkey –with headquarters located in France, Germany, Italy, Japan, Switzerlandand USA- in t...
Labor market transformation in technologically renewed firms: printing and publishing sector in case of Istanbul
Erdoganaras, Fatma Çetinkaya; Ersoy, Melih; Department of City and Regional Planning (2002)
In both in the World as a whole and Turkey, the printing and publishing sector is one of the sectors which has experienced a restructuring process after 1980. This was largely realized depending on new technology as a result of the recent developments in transportation, communication and telecommunication technologies and particularly intensive use of the computer aided production system. The introduction of new technologies brings about several impacts on a sector depending on social, economic, political a...
Looking at Kazakhstan’s Higher Education Landscape: From Transition to Transformation Between 1920 and 2015
Ahn, Elise S.; Dixon, John; Chekmareva, Larissa (PALGRAVE, HOUNDMILLS, BASINGSTOKE RG21 6XS, ENGLAND, 2018)
Since independence in 1991, the Kazakhstani government has been aggressively pursuing higher education reform. This has led to the passing of a number of education-related laws and the adaptation of different policies and practices in order to facilitate the government’s initial priority of transitioning to a market economy and more recently, to achieve its goal of becoming one of the world’s top 30 economies by the year 2050. This chapter provides an overview of Kazakhstan’s Soviet higher education legacy ...
Citation Formats
E. D. Usta, “TD-gammon revisited: integrating invalid actions and dice factor in continuous action and observation space,” M.S. - Master of Science, Middle East Technical University, 2018.