Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
Landmark Based Reward Shaping in Reinforcement Learning with Hidden States
Date
2019-01-01
Author
Demir, Alper
Cilden, Erkin
Polat, Faruk
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
51
views
0
downloads
Cite This
While most of the work on reward shaping focuses on fully observable problems, there are very few studies that couple reward shaping with partial observability. Moreover, for problems with hidden states, where there is no prior information about the underlying states, reward shaping opportunities are unexplored. In this paper, we show that landmarks can be used to shape the rewards in reinforcement learning with hidden states. Proposed approach is empirically shown to improve the learning performance in terms of speed and quality.
Subject Keywords
Computing methodologies
,
Machine learning
,
Learning paradigms
,
Machine learning approaches
,
Reinforcement learning
,
Partially-observable Markov decision processes results
URI
https://hdl.handle.net/11511/53047
Conference Name
AAMAS '19: International Conference on Autonomous Agents and Multiagent Systems
Collections
Department of Computer Engineering, Conference / Seminar
Suggestions
OpenMETU
Core
Tracing Teacher Learning through Shifts in Discourses: The Case of a Mathematics Teacher
Ilhan, Emine Gul Celebi; Erbaş, Ayhan Kürşat (2017-06-01)
This study presents a methodology for investigating teacher learning in and from practice based on discourses that are in constant flux and transformation. Conceptualizing teacher learning as a frame of meaning based on knowing and doing discourses, the ideas are illustrated through data collected from a secondary mathematics teacher conducting an inquiry of self-practice. Narrative analysis of the data from the teacher interviews was conducted along with classroom observations of the teacher's mathematical...
Effect of human prior knowledge on game success and comparison with reinforcement learning
Hasanoğlu, Mert.; Çakır, Murat Perit; Department of Cognitive Sciences (2019)
This study aims to find out the effect of prior knowledge on the success of humans in a non-rewarding game environment, and then to compare human performance with a reinforcement learning method in an effort to observe to what extent this method can be brought closer to human behavior and performance with the data obtained. For this purpose, different versions of a simple 2D game were used, and data were collected from 32 participants. At the end of the experiment, it is concluded that prior knowledge, such...
Investigation of undergraduate students' mental models about the quantization of physical observables
Didiş, Nilüfer; Eryılmaz, Ali; Erkoç, Şakir; Department of Secondary Science and Mathematics Education (2012)
The purpose of this research is to investigate undergraduate students’ mental models about the quantization of physical observables. The research was guided by ethnography, case study, and content analysis integrated to each other. It focused on second-year physics and physics education students, who were taking the Modern Physics course at the Department of Physics, at Middle East Technical University. Wide range of data was collected by interview, observation, test, diary, and other documents during 2008-...
Effectiveness of 5E learning cycle model on hihg school students' understanding of solubility equilibrium concept
Aydemir, Nurdane; Geban, Ömer; Department of Secondary Science and Mathematics Education (2012)
The purpose of this study was to investigate the effect of instruction based on 5E learning cycle model (LCI) compared to Traditional Instruction (TI) and gender on 11th grade students’ understanding of solubility equilibrium concept, students’ perceived motivation, use of learning strategies, and attitudes towards chemistry. There were 53 students in the experimental group instructed by the LCI and 56 students in the control group instructed by the TI. Solution Concept Test and Science Process Skills Test ...
Effectiveness of context based approach through 5E learning cycle model on students' understanding of chemical reactions and energy concepts and their motivation to learn chemistry
Çiğdemoğlu, Ceyhan; Geban, Ömer; Department of Secondary Science and Mathematics Education (2012)
The aim of study was to investigate the effect of context-based approach (CBA) through 5E learning cycle (LC) model over traditional instruction on students’ understanding, achievement, and chemical literacy on chemical reactions and energy concepts. The effect of instruction on students’ motivation to learn chemistry and the factors of motivation questionnaire were also explored. Additionally, the effect of gender difference was investigated. Six eleventh grade classes with 187 students taught by three tea...
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
A. Demir, E. Cilden, and F. Polat, “Landmark Based Reward Shaping in Reinforcement Learning with Hidden States,” presented at the AAMAS ’19: International Conference on Autonomous Agents and Multiagent Systems, Montreal QC Canada, 2019, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/53047.