EMDD-RL: faster subgoal identification with diverse density in reinforcement learning
Date
2021-01-15
Author
Sunel, Saim
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
The Diverse Density (DD) algorithm is a well-known multiple instance learning method that is also known to be effective at automatically identifying subgoals and thereby improving Reinforcement Learning (RL). Expectation-Maximization Diverse Density (EMDD) improves on DD in both speed and accuracy. This study adapts EMDD to automatic subgoal identification for RL. The resulting method, named EMDD-RL, is shown to run significantly faster (3 to 10 times) than its DD-based predecessor without sacrificing solution quality. Its performance is demonstrated empirically through extensive experimentation, together with a discussion of how the EMDD hyperparameters affect the results.
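As a rough illustration of the idea behind DD-based subgoal identification, the sketch below scores grid states with the standard noisy-or diverse density and then applies one simplified EM-style step. It is not the thesis's implementation. The grid trajectories, the `scale` parameter, and all function names are hypothetical. Treating successful trajectories as positive bags and unsuccessful ones as negative bags is an assumption made here for illustration. The M-step searches over discrete candidate states, which is a simplification swapped in for the continuous optimization normally used in EMDD.

```python
import math

def instance_prob(state, target, scale=1.0):
    """Gaussian-like probability that `state` belongs to the concept at `target`."""
    dist_sq = sum((s - t) ** 2 for s, t in zip(state, target))
    return math.exp(-scale * dist_sq)

def diverse_density(target, positive_bags, negative_bags, scale=1.0):
    """Noisy-or diverse density of `target` given labeled bags of states."""
    dd = 1.0
    for bag in positive_bags:
        # A positive bag supports the target if at least one instance is close.
        miss = math.prod(1.0 - instance_prob(s, target, scale) for s in bag)
        dd *= 1.0 - miss
    for bag in negative_bags:
        # A negative bag supports the target only if every instance is far.
        dd *= math.prod(1.0 - instance_prob(s, target, scale) for s in bag)
    return dd

def candidate_states(positive_bags):
    """Candidate subgoals: every state seen in a positive bag (an assumption)."""
    return {s for bag in positive_bags for s in bag}

def best_subgoal(positive_bags, negative_bags, scale=1.0):
    """Exhaustive DD search over all candidate states."""
    return max(
        candidate_states(positive_bags),
        key=lambda t: diverse_density(t, positive_bags, negative_bags, scale),
    )

def em_style_step(hypothesis, positive_bags, negative_bags, scale=1.0):
    """One simplified EM-style iteration starting from `hypothesis`."""
    def pick(bag):
        # E-step: keep the single instance most likely under the hypothesis.
        return [max(bag, key=lambda s: instance_prob(s, hypothesis, scale))]

    reduced_pos = [pick(bag) for bag in positive_bags]
    reduced_neg = [pick(bag) for bag in negative_bags]
    # M-step (simplified): maximize DD over the reduced single-instance bags.
    return max(
        candidate_states(positive_bags),
        key=lambda t: diverse_density(t, reduced_pos, reduced_neg, scale),
    )

# Hypothetical data: every successful trajectory passes through state (2, 2).
positive_bags = [
    [(0, 0), (1, 1), (2, 2), (3, 3)],
    [(0, 4), (1, 3), (2, 2), (3, 1)],
    [(0, 2), (1, 2), (2, 2), (3, 2)],
]
negative_bags = [
    [(0, 0), (0, 1), (0, 2)],
    [(1, 0), (0, 0), (1, 1)],
]

subgoal = best_subgoal(positive_bags, negative_bags)
refined = em_style_step((1, 1), positive_bags, negative_bags)
```

In this toy data, `(2, 2)` is the only state shared by all positive bags and absent from the negative ones, so the exhaustive search selects it. The EM-style step evaluates each candidate against one instance per bag instead of every instance, which hints at where a speedup over plain DD can come from.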
Subject Keywords
Subgoal identification, Expectation-Maximization Diverse Density, Diverse Density, Reinforcement Learning
URI
https://hdl.handle.net/11511/89597
Collections
Graduate School of Natural and Applied Sciences, Thesis
Citation
S. Sunel, “EMDD-RL: faster subgoal identification with diverse density in reinforcement learning,” M.S. - Master of Science, Middle East Technical University, 2021.