EMDD-RL: faster subgoal identification with diverse density in reinforcement learning

Date

2021-01-15

Author

Sunel, Saim

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

The Diverse Density (DD) algorithm is a well-known multiple instance learning method that is also known to be effective at automatically identifying subgoals and improving Reinforcement Learning (RL). Expectation-Maximization Diverse Density (EMDD) improves on DD in both speed and accuracy. This study adapts EMDD to automatic subgoal identification for RL; the resulting method, named EMDD-RL, is shown to run significantly faster (3 to 10 times) than its DD-based predecessor without sacrificing solution quality. Its performance is demonstrated empirically through extensive experimentation, accompanied by a discussion of how the EMDD hyperparameters affect the results.
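To make the idea behind DD-based subgoal identification concrete, the following is a minimal sketch of the standard noisy-or diverse density score applied to toy trajectories. It is not the thesis's implementation and it illustrates plain DD only, not the EM variant. The function names, the Gaussian-like instance model, the `scale` parameter, and the toy grid data are all assumptions made for illustration: successful trajectories are treated as positive bags, unsuccessful ones as negative bags, and each visited state is an instance.

```python
import math

def instance_prob(instance, concept, scale=1.0):
    """Assumed Gaussian-like probability that `instance` matches `concept`."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(instance, concept))
    return math.exp(-scale * sq_dist)

def diverse_density(concept, positive_bags, negative_bags, scale=1.0):
    """Noisy-or diverse density of a candidate concept (hypothetical helper)."""
    dd = 1.0
    # Each positive bag should contain at least one instance near the concept.
    for bag in positive_bags:
        miss = 1.0
        for inst in bag:
            miss *= 1.0 - instance_prob(inst, concept, scale)
        dd *= 1.0 - miss
    # No instance of any negative bag should be near the concept.
    for bag in negative_bags:
        for inst in bag:
            dd *= 1.0 - instance_prob(inst, concept, scale)
    return dd

def best_subgoal(candidates, positive_bags, negative_bags, scale=1.0):
    """Return the candidate state with the highest diverse density."""
    return max(
        candidates,
        key=lambda c: diverse_density(c, positive_bags, negative_bags, scale),
    )

# Toy data (assumed): every successful trajectory passes through state (2, 2).
POSITIVE = [
    [(0, 0), (1, 1), (2, 2), (3, 3)],
    [(0, 1), (1, 2), (2, 2), (3, 2)],
    [(1, 0), (2, 1), (2, 2), (2, 3)],
]
NEGATIVE = [
    [(0, 0), (0, 1), (0, 2)],
    [(1, 0), (0, 0), (0, 1)],
]
# Candidate subgoals: every state seen in a successful trajectory.
CANDIDATES = sorted({state for bag in POSITIVE for state in bag})

if __name__ == "__main__":
    print(best_subgoal(CANDIDATES, POSITIVE, NEGATIVE))
```

In this toy setting the state shared by all successful trajectories and absent from the unsuccessful ones receives the highest score, which is the intuition behind treating high diverse density states as subgoal candidates. The exhaustive scan over candidates shown here is the simplest possible search and says nothing about how EMDD or EMDD-RL obtain their speedup.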

Subject Keywords

Subgoal identification, Expectation-Maximization Diverse Density, Diverse Density, Reinforcement Learning

URI

https://hdl.handle.net/11511/89597

Collections

Graduate School of Natural and Applied Sciences, Thesis

Citation Formats

S. Sunel, “EMDD-RL: faster subgoal identification with diverse density in reinforcement learning,” M.S. - Master of Science, Middle East Technical University, 2021.