Learning a partially-observable card game hearts using reinforcement learning

Download
2020
Demirdöver, Buğra Kaan
Artificial intelligence and machine learning are widely popular in many sectors. Oneof them is the gaming industry. With many different scenarios, different types, gamesare perfect for machine learning and artificial intelligence. This study aims to developlearning agents to play the game of Hearts. Hearts is one of the most popular cardgames in the world. It is a trick based, imperfect information card game. In additionto having a huge state space, hearts offers many extra challenges due to the nature ofthe game. These challenges are divided into smaller parts where learning is easier andassigned to different learning agents. These agents use temporal difference learningto learn assigned parts.

Suggestions

Adapted Infinite Kernel Learning by Multi-Local Algorithm
Akyuz, Sureyya Ozogur; Ustunkar, Gurkan; Weber, Gerhard Wilhelm (2016-05-01)
The interplay of machine learning (ML) and optimization methods is an emerging field of artificial intelligence. Both ML and optimization are concerned with modeling of systems related to real-world problems. Parameter selection for classification models is an important task for ML algorithms. In statistical learning theory, cross-validation (CV) which is the most well-known model selection method can be very time consuming for large data sets. One of the recent model selection techniques developed for supp...
Deep learning approach for laboratory mice grimace scaling
Eral, Mustafa; Halıcı, Uğur; Department of Electrical and Electronics Engineering (2016)
Deep learning is extremely attractive research topic in pattern recognition and machine learning areas. Applications in speech recognition, natural language processing, and machine vision fields gained huge acceleration in performance by employing deep learning. In this thesis, deep learning is used for medical purposes in order to scale pain degree of drug stimulated mice by examining facial grimace. For this purpose each frame in the videos in the training set were scaled manually by experts according to ...
Reinforcement Learning in Card Game Environments Using Monte Carlo Methods and Artificial Neural Networks
Baykal, Ömer; Alpaslan, Ferda Nur (2019-09-01)
Artificial intelligence has wide range of application areas and games are one of the important ones. There are many applications of artificial intelligence methods in game environments. It is very common for game environments to include intelligent agents. Having intelligent agents makes a game more entertaining and challenging for its players. Reinforcement learning methods can be applied to develop artificial intelligence agents that learn to play a game by themselves without any supervision and can play ...
Generation and modification of 3D models with deep neural networks
Öngün, Cihan; Temizel, Alptekin; Department of Information Systems (2021-9)
Artificial intelligence (AI) and particularly deep neural networks (DNN) have become very hot topics in the recent years and they have been shown to be successful in problems such as detection, recognition and segmentation. More recently DNNs have started to be popular in data generation problems by the invention of Generative Adversarial Networks (GAN). Using GANs, various types of data such as audio, image or 3D models could be generated. In this thesis, we aim to propose a system that creates artificial...
Competing labels: a heuristic approach to pseudo-labeling in deep semi-supervised learning
Bayrak, Hamdi Burak; Ertekin Bolelli, Şeyda; Yücel, Hamdullah; Department of Scientific Computing (2022-2-10)
Semi-supervised learning is one of the dominantly utilized approaches to reduce the reliance of deep learning models on large-scale labeled data. One mostly used method of this approach is pseudo-labeling. However, pseudo-labeling, especially its originally proposed form tends to remarkably suffer from noisy training when the assigned labels are false. In order to mitigate this problem, in our work, we investigate the gradient sent to the neural network and propose a heuristic method, called competing label...
Citation Formats
B. K. Demirdöver, “Learning a partially-observable card game hearts using reinforcement learning,” Thesis (M.S.) -- Graduate School of Natural and Applied Sciences. Computer Engineering., Middle East Technical University, 2020.