Learning on the border: Active learning in imbalanced data classification
Date
2007-10-06
Author
Ertekin Bolelli, Şeyda
Bottou, Leon
Giles, C Lee
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
This paper is concerned with the class imbalance problem, which is known to hinder the learning performance of classification algorithms. The problem occurs when there are significantly fewer observations of the target concept. Various real-world classification tasks, such as medical diagnosis, text categorization, and fraud detection, suffer from this phenomenon. Standard machine learning algorithms yield better prediction performance with balanced datasets. In this paper, we demonstrate that active learning is capable of solving the class imbalance problem by providing the learner with more balanced classes. We also propose an efficient way of selecting informative instances from a smaller pool of samples for active learning, which does not necessitate a search through the entire dataset. The proposed method yields an efficient querying system and allows active learning to be applied to very large datasets. Our experimental results show that, with an early stopping criterion, active learning achieves a fast solution with competitive prediction performance in imbalanced data classification.
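The small-pool selection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the learner here is a plain perceptron chosen only to keep the example dependency-free, the stopping rule is a fixed query budget rather than the paper's early stopping criterion, and all names (`select_from_small_pool`, `active_learn`, `pool_size`, `budget`) and default values are hypothetical. At each step a small random pool is drawn from the unlabeled data and the instance closest to the current decision boundary is queried, so no pass over the entire dataset is needed.

```python
import random


def decision_value(w, b, x):
    """Signed score of x under a linear model; |score| grows with distance from the boundary."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def select_from_small_pool(w, b, X, unlabeled, pool_size, rng):
    """Draw a small random pool of unlabeled indices and return the one
    whose instance lies closest to the current decision boundary."""
    candidates = rng.sample(unlabeled, min(pool_size, len(unlabeled)))
    return min(candidates, key=lambda i: abs(decision_value(w, b, X[i])))


def perceptron_update(w, b, x, y, lr=0.1):
    """One mistake-driven update. Assumption: a stand-in learner, not the paper's."""
    if y * decision_value(w, b, x) <= 0:
        w = [wi + lr * y * xi for wi, xi in zip(w, x)]
        b = b + lr * y
    return w, b


def active_learn(X, y, pool_size=20, budget=50, seed=0):
    """Query up to `budget` labels (labels in {-1, +1}), choosing each from a small random pool.
    Assumption: a fixed budget replaces the paper's early stopping criterion."""
    rng = random.Random(seed)
    unlabeled = list(range(len(X)))
    w, b = [0.0] * len(X[0]), 0.0
    queried = []
    for _ in range(min(budget, len(X))):
        i = select_from_small_pool(w, b, X, unlabeled, pool_size, rng)
        unlabeled.remove(i)
        queried.append(i)
        w, b = perceptron_update(w, b, X[i], y[i])  # the label y[i] is revealed only here
    return w, b, queried
```

Each query costs `pool_size` score evaluations regardless of dataset size, which is what makes this style of selection practical on very large datasets. Comparing the class ratio among the `queried` indices with the ratio in the full dataset is one way to inspect whether the queried set is more balanced on a given problem.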
URI
https://hdl.handle.net/11511/69499
DOI
https://doi.org/10.1145/1321440.1321461
Collections
Department of Computer Engineering, Conference / Seminar
Suggestions
Learning by Automatic Option Discovery from Conditionally Terminating Sequences
Girgin, Sertan; Polat, Faruk; Alhajj, Reda (2006-08-28)
This paper proposes a novel approach to discover options in the form of conditionally terminating sequences, and shows how they can be integrated into reinforcement learning framework to improve the learning performance. The method utilizes stored histories of possible optimal policies and constructs a specialized tree structure online in order to identify action sequences which are used frequently together with states that are visited during the execution of such sequences. The tree is then used to implici...
Student perceptions on learning by design method in web-based learning environment : a case study
Akman, Evrim; Karaaslan, Hasan; Department of Computer Education and Instructional Technology (2010)
The purpose of this study was to investigate the perceptions of students in an implementation of “Learning by Design” method through a web based learning environment. The information gathered from the students enrolled in the undergraduate course; “Foundations of Distance Education” in 2009 Summer School and 2009-2010 Fall Semesters was evaluated. The course was given in blended form, i.e. face to face lessons and online instructional activities were performed together. In the web based part of the course, ...
Prospective Middle School Mathematics Teachers' Covariational Reasoning for Interpreting Dynamic Events During Peer Interactions
Yemen-Karpuzcu, Secil; Ulusoy, Fadime; Işıksal Bostan, Mine (2017-01-01)
This study investigated the covariational reasoning abilities of prospective middle school mathematics teachers in a task about dynamic functional events involving two simultaneously changing quantities in an individual process and also in a peer interaction process. The focus was the ways in which prospective teachers' covariational reasoning abilities re-emerge in the peer interaction process in excess of their covariational reasoning. The data sources were taken from the individual written responses of p...
Learning Time-Vertex Dictionaries for Estimating Time-Varying Graph Signals
Acar, Abdullah Burak; Vural, Elif (2022-01-01)
In this work, we study the problem of learning time-vertex dictionaries for the modeling and estimation of time-varying graph signals. We consider a setting with a collection of partially observed time-varying graph signals, and propose a solution for the estimation of the missing signal observations by learning time-vertex dictionaries from the available observations. We adopt a time-vertex dictionary model defined through a set of joint time-vertex spectral kernels, each of which captures a different spec...
Adaptive Oversampling for Imbalanced Data Classification
Ertekin Bolelli, Şeyda (2013-01-01)
Data imbalance is known to significantly hinder the generalization performance of supervised learning algorithms. A common strategy to overcome this challenge is synthetic oversampling, where synthetic minority class examples are generated to balance the distribution between the examples of the majority and minority classes. We present a novel adaptive oversampling algorithm, Virtual, that combines the benefits of oversampling and active learning. Unlike traditional resampling methods which require preproce...
Citation Formats
IEEE
Ş. Ertekin Bolelli, L. Bottou, and C. L. Giles, “Learning on the border: Active learning in imbalanced data classification,” 2007. [Online]. Available: https://hdl.handle.net/11511/69499.