Boosted multiple kernel learning for first-person activity recognition

Download
2017-09-02
Özkan, Fatih
Arabacı, Mehmet Ali
Sürer, Elif
Temizel, Alptekin
Activity recognition from first-person (ego-centric) videos has recently gained attention due to the increasing ubiquity of the wearable cameras. There has been a surge of efforts adapting existing feature descriptors and designing new descriptors for the first-person videos. An effective activity recognition system requires selection and use of complementary features and appropriate kernels for each feature. In this study, we propose a data-driven framework for first-person activity recognition which effectively selects and combines features and their respective kernels during the training. Our experimental results show that use of Multiple Kernel Learning (MKL) and Boosted MKL in first-person activity recognition problem exhibits improved results in comparison to the state-of-the-art. In addition, these techniques enable the expansion of the framework with new features in an efficient and convenient way.

Suggestions

Multiple kernel learning for first-person activity recognition
Özkan, Fatih; Temizel, Alptekin; Sürer, Elif; Department of Information Systems (2017)
First-person vision applications have recently gained increasing popularity because of advances in wearable camera technologies. In the literature, existing descriptors have been adapted to the first-person videos or new descriptors have been proposed. These descriptors have been used in a single-kernel method which ignores the relative importance of each descriptor. On the other hand, first-person videos have different characteristics as compared to third-person videos which are captured by static cameras....
Multi-modal Egocentric Activity Recognition using Audio-Visual Features
Arabacı, Mehmet Ali; Özkan, Fatih; Sürer, Elif; Jancovic, Peter; Temizel, Alptekin (2018-07-01)
Egocentric activity recognition in first-person videos has an increasing importance with a variety of applications such as lifelogging, summarization, assisted-living and activity tracking. Existing methods for this task are based on interpretation of various sensor information using pre-determined weights for each feature. In this work, we propose a new framework for egocentric activity recognition problem based on combining audio-visual features with multi-kernel learning (MKL) and multi-kernel boosting (...
Multi-modal egocentric activity recognition using multi-kernel learning
Arabaci, Mehmet Ali; Ozkan, Fatih; Sürer, Elif; Jancovic, Peter; Temizel, Alptekin (2020-04-28)
Existing methods for egocentric activity recognition are mostly based on extracting motion characteristics from videos. On the other hand, ubiquity of wearable sensors allow acquisition of information from different sources. Although the increase in sensor diversity brings out the need for adaptive fusion, most of the studies use pre-determined weights for each source. In addition, there are a limited number of studies making use of optical, audio and wearable sensors. In this work, we propose a new framewo...
Automatic tests for camera performance analysis
Hasarpa, Alican; Akar, Gözde; Department of Electrical and Electronics Engineering (2018)
The camera technology is consistently improving, with high definition, smartcameras being utilized all around the world. Because of the different quality of such cameras, the camera performance analysis plays a critical role for the end users in order to determine the real difference between the available alternatives. The image quality of a camera may be assessed visually using digitally generated test patterns in a controlled environment. The main purpose of this thesis is to automate this assessment and ...
Multi-frame knowledge based text enhancement for mobile phone captured videos
Ozarslan, Suleyman; Eren, Pekin Erhan (2014-02-05)
In this study, we explore automated text recognition and enhancement using mobile phone captured videos of store receipts. We propose a method which includes Optical Character Resolution ( OCR) enhanced by our proposed Row Based Multiple Frame Integration (RB-MFI), and Knowledge Based Correction (KBC) algorithms. In this method, first, the trained OCR engine is used for recognition; then, the RB-MFI algorithm is applied to the output of the OCR. The RB-MFI algorithm determines and combines the most accurate...
Citation Formats
F. Özkan, M. A. Arabacı, E. Sürer, and A. Temizel, “Boosted multiple kernel learning for first-person activity recognition,” 2017, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/29977.