Comparison of Cuboid and Tracklet Features for Action Recognition on Surveillance Videos

Bayram, Ulya
Ulusoy, İlkay
Cicekli, Nihan Kesim
For recognition of human actions in surveillance videos, action recognition methods in literature are analyzed and coherent feature extraction methods that are promising for success in such videos are identified. Based on local methods, most popular two feature extraction methods (Dollar's "cuboid" feature definition and Raptis and Soatto's "tracklet" feature definition) are tested and compared. Both methods were classified by different methods in their original applications. In order to obtain a more fair comparison both methods are classified by using the same classification method. In addition, as it is more realistic for recognition of real videos, two most popular datasets KTH and Weizmann are classified by splitting method. According to the test results, convenience of tracklet features over other methods for action recognition in real surveillance videos is proven to be successful.


Comparison of histograms of oriented optical flow based action recognition methods
Erciş, Fırat; Ulusoy, İlkay; Department of Electrical and Electronics Engineering (2012)
In the task of human action recognition in uncontrolled video, motion features are used widely in order to achieve subject and appearence invariance. We implemented 3 Histograms of Oriented Optical Flow based method which have a common motion feature extraction phase. We compute an optical flow field over each frame of the video. Then those flow vectors are histogrammed due to angle values to represent each frame with a histogram. In order to capture local motions, The bounding box of the subject is divided...
Comparison of deep networks for gesture recognition
Sofu, Buğra; Ulusoy, İlkay; Department of Electrical and Electronics Engineering (2021-9-06)
Gesture recognition is an important problem and has been studied over the years especially in the fields such as surveillance systems, analysis of human behavior, robotics etc. In this thesis, different state of art algorithms, which are based on deep learning, were implemented and compared considering model complexities and accuracies. Also, a new approach was proposed and compared with them. Tested algorithms can be classified into two main categories: hybrid approaches, which use CNN and LSTM architectu...
Human action recognition for various input characteristics using 3 dimensional residual networks
Tüfekci, Gülin; Ulusoy, İlkay; Department of Electrical and Electronics Engineering (2019)
Action recognition using deep neural networks is a far-reaching research area which has been commonly utilized in applications such as statistical analysis of human behavior, detecting abnormalities using surveillance cameras and robotic systems. Previous studies have been performing researches to propose new machine learning algorithms and deep network architectures to obtain higher recognition accuracy levels. Instead of suggesting a network resulting in small accuracy gain, this thesis focuses on evaluat...
Feature Detection and Tracking for Extraction of Crowd Dynamics
Gunduz, Ayse Elvan; Temizel, Alptekin; Temizel, Tugba Taskaya (2013-01-01)
Extraction of crowd dynamics from video is the fundamental step for automatic detection of abnormal events. However, it is difficult to obtain sufficient performance with object tracking due to occlusions and insufficient resolution of the objects in the scene. As a result, optical flow or feature tracking methods are preferred in crowd videos. These applications also require algorithms to work in real-time. In this work, we investigated the applicability and performance of feature detection and tracking al...
Optical flow based video frame segmentation and segment classification
Akpınar, Samet; Alpaslan, Ferda Nur; Department of Computer Engineering (2018)
Video information retrieval is a field of multimedia research enabling us to extract desired semantic information from video data. In content-based video information retrieval, visual content obtained from video scenes is utilized. For developing methods to cope with content-based video information retrieval in terms of temporal concepts such as action, event, etc., representation of temporal information becomes critical. In this thesis, action detection is tackled based on a temporal video representation m...
Citation Formats
U. Bayram, İ. Ulusoy, and N. K. Cicekli, “Comparison of Cuboid and Tracklet Features for Action Recognition on Surveillance Videos,” 2013, Accessed: 00, 2020. [Online]. Available: