A Study on particle filter based audio-visual face tracking on the AV16.3 dataset

Download
2016
Yılmaz, Yunus Emre
People tracking has received considerable attention as a research field recently. Since, there are a wide range of application areas that requires to track single or multi target people in different environments with various scenarios using a variety of sensors. In this kind of tracking scenarios, usage of audio and visual information together is commonly preferred method, because these cues are mostly exist in the tracking environment and they contain complementary information about the targets. Our work focuses on particle filter based Bayesian tracking method that fuses location estimates obtained from audio and video data separately for indoor and crowded environments. Surveillance, video-conferencing and security are main examples of application areas for this kind of tracking scenario. In our work, particle filter based trackers are implemented with number of different configurations in order to nvestigate possible gains from including audio data to the tracking problem instead using only visual data. In these implementations, comprehensive experiments are conducted using the AV16.3 dataset. Usage of this dataset makes possible to compare our results with other works from the literature. Also, this dataset covers a variety of tracking situations (e.g. occlusions and rapid movements of persons) which can be encountered in realistic scenarios, making the results more useful. Our results indicates that no significant gains are possible when multiple cameras are used except when there are serious optical occlusions.

Suggestions

A survey on location estimation techniques for events detected in Twitter
Ozdikis, Ozer; Oğuztüzün, Mehmet Halit S.; Karagöz, Pınar (Springer Science and Business Media LLC, 2017-08-01)
Detection of events using voluntarily generated content in microblogs has been the objective of numerous recent studies. One essential challenge tackled in these studies is estimating the locations of events. In this paper, we review the state-of-the-art location estimation techniques used in the localization of events detected in microblogs, particularly in Twitter, which is one of the most popular microblogging platforms worldwide. We analyze these techniques with respect to the targeted event type, granu...
A new algorithm for automatic road network extraction in multispectral satellite images
Karaman, Ersin; Çınar, Umut; Gedik, Ekin; Çetin, Yasemin; Halıcı, Uğur (2012-05-09)
The aim of this study is to develop automatic road extraction algorithm in satellite images. As roads have different width and surface material characteristics in urban and rural areas, a modular approach for road extraction algorithm is desired. In this study, edge detection, segmentation, clustering and vegetation and land cover analyses are used. In order to combine the results of different methods, a score map based on segmentation analysis is constructed. Quantitative and visual results show that this ...
A Shadow based trainable method for building detection in satellite images
Dikmen, Mehmet; Halıcı, Uğur; Department of Geodetic and Geographical Information Technologies (2014)
The purpose of this thesis is to develop a supervised building detection and extraction algorithm with a shadow based learning method for high-resolution satellite images. First, shadow segments are identified on an over-segmented image, and then neighboring shadow segments are merged by assuming that they are cast by a single building. Next, these shadow regions are used to detect the candidate regions where buildings most likely occur. Together with this information, distance to shadows towards illuminati...
Range parameterized bearings only tracking using particle filter
Arslan, Ali Erkin; Demirekler, Mübeccel; Department of Electrical and Electronics Engineering (2012)
In this study, accurate target tracking for bearings-only tracking problem is investigated. A new tracking filter for this nonlinear problem is designed where both range parameterization and Rao-Blackwellized (marginalized) particle filtering techniques are used in a Gaussian mixture formulation to track both constant velocity and maneuvering targets. The idea of using target turn rate in the state equation in such a way that marginalization is possible is elaborated. Addition to nonlinear nature, unobserva...
Design and implementation of a novel visual analysis system for image clasiffication
Altintakan, Ümit Lütfü; Yazıcı, Adnan; Körpeoğlu, İbrahim; Department of Computer Engineering (2013)
Possibilities offered by the technology to create, share and disseminate image and video data have resulted in a rapid increase in the available visual data. However, the data is useless unless it is effectively accessed, which necessitates the semantic analysis of visual data. In this dissertation, we present a novel visual analysis system along with its application to image classification problem. We aim to address the challenges in the area originated from the semantic gap, and to facilitate the research...
Citation Formats
Y. E. Yılmaz, “A Study on particle filter based audio-visual face tracking on the AV16.3 dataset,” M.S. - Master of Science, Middle East Technical University, 2016.