Comparison of feature sets using multimedia translation

2003-01-01
Duygulu, P
Ozcanli, OC
Papernick, N
Feature selection is very important for many computer vision applications. However, it is hard to find a good measure for the comparison. In this study, feature sets are compared using the translation model of object recognition which is motivated by the availablity of large annotated data sets. Image regions are linked to words using a model which is inspired by machine translation. Word prediction performance is used to evaluate large numbers of images.

Suggestions

3D TRACKING OF PEOPLE WITH RAO-BLACKWELLIZED PARTICLE FILTERS
Topcu, Osman; Orguner, Umut; Alatan, Abdullah Aydın; ERCAN, ALİ ÖZER (2014-04-25)
Visual tracking has an important place among computer vision applications. Visual tracking with particle filters is a well-known methodology. The performance of particle filters is dependent on efficient sampling of the state space, which in turn, is dependent on number of particles. In this paper, Rao-Blackwell technique is applied to particle filters to improve sampling efficiency. Both algorithms are applied to people tracking problem. Under the same circumstances, the resulting algorithm is demonstrated...
Visual similarity for hdr images with applications to tone mapping
Aydınlılar, Merve; Akyüz, Ahmet Oğuz; Tarı, Zehra Sibel; Department of Computer Engineering (2021-2-15)
Assessing visual similarity between images is important for many computer vision applications. So far, investigations on visual similarity have been confined to low dynamic range images. However, recently, there is a growing interest to high dynamic range (HDR) imaging. In this thesis, the aim is to shed light on visual image similarity for HDR images by following an experimental approach. To this end, a user experiment is conducted through a novel web-based interface, in which the participants assess the p...
Visual object detection and tracking using local convolutional context features and recurrent neural networks
Kaya, Emre Can; Alatan, Abdullah Aydın; Department of Electrical and Electronics Engineering (2018)
Visual object detection and tracking are two major problems in computer vision which have important real-life application areas. During the last decade, Convolutional Neural Networks (CNNs) have received significant attention and outperformed methods that rely on handcrafted representations in both detection and tracking. On the other hand, Recurrent Neural Networks (RNNs) are commonly preferred for modeling sequential data such as video sequences. A novel convolutional context feature extension is introduc...
Edge strength functions as shape priors in image segmentation
Erdem, Erkut; Erdem, Aykut; Tarı, Zehra Sibel (2005-12-01)
Many applications of computer vision requires segmenting out of an object of interest from a given image. Motivated by unlevel-sets formulation of Raviv, Kiryati and Sochen [8] and statistical formulation of Leventon, Grimson and Faugeras [6], we present a new image segmentation method which accounts for prior shape information. Our method depends on Ambrosio-Tortorelli approximation of Mumford-Shah functional. The prior shape is represented by a by-product of this functional, a smooth edge indicator functi...
Prior knowledge guided weakly supervised object detection and semantic segmentation
Baltacı, Fatih; Cinbiş, Ramazan Gökberk; Department of Computer Engineering (2022-2)
State-of-the-art recognition models in computer vision are trained using annotated training data. Collecting manual annotation for images is a time-consuming and tedious task. Annotation time and difficulty also change across computer vision tasks. For example, object detection tasks require bounding-box annotations, which can be difficult to annotate, particularly in complex scenes, and semantic segmentation tasks require pixel-level annotations, which by definition requires a great amount of effort. Weakl...
Citation Formats
P. Duygulu, O. Ozcanli, and N. Papernick, “Comparison of feature sets using multimedia translation,” 2003, vol. 2869, p. 513, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/67142.