Comparison of feature sets using multimedia translation

Date

2003-01-01

Author

Duygulu, P
Ozcanli, OC
Papernick, N

Metadata

Show full item record

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Item Usage Stats

232
views

0
downloads

Feature selection is very important for many computer vision applications. However, it is hard to find a good measure for the comparison. In this study, feature sets are compared using the translation model of object recognition which is motivated by the availablity of large annotated data sets. Image regions are linked to words using a model which is inspired by machine translation. Word prediction performance is used to evaluate large numbers of images.

Subject Keywords

Machine translation, Image region, Content base image retrieval, Target distribution, Statistical machine translation

URI

https://hdl.handle.net/11511/67142

Collections

Department of Computer Engineering, Conference / Seminar

Suggestions

OpenMETU
Core

3D TRACKING OF PEOPLE WITH RAO-BLACKWELLIZED PARTICLE FILTERS Topcu, Osman; Orguner, Umut; Alatan, Abdullah Aydın; ERCAN, ALİ ÖZER (2014-04-25) Visual tracking has an important place among computer vision applications. Visual tracking with particle filters is a well-known methodology. The performance of particle filters is dependent on efficient sampling of the state space, which in turn, is dependent on number of particles. In this paper, Rao-Blackwell technique is applied to particle filters to improve sampling efficiency. Both algorithms are applied to people tracking problem. Under the same circumstances, the resulting algorithm is demonstrated...
Visual similarity for hdr images with applications to tone mapping Aydınlılar, Merve; Akyüz, Ahmet Oğuz; Tarı, Zehra Sibel; Department of Computer Engineering (2021-2-15) Assessing visual similarity between images is important for many computer vision applications. So far, investigations on visual similarity have been confined to low dynamic range images. However, recently, there is a growing interest to high dynamic range (HDR) imaging. In this thesis, the aim is to shed light on visual image similarity for HDR images by following an experimental approach. To this end, a user experiment is conducted through a novel web-based interface, in which the participants assess the p...
Visual object detection and tracking using local convolutional context features and recurrent neural networks Kaya, Emre Can; Alatan, Abdullah Aydın; Department of Electrical and Electronics Engineering (2018) Visual object detection and tracking are two major problems in computer vision which have important real-life application areas. During the last decade, Convolutional Neural Networks (CNNs) have received significant attention and outperformed methods that rely on handcrafted representations in both detection and tracking. On the other hand, Recurrent Neural Networks (RNNs) are commonly preferred for modeling sequential data such as video sequences. A novel convolutional context feature extension is introduc...
Edge strength functions as shape priors in image segmentation Erdem, Erkut; Erdem, Aykut; Tarı, Zehra Sibel (2005-12-01) Many applications of computer vision requires segmenting out of an object of interest from a given image. Motivated by unlevel-sets formulation of Raviv, Kiryati and Sochen [8] and statistical formulation of Leventon, Grimson and Faugeras [6], we present a new image segmentation method which accounts for prior shape information. Our method depends on Ambrosio-Tortorelli approximation of Mumford-Shah functional. The prior shape is represented by a by-product of this functional, a smooth edge indicator functi...
Prior knowledge guided weakly supervised object detection and semantic segmentation Baltacı, Fatih; Cinbiş, Ramazan Gökberk; Department of Computer Engineering (2022-2) State-of-the-art recognition models in computer vision are trained using annotated training data. Collecting manual annotation for images is a time-consuming and tedious task. Annotation time and difficulty also change across computer vision tasks. For example, object detection tasks require bounding-box annotations, which can be difficult to annotate, particularly in complex scenes, and semantic segmentation tasks require pixel-level annotations, which by definition requires a great amount of effort. Weakl...

Citation Formats

P. Duygulu, O. Ozcanli, and N. Papernick, “Comparison of feature sets using multimedia translation,” 2003, vol. 2869, p. 513, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/67142.