HandVR: a hand-gesture-based interface to a video retrieval system

Download
2015-10-01
Genc, Serkan
Bastan, Muhammet
GÜDÜKBAY, UĞUR
Atalay, Mehmet Volkan
GÜDÜKBAY, UĞUR
Using one's hands in human-computer interaction increases both the effectiveness of computer usage and the speed of interaction. One way of accomplishing this goal is to utilize computer vision techniques to develop hand-gesture-based interfaces. A video database system is one application where a hand-gesture-based interface is useful, because it provides a way to specify certain queries more easily. We present a hand-gesture-based interface for a video database system to specify motion and spatiotemporal object queries. We use a regular, low-cost camera to monitor the movements and configurations of the user's hands and translate them to video queries. We conducted a user study to compare our gesture-based interface with a mouse-based interface on various types of video queries. The users evaluated the two interfaces in terms of different usability parameters, including the ease of learning, ease of use, ease of remembering (memory), naturalness, comfortable use, satisfaction, and enjoyment. The user study showed that querying video databases is a promising application area for hand-gesture-based interfaces, especially for queries involving motion and spatiotemporal relations.
SIGNAL IMAGE AND VIDEO PROCESSING

Suggestions

Multimodal query-level fusion for efficient multimedia information retrieval
Sattari, Saeid; Yazıcı, Adnan (2018-10-01)
Managing a large volume of multimedia data containing various modalities such as visual, audio, and text reveals the necessity for efficient methods for modeling, processing, storing, and retrieving complex data. In this paper, we propose a fusion-based approach at the query level to improve query retrieval performance of multimedia data. We discuss various flexible query types including the combination of content as well as concept-based queries that provide users with the ability to efficiently perform mu...
Oblivious video watermaking using temporal sensitivity of HVS
Koz, A; Alatan, Abdullah Aydın (2004-04-30)
An oblivious video watermarking method is presented based on the temporal sensitivity of Human Visual System (HVS). The method exploits the temporal contrast thresholds of HVS to determine the maximum strength of watermark, which still gives imperceptible distortion after watermark insertion. Compared to other approaches in the literature, the method guarantees to avoid flickering problem in the watermarked video and gives better robustness results to video distortions, such as additive Gaussian noise, H.26...
Measurement-based replanning of cell capacities in GSM networks
Onur, Ertan; Ersoy, Cem; Çaǧlayan, M. Ufuk (2002-08-21)
Due to the scarcity of the spectral resources and mobility of the portables, the call attempts may be blocked during call initiation or terminated during the hand-off process. When the blocking ratio exceeds some grade of service level, the capacity of the congested cell must be replanned using the call attempt data. However, most of the time, the measurements are inflated by the redials and the retrials. During the replanning process, the first step should be to calculate the effective load from the measur...
Recursive shortest spanning tree algorithms for image segmentation
Bayramoglu, NY; Bazlamaçcı, Cüneyt Fehmi (2005-11-24)
Image segmentation has an important role in image processing and the speed of the segmentation algorithm may become a drawback for some applications. This study analyzes the run time performances of some variations of the Recursive Shortest Spanning Tree Algorithm (RSST) and proposes simple but effective modifications on these algorithms to improve their speeds. In addition, the effect of link weight cost function on the run time performance and the segmentation quality is examined. For further improvement ...
Flexible querying using structural and event based multimodal video data model
Oztarak, Hakan; Yazıcı, Adnan (2006-01-01)
Investments on multimedia technology enable us to store many more reflections of the real world in digital world as videos so that we carry a lot of information to the digital world directly. In order to store and efficiently query this information, a video database system (VDBS) is necessary. We propose a structural, event based and multimodal (SEBM) video data model which supports three different modalities that are visual, auditory and textual modalities for VDBSs and we can dissolve these three modaliti...
Citation Formats
S. Genc, M. Bastan, U. GÜDÜKBAY, M. V. Atalay, and U. GÜDÜKBAY, “HandVR: a hand-gesture-based interface to a video retrieval system,” SIGNAL IMAGE AND VIDEO PROCESSING, pp. 1717–1726, 2015, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/36556.