Content-Based Retrieval of Audio in News Broadcasts

2009-10-28
Dogan, Ebru
SERT, MUSTAFA
Yazıcı, Adnan
This paper describes a complete, scalable and extensible content-based retrieval system for news broadcasts. Depending on segmentation results of the selected audio data, our system allows users to query audio data semantically by using both domain based fuzzy classes (anchor, commercial, reporter, sports, transition, weatherforecast, and venuesound) and similarity search. Two kinds of experiments were conducted on audio tracks of TRECVID news broadcasts to evaluate performance of the proposed query-by-example technique. The results obtained from our experiments demonstrate that Audio Spectrum Flatness feature in MPEG-7 standard performs better in music audio samples compared to other kinds of audio samples and the system is robust under different conditions.

Suggestions

CONTENT BASED HYPERSPECTRAL IMAGE RETRIEVAL USING BAG OF ENDMEMBERS IMAGE DESCRIPTORS
Omruuzun, Fatih; Demir, Begum; Bruzzone, Lorenzo; Çetin, Yasemin (2016-08-24)
This paper proposes a novel system for fast and accurate content based retrieval of hyperspectral images. The proposed system aims at retrieving hyperspectral images that have both similar spectral characteristics associated with specific materials and fractional abundances to the query image. It consists of two modules. The first module characterizes the query and the target hyperspectral images in the archive by two descriptors: 1) a binary spectral descriptor representing spectral characteristics of dist...
Delay and Peak-Age Violation Probability in Short-Packet Transmissions
Devassy, Rahul; Durisi, Giuseppe; Ferrante, Guido Carlo; Simeone, Osvaldo; Uysal, Elif (2018-06-22)
This paper investigates the distribution of delay and peak age of information in a communication system where packets, generated according to an independent and identically distributed Bernoulli process, are placed in a single-server queue with first-come first-served discipline and transmitted over an additive white Gaussian noise (AWGN) channel. When a packet is correctly decoded, the sender receives an instantaneous error-free positive acknowledgment, upon which it removes the packet from the buffer. In ...
Electromagnetic Target Classification using time frequency analysis and neural networks
Sayan, Gönül; Leblebicioğlu, Mehmet Kemal (Wiley, 1999-04-01)
This paper demonstrates the feasibility and advantages of using a self-organizing map (SOM)-type neural network classifier for electromagnetic target recognition. The classifier is supported by a novel feature extraction unit in which the Wigner distribution (WD), a time-frequency representation, is utilized for the extraction of natural-resonance-related energy feature vectors from scattered fields. The proposed target classification technique is tested for a set of canonical targets, displaying an excelle...
Hierarchical multitasking control of discrete event systems: Computation of projections and maximal permissiveness
Schmidt, Klaus Verner; Cury, José E.r. (null; 2010-12-01)
This paper extends previous results on the hierarchical and decentralized control of multitasking discrete event systems (MTDES). Colored observers, a generalization of the observer property, together with local control consistency, allow to derive sufficient conditions for synthesizing modular and hierarchical control that are both strongly nonblocking (SNB) and maximally permissive. A polynomial procedure to verify if a projection fulfills the above properties is proposed and in the case they fail for a g...
Image generation using only a discriminator network with gradient norm penalty
Yeşilçimen, Cansu Cemre; Akbaş, Emre; Department of Computer Engineering (2022-9)
This thesis explores the idea of generating images using only a discriminator network by extending a previously proposed method (Tapli, 2021) in several ways. The base method works by iteratively updating the input image, which is pure noise at the beginning while increasing the discriminator's score. We extend the training procedure of the base network by adding the following new losses: (i) total variation, (ii) N-way classification (if labels are available), and (iii) gradient norm penalty on real exam...
Citation Formats
E. Dogan, M. SERT, and A. Yazıcı, “Content-Based Retrieval of Audio in News Broadcasts,” 2009, vol. 5822, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/52702.