Evaluation of voice activity and voicing detection

2008-09-22
Kotnik, Bojan
Sendorek, Pierre
Astrov, Sergey
Doco Fernndez, Laura
Banga, Eduardo Rodrguez
Höge, Harald
Kacic, Zdravko
Koç, Turgay
Çiloğlu, Tolga
This paper describes the ECESS evaluation campaign of voice activity and voicing detection. Standard VAD classifies signal into speech and non-speech, we extend it to VAD+ so that it classifies a signal as a sequence of non-speech, voiced and unvoiced segments. The evaluation is performed on a portion of the Spanish SPEECON database with manually labeled segmentation. To avoid errors caused by the limited precision of manual labeling we introduce "dead zones" -tolerance intervals ±5 ms around label changes in the data set. In these tolerance intervals we don't evaluate the signal.
INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association

Suggestions

Abnormal Crowd Behavior Detection Using Novel Optical Flow-Based Features
Direkoglu, Cem; Sah, Melike; O'Connor, Noel E. (2017-09-01)
In this paper, we propose a novel optical flow based features for abnormal crowd behaviour detection. The proposed feature is mainly based on the angle difference computed between the optical flow vectors in the current frame and in the previous frame at each pixel location. The angle difference information is also combined with the optical flow magnitude to produce new, effective and direction invariant event features. A one-class SVM is utilized to learn normal crowd behavior. If a test sample deviates si...
A FRAMEWORK FOR DETECTING COMPLEX EVENTS IN SURVEILLANCE VIDEOS
Onal, Itir; Kardas, Karani; Rezaeitabar, Yousef; Bayram, Ulya; Bal, Murat; Ulusoy, İlkay; Cicekli, Nihan Kesim (2013-07-19)
This paper presents a framework for detecting complex events in surveillance videos. Moving objects in the foreground are detected in the object detection component of the system. Whether these foregrounds are human or not is decided in the object recognition component. Then each detected object is tracked and labeled in the object tracking component, in which true labeling of objects in the occlusion situation is also provided. The extracted information is fed to the event detection component. Rule based e...
Dynamic Speech Spectrum Representation and Tracking Variable Number of Vocal Tract Resonance Frequencies With Time-Varying Dirichlet Process Mixture Models
Özkan, Emre; Demirekler, Muebeccel (Institute of Electrical and Electronics Engineers (IEEE), 2009-11-01)
In this paper, we propose a new approach for dynamic speech spectrum representation and tracking vocal tract resonance (VTR) frequencies. The method involves representing the spectral density of the speech signals as a mixture of Gaussians with unknown number of components for which time-varying Dirichlet process mixture model (DPM) is utilized. In the resulting representation, the number of formants is allowed to vary in time. The paper first presents an analysis on the continuity of the formants in the sp...
Evaluation of UAS Camera Operator Interfaces in a Simulated Task Environment An Optical Brain Imaging Approach
Çakır, Murat Perit; Akay, Daryal; Ayaz, Hasan; İşler, Veysi (null; 2012-07-11)
In this paper we focus on the effect of different interface designs on the performance and cognitive workload of sensor operators (SO) during a target detection task in a simulated environment. Functional near-infrared (fNIR) spectroscopy is used to investigate whether there is a relationship between target detection performance across three SO interfaces and brain activation data obtained from the subjects’ prefrontal cortices that are associated with relevant higher-order cognitive functions such as atten...
Application of Project-Based Learning in a Theoretical Course: Process, Difficulties and Recommendations
CODUR, K. Burak; Karatas, Sercin; Doğru, Ali Hikmet (2012-01-01)
This paper presents a case study about the application of a project-based learning approach. In this case study, software development projects are performed by the students, using historical software development methods in order to demonstrate evolution of the subject. The presented case study differs from others reported in the literature in its utilization of historical methods for project execution. Getting feedback and reaction of students and assessing the success of the project-based learning implemen...
Citation Formats
B. Kotnik et al., “Evaluation of voice activity and voicing detection,” Brisbane, QLD; Australia, 2008, p. 1642, Accessed: 00, 2021. [Online]. Available: https://hdl.handle.net/11511/87345.