Investigation of the significance of periodicity information in speaker identification

2008
Gürsoy, Seçil
This thesis investigates general feature selection methods and, in particular, the use of periodicity and aperiodicity information in the speaker verification task. A software system is constructed to extract periodicity and aperiodicity information from speech. This information is obtained by using a 16-channel filterbank and analyzing the channel outputs frame by frame according to the pitch of each frame. The pitch value of a frame is also found using periodicity algorithms. A Parzen window (kernel density estimation) is used to represent each person’s selected phoneme. The constructed method is tested on different phonemes to determine how usable it is across phonemes. Periodicity features are also combined with MFCC features to assess their contribution to the speaker identification problem.
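
The Parzen window step mentioned in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: the isotropic Gaussian kernel, the `bandwidth` value, the function names `parzen_log_density` and `identify`, and the toy feature vectors are all assumptions made for illustration. Each speaker's phoneme is represented by its stored training feature vectors, and a test utterance is assigned to the speaker whose density estimate gives the highest total log-likelihood over its frames.

```python
import math


def parzen_log_density(x, samples, bandwidth):
    """Log of a Gaussian Parzen-window density estimate at point x.

    x         : feature vector (list of floats) for one frame
    samples   : training feature vectors for one speaker's phoneme
    bandwidth : kernel width h (hypothetical tuning parameter)
    """
    d = len(x)
    # Log normalisation constant of an isotropic d-dimensional Gaussian kernel.
    log_norm = -0.5 * d * math.log(2.0 * math.pi * bandwidth ** 2)
    exponents = []
    for s in samples:
        sq_dist = sum((xi - si) ** 2 for xi, si in zip(x, s))
        exponents.append(-0.5 * sq_dist / bandwidth ** 2)
    # Log-sum-exp keeps the sum of kernels numerically stable.
    m = max(exponents)
    log_sum = m + math.log(sum(math.exp(e - m) for e in exponents))
    # Average over the training samples (the 1/N factor of the estimate).
    return log_norm + log_sum - math.log(len(samples))


def identify(frames, speaker_models, bandwidth=0.5):
    """Return the best-scoring speaker and the per-speaker total log-densities."""
    scores = {
        name: sum(parzen_log_density(f, samples, bandwidth) for f in frames)
        for name, samples in speaker_models.items()
    }
    return max(scores, key=scores.get), scores


# Toy two-dimensional features (made up, not data from the thesis).
models = {
    "speaker_A": [[0.0, 0.0], [0.1, -0.1], [-0.1, 0.1]],
    "speaker_B": [[2.0, 2.0], [2.1, 1.9], [1.9, 2.1]],
}
best, scores = identify([[0.05, 0.0], [0.0, 0.05]], models)
print(best)
```

In a setup like the one the abstract describes, the feature vectors would be the per-frame periodicity and aperiodicity values (optionally concatenated with MFCCs) rather than the toy numbers used here, and the bandwidth would need to be chosen on held-out data.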

Suggestions

A switch mode power supply for producing half wave sine output
Kaya, İbrahim; Ertan, Hulusi Bülent; Department of Electrical and Electronics Engineering (2008)
In this thesis, the analysis, design and implementation of a DC-DC converter with active clamp forward topology are presented. The main objective of this thesis is to generate a rectified sinusoidal voltage at the output of the converter. This is accomplished by changing the reference signal of the converter. The converter output is applied to an inverter circuit in order to obtain a sinusoidal waveform. The zero crossing points of the converter are detected and the inverter drive signals are generated in order to o...
Efficient fpga implementation of image enhancement using video streams
Günay, Hazan; Aşkar, Murat; Department of Electrical and Electronics Engineering (2010)
This thesis is composed of three main parts: displaying an analog composite video input by converting it to digital VGA format, license plate localization on a video image, and image enhancement on FPGA. Analog composite video input, either PAL or NTSC, is decoded on a video decoder board; then, on the FPGA, the video data is converted from 4:2:2 YCbCr format to RGB. To display RGB data on the screen, a line doubling de-interlacing algorithm is used since it is efficient considering computational complexity and timing....
Electromagnetic compatibility of electric power quality monitor according to EN 61326 standard
Yaman, Özgür; Ermiş, Muammer; Department of Electrical and Electronics Engineering (2007)
In this thesis, the Electromagnetic Compatibility (EMC) of the Electric Power Quality Monitor developed within the scope of the National Power Quality Project has been investigated according to the EN 61326 standard. Both immunity and emission tests have been carried out in the EMC laboratories of ELDAS and ASELSAN for the device under test. Necessary countermeasures such as using electromagnetic interference (EMI) filters and transient voltage suppressors, shielding the case of the device with EMI protective materials have been ...
The effects of the material density and dimensions of the landslide on the generated tsunamis
İnsel, Işıl; Yalçıner, Ahmet Cevdet; Department of Civil Engineering (2009)
In this thesis study, the mechanism and modeling of tsunamis generated by landslides are investigated. Landslide parameters affecting the surface wave characteristics are studied. In order to understand the occurrence of this kind of tsunami, the historical tsunamis that were triggered by landslides are identified and studied. The generation of landslide-generated tsunamis is modeled using the TWO-LAYER model, which solves nonlinear long wave equations simultaneously within two interfacing layers with...
Robust quality metrics for assessing multimodal data
Konuk, Barış; Akar, Gözde; Department of Electrical and Electronics Engineering (2015)
In this thesis work, a novel, robust, objective, no-reference video quality assessment (VQA) metric, namely the Spatio-Temporal Network aware Video Quality Metric (STN-VQM), has been proposed for estimating perceived video quality under compression and transmission distortions. STN-VQM uses parameters reflecting the spatiotemporal characteristics of the video such as spatial complexity and motion. STN-VQM also utilizes parameters representing distortions due to compression and transmission such as bit rate and p...
Citation Formats
S. Gürsoy, “Investigation of the significance of periodicity information in speaker identification,” M.S. - Master of Science, Middle East Technical University, 2008.