Robust quality metrics for assessing multimodal data

Download

index.pdf

Date

2015

Author

Konuk, Barış

Metadata

Show full item record

Item Usage Stats

320
views

130
downloads

In this thesis work; a novel, robust, objective, no-reference video quality assessment (VQA) metric, namely Spatio-Temporal Network aware Video Quality Metric (STNVQM), has been proposed for estimating perceived video quality under compression and transmission distortions. STN-VQM uses parameters reflecting the spatiotemporal characteristics of the video such as spatial complexity and motion. STN-VQM also utilizes parameters representing distortions due to compression and transmission such as bit rate and packet loss ratio. STN-VQM has been trained on the Laboratory of Image and Video Engineering (LIVE) VQA database, owned by University of Texas at Austin, and evaluated on LIVE, Ecole Politechnique Federale de Lausanne (EPFL)- Politecnico di Milano (PoliMI) and Instituto de Telecomunicacoes, Instituto Superior Tecnico (IT-IST) VQA databases and also on video streams in University of Plymouth audiovisual quality assessment (AVQA) database. STN-VQM is proven to predict perceived video quality accurately on these databases, which span a wide range of video contents, video codecs, spatial resolutions, bit rates, frame rates, packet losses etc. Comparison to the existing state-of-the-art VQA metrics indicates that the STN-VQM provides promising results. Moreover, a novel, objective, no-reference audio quality assessment (AQA) metric has been introduced in order to predict perceived audio quality under compression and transmission distortions. Proposed AQA metric appraises perceived audio quality based on parameters such as sampling frequency, bit rate and packet loss ratio. Proposed AQA metric has been trained and evaluated on two different AQA databases. The AQA metric is shown to appraise perceived audio quality reliably on these AQA databases, which have different audio encoding types. Finally, an objective, no-reference AVQA metric (namely, Direct AudioVisual Quality Assessment – DAVQA) has been obtained by applying the classical approach in the literature, i.e., by combining perceived video quality estimate, perceived audio quality estimate and their product. Moreover, a novel video classification method which classifies videos according to their spatio-temporal characteristics has been developed. Using this spatio-temporal based video classification method, a novel, content-dependent AVQA algorithm (namely Content Dependent AudioVisual Quality Assessment – CDAVQA) has been designed. The CDAVQA model is shown to be more accurate than the DAVQA model on the audiovisual data in the University of Plymouth AVQA database.

Subject Keywords

Database management., Data structures (Computer science)., Digital video, Digital media, Content-based image retrieval.

URI

http://etd.lib.metu.edu.tr/upload/12618606/index.pdf
https://hdl.handle.net/11511/24529

Collections

Graduate School of Natural and Applied Sciences, Thesis

Suggestions

OpenMETU
Core

Intraday markets and potential benefits for Turkey İlseven, Engin; Sevaioğlu, Osman; Department of Electrical and Electronics Engineering (2014) In this thesis work; the characteristics, applications, logic, and potential benefits of intraday markets are investigated comprehensively. The properties of intraday markets are examined, and within this framework the applications of intraday markets in Europe and the mechanism that is going to be applied in Turkey are discussed. Also, considering the increasing importance of intraday markets, the logic behind these markets is examined. In this respect, the uncertainties which cause imbalances in the balan...
Electromagnetic compatibility of electric power quality monitor according to EN 61326 standard Yaman, Özgür; Ermiş, Muammer; Department of Electrical and Electronics Engineering (2007) In this thesis; Electromagnetic Compatibility (EMC) of Electric Power Quality Monitor developed within the scope of National Power Quality Project has been investigated according to EN 61326 standard. Both immunity and emission tests have been carried out in EMC laboratories of ELDAS and ASELSAN for the device under test. Necessary counter measures such as using electromagnetic interference (EMI) filters and transient voltage suppressors, shielding the case of device with EMI protective materials have been ...
Investigation of the significance of periodicity information in speaker identification Gürsoy, Seçil; Çiloğlu, Tolga; Department of Electrical and Electronics Engineering (2008) In this thesis; general feature selection methods and especially the use of periodicity and aperiodicity information in speaker verification task is searched. A software system is constructed to obtain periodicity and aperiodicity information from speech. Periodicity and aperiodicity information is obtained by using a 16 channel filterbank and analyzing channel outputs frame by frame according to the pitch of that frame. Pitch value of a frame is also found by using periodicity algorithms. Parzen window (ke...
A switch mode power supply for producing half wave sine output Kaya, İbrahim; Ertan, Hulusi Bülent; Department of Electrical and Electronics Engineering (2008) In this thesis; analysis, design and implementation of a DC-DC converter with active clamp forward topology is presented. The main objective of this thesis is generating a rectified sinusoidal voltage at the output of the converter. This is accomplished by changing the reference signal of the converter. The converter output is applied to an inverter circuit in order to obtain sinusoidal waveform. The zero crossing points of the converter is detected and the inverter drive signals are generated in order to o...
Comparison of Efficiency of Two dc-to-ac Converters for Grid Connected Solar Applications Ertan, Hulusi Bülent; Yilmaz, Arif (2012-05-26) In this paper; requirements from grid connected photovoltaic (PV) converters are briefly reviewed. Traditional buck-converter, line-frequency transformer topology is taken as reference, which satisfies all of the requirements imposed by standards and the utility. However, this topology employs a bulky transformer. Furthermore, a large electrolytic capacitor is needed in this circuit, which is expensive and also limits the life of the converter. This is not desirable in modern applications where PV module st...

Citation Formats

B. Konuk, “Robust quality metrics for assessing multimodal data,” Ph.D. - Doctoral Program, Middle East Technical University, 2015.