Multi-modal video summarization using hidden Markov models for content-based multimedia indexing

Yaşaroğlu, Yağız
This thesis deals with scene level summarization of story-based videos. Two different approaches for story-based video summarization are investigated. The first approach probabilistically models the input video and identifies scene boundaries using the same model. The second approach models scenes and classifies scene types by evaluating likelihood values of these models. In both approaches, hidden Markov models are used as the probabilistic modeling tools. The first approach also exploits the relationship between video summarization and video production, which is briefly explained, by means of content types. Two content types are defined, dialog driven and action driven content, and the need to define such content types is emonstrated by simulations. Different content types use different hidden Markov models and features. The selected model segments input video as a whole. The second approach models scene types. Two types, dialog scene and action scene, are defined with different features and models. The system classifies fixed sized partitions of the video as either of the two scene types, and segments partitions separately according to their scene types. Performance of these two systems are compared against a iv deterministic video summarization method employing clustering based on visual properties and video structure related rules. Hidden Markov model based video summarization using content types enjoys the highest performance.


Semantic video analysis for surveillance systems
Kardaş, Karani; Coşar, Ahmet; Çiçekli, Fehime Nihan; Department of Computer Engineering (2018)
This thesis presents novel studies about semantic inference of video events. In this respect, a surveillance video analysis system, called SVAS is introduced for surveillance domain, in which semantic rules and the definition of event models can be learned or defined by the user for automatic detection and inference of complex video events. In the scope of SVAS, an event model method named Interval-Based Spatio-Temporal Model (IBSTM) is proposed. SVAS can learn action models and event models without any pre...
Multimodal video database modeling, querying and browsing
Durak, N; Yazıcı, Adnan (2005-01-01)
In this paper, a multimodal video indexing and retrieval system, MMVIRS, is presented. MMVIRS models the auditory, visual, and textual sources of video collections from a semantic perspective. Besides multimodality, our model is constituted on semantic hierarchies that enable us to access the video from different semantic levels. MMVIRS has been implemented with data annotation, querying and browsing parts. In the annotation part, metadata information and video semantics are extracted in hierarchical ways. ...
Thermal changes in an artificial lake simulated using a one-dimensional numerical model
Tokyay Sinha, Talia Ekin; Yetgin, Mehmet Yücel (LookUs Bilisim A.S., 2019)
Bu çalışma yüksek sıcaklıktaki suyun göl ve rezervuar gibi akıntısız su kütlelerine verilmesini incelemektedir. Sayısal çalışmada PROBE isimli bir boyutlu (1B) sonlu hacim yazılımı kullanılmıştır. Yazılım, yüksek sıcaklıktaki suyun ve rüzgârın göl içindeki karışma süreçlerine etkisine, Koriolis etkisine ve güneş ışınımı etkisine açıklamalar getirmektedir. Bu koşullar termik santrallerdeki (kömür, doğalgaz, nükleer vb.) soğutma işlemleriyle alakalıdır. Mevsimsel doğal tabakalaşma ve termoklin oluşumu ...
Spatial-temporal fixed and hegemonic transitions in the historical capitalism
Taşkesen, Suat; Tayfur, Mehmet Fatih; Department of International Relations (2010)
This thesis analyzes the historical capitalism in a historical context. ccumulation cycles, hegemonic transitions, and their interrelated structures n the historical capitalism will be discussed alongside inspired prose, and completed final drafts. The thesis will also trace the causes and effects of accumulation cycles and hegemonic transitions and will seek to answer questions such as how and why those cycles and transitions ocur, what are the determinants and how and why those determinants effect those p...
Robust quality metrics for assessing multimodal data
Konuk, Barış; Akar, Gözde; Department of Electrical and Electronics Engineering (2015)
In this thesis work; a novel, robust, objective, no-reference video quality assessment (VQA) metric, namely Spatio-Temporal Network aware Video Quality Metric (STNVQM), has been proposed for estimating perceived video quality under compression and transmission distortions. STN-VQM uses parameters reflecting the spatiotemporal characteristics of the video such as spatial complexity and motion. STN-VQM also utilizes parameters representing distortions due to compression and transmission such as bit rate and p...
Citation Formats
Y. Yaşaroğlu, “Multi-modal video summarization using hidden Markov models for content-based multimedia indexing,” M.S. - Master of Science, Middle East Technical University, 2003.