Modeling of plosive to vowel transitions

Beköz, Alican
This thesis studies stop consonant to vowel transitions and models them with the acoustic tube model. The transitions are first characterized: several transitions, including fricative to vowel transitions, are examined in terms of their spectral and temporal properties. In addition, x-ray snapshots, lip videos, and experiments with human subjects are used to strengthen the characterization from both the production and the perception points of view. These studies indicate that plosive to vowel transitions are produced by exponential vocal tract movements and that their perception is closely related to exponential spectral changes. A model based on the acoustic tube model is then built on the knowledge gained during characterization, so it incorporates the vocal tract parameters observed in that part of the study. Finally, plosive to vowel transitions for three types of plosives (alveolar, labial, and velar) are synthesized with the proposed model. The formants of the synthesized sounds are compared with those of natural sounds, and intelligibility tests are carried out on the synthesized sounds. The performance evaluation tests show the proposed model’s performance to be satisfactory.
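
As a rough illustration of the two ideas named in the abstract, an exponential vocal tract movement and an acoustic tube model, the sketch below relaxes the cross-sectional areas of a concatenated lossless tube exponentially from a constricted configuration toward a vowel configuration and computes the junction reflection coefficients at a given instant. It is not the thesis's model: the function names, the four-section area values, and the time constant `tau` are all hypothetical, and the sign convention for the reflection coefficient is one common choice among several.

```python
import math


def exponential_trajectory(start, target, tau, t):
    """Value at time t of a quantity relaxing exponentially from start to target.

    tau is the time constant in seconds (hypothetical parameter).
    """
    return target + (start - target) * math.exp(-t / tau)


def reflection_coefficients(areas):
    """Reflection coefficients at the junctions of a lossless concatenated tube.

    Uses the convention r_k = (A_{k+1} - A_k) / (A_{k+1} + A_k);
    other texts use the opposite sign.
    """
    return [(a_next - a) / (a_next + a) for a, a_next in zip(areas, areas[1:])]


# Hypothetical cross-sectional areas in cm^2, glottis to lips.
# The third section is nearly closed at the release of the plosive.
CLOSED_AREAS = [3.0, 2.0, 0.05, 1.0]
VOWEL_AREAS = [3.0, 2.0, 1.5, 1.0]
TAU = 0.010  # assumed time constant of the articulatory movement, in seconds


def areas_at(t):
    """Tube areas t seconds after the release, each moving exponentially."""
    return [
        exponential_trajectory(c, v, TAU, t)
        for c, v in zip(CLOSED_AREAS, VOWEL_AREAS)
    ]


if __name__ == "__main__":
    for t_ms in (0, 5, 10, 20, 50):
        areas = areas_at(t_ms / 1000.0)
        coeffs = reflection_coefficients(areas)
        print(t_ms, [round(a, 3) for a in areas], [round(r, 3) for r in coeffs])
```

In a fuller synthesis setup, the time-varying reflection coefficients would drive a lattice filter excited by a source signal; that step is omitted here.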


Dynamic Speech Spectrum Representation and Tracking Variable Number of Vocal Tract Resonance Frequencies With Time-Varying Dirichlet Process Mixture Models
Özkan, Emre; Demirekler, Mübeccel (Institute of Electrical and Electronics Engineers (IEEE), 2009-11-01)
In this paper, we propose a new approach for dynamic speech spectrum representation and tracking vocal tract resonance (VTR) frequencies. The method involves representing the spectral density of the speech signals as a mixture of Gaussians with unknown number of components for which time-varying Dirichlet process mixture model (DPM) is utilized. In the resulting representation, the number of formants is allowed to vary in time. The paper first presents an analysis on the continuity of the formants in the sp...
Estimation of Articulatory Trajectories Based on Gaussian Mixture Model (GMM) With Audio-Visual Information Fusion and Dynamic Kalman Smoothing
ÖZBEK, İbrahim Yücel; Hasegawa-Johnson, Mark; Demirekler, Mübeccel (Institute of Electrical and Electronics Engineers (IEEE), 2011-07-01)
This paper presents a detailed framework for Gaussian mixture model (GMM)-based articulatory inversion equipped with special postprocessing smoothers, and with the capability to perform audio-visual information fusion. The effects of different acoustic features on the GMM inversion performance are investigated and it is shown that the integration of various types of acoustic (and visual) features improves the performance of the articulatory inversion process. Dynamic Kalman smoothers are proposed to adapt t...
Prediction of ducted diaphragm noise using a stochastic approach with adapted temporal filters
Karban, Ugur; Schram, Christophe; Sovardi, Carlo; Polifke, Wolfgang (SAGE Publications, 2019-01-01)
The noise production by ducted single- and double-diaphragm configurations is simulated using a stochastic noise generation and radiation numerical method. The importance of modeling correctly the anisotropy and temporal de-correlation is discussed, based on numerical results obtained by large eddy simulation. A new temporal filter is proposed, designed to provide the targeted spectral decay of energy in an Eulerian reference frame. An anisotropy correction is implemented using a non-linear model. The acous...
Diffusion Equation-Based Finite Element Modeling of a Monumental Worship Space
Gul, Zuhre Su; Xiang, Ning; Çalışkan, Mehmet (World Scientific Pub Co Pte Lt, 2017-12-01)
In this work, a diffusion equation model (DEM) is applied to a room acoustics case for in-depth sound field analysis. Background of the theory, the governing and boundary equations specifically applicable to this study are presented. A three-dimensional geometric model of a monumental worship space is composed. The DEM is solved over this model in a finite element framework to obtain sound energy densities. The sound field within the monument is numerically assessed; spatial sound energy distributions and f...
Predictions on absorption and scattering characteristics of acoustic scatterers modified with micro-perforated panels
Odabaş, Erinç; Çalışkan, Mehmet; Department of Mechanical Engineering (2012)
In this study, the basic absorption and scattering characteristics of acoustic scatterers, specifically Schroeder Diffusers, are investigated. Schroeder Diffusers are one of the most widely used acoustic scatterers in which the scattering phenomenon is predictable due to the geometry of the diffuser, based on a particular mathematical sequence. It is shown that it is possible to increase the amount of absorption by modifying the diffuser structure by means of adding perforated panels into the wells or narro...
Citation Formats
A. Beköz, “Modeling of plosive to vowel transitions,” M.S. - Master of Science, Middle East Technical University, 2007.