Simulation of Turkish lip motion and facial expressions in a 3D environment and synchronization with a Turkish speech engine

2004-04-30
AKAGUNDUZ, ERDEM
Halıcı, Uğur
ULUSOY PARNAS, İLKAY
In this thesis, 3D animation of human facial expressions and lip motion and their synchronization with a Turkish Speech engine using JAVA programming language, JAVA3D API and Java Speech API, is analyzed. A three-dimensional animation model for simulating Turkish lip motion and facial expressions is developed. In addition to lip motion, synchronization with a Turkish speech engine is achieved. The output of the study is facial expressions and Turkish lip motion synchronized with Turkish speech, where the input is Turkish text in Java Speech Markup Language (JSML) format, also indicating expressions. The animation is created using JAVA3D API. 3D facial models corresponding to different lip positions of the same person are morphed to each other to construct the animation. Moreover, simulations of human facial expressions of emotions are created within the animation. Expression weight parameter, which states the weight of the given expression, is introduced. The synchronization of lip motion with Turkish speech is achieved via CloudGarden(R)'s Java Speech API interface [2]. "Levent16k SAPI 4-5 Male Voice" of G-V.S Voice Technologies Software Firm was used for Turkish speech engine [3]. As a final point a virtual Turkish speaker with facial expression of emotions is created for JAVA3D animation.

Suggestions

Simulation of Turkish lip motion and facial expressions in a 3D environment and synchronization with a Turkish speech engine
Akagündüz, Erdem; Halıcı, Uğur; Department of Electrical and Electronics Engineering (2004)
In this thesis, 3D animation of human facial expressions and lip motion and their synchronization with a Turkish Speech engine using JAVA programming language, JAVA3D API and Java Speech API, is analyzed. A three-dimensional animation model for simulating Turkish lip motion and facial expressions is developed. In addition to lip motion, synchronization with a Turkish speech engine is achieved. The output of the study is facial expressions and Turkish lip motion synchronized with Turkish speech, where the in...
Optimization of physical parameters of an underactuated quadrupedal robot
Karagoz, Osman Kaan; Ankaralı, Mustafa Mert (2018-01-01)
In this paper, we present the comparison of different optimization algorithms that are used to optimize the parameters of a simulated legged robotic platform. We compare the results obtained by applying different algorithms on the same model and show the relative advantages and disadvantages of these algorithms. The tested algorithms are Particle Swarm Optimization, Binary Coded Genetic Algorithm, Broyden-Fletcher-Goldfrab-Shannon Algorithm and Method of Zoutendijk. We showed that the globally optimal param...
Bimodal automatic speech segmentation based on audio and visual information fusion
Akdemir, Eren; Çiloğlu, Tolga (2011-07-01)
Bimodal automatic speech segmentation using visual information together with audio data is introduced. The accuracy of automatic segmentation directly affects the quality of speech processing systems using the segmented database. The collaboration of audio and visual data results in lower average absolute boundary error between the manual segmentation and automatic segmentation results. The information from two modalities are fused at the feature level and used in a HMM based speech segmentation system. A T...
3D synthetic human face modelling tool based on T-spline surfaces
Aydoğan, Ali; Ulusoy, İlkay; Department of Electrical and Electronics Engineering (2007)
In this thesis work, a 3D Synthetic Human Face Modelling Software is implemented using C++ and OpenGL. Bézier surfaces, B-spline surfaces, Nonuniform Rational B-spline surfaces, Hierarchical B-Spline surfaces and T-spline surfaces are evaluated as options for the surface description method. T-spline surfaces are chosen since they are found to be superior considering the requirements of the work. In the modelling process, a modular approach is followed. Firstly, high detailed facial regions (i.e. nose, eyes,...
Implementation of an 8-bit microcontroller with system c
Kesen, Lokman; Aşkar, Murat; Department of Electrical and Electronics Engineering (2004)
In this thesis, an 8-bit microcontroller, 8051 core, is implemented using SystemC programming language. SystemC is a new generation co-design language which is capable of both programming software and describing hardware parts of a complete system. The benefit of this design environment appears while developing a System-on-Chip (SoC), that is a system consisting both custom hardware parts and embedded software parts. SystemC is not a completely new language, but based on C++ with some additional class libra...
Citation Formats
E. AKAGUNDUZ, U. Halıcı, and İ. ULUSOY PARNAS, “Simulation of Turkish lip motion and facial expressions in a 3D environment and synchronization with a Turkish speech engine,” 2004, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/33041.