Performance evaluation of real-time noisy speech recognition for mobile devices

Download

index.pdf

Date

2019

Author

Yurtcan, Yaser

Metadata

Show full item record

Item Usage Stats

437
views

208
downloads

Communication is important for people. There are many available communication methods. One of the most effective methods is through the use of speech. People can comfortably express their feelings and thoughts by using speech. However, some people may have a hearing problem. Furthermore, understanding spoken words in a noisy environment could be a challenge even for healthy people. Speech recognition systems enable real-time speech to text conversion. They mainly involve capturing of the sound waves and converting them into meaningful texts. The use of speech recognition on mobile devices has been possible with the development of cloud systems. However, delivering a robust and low error rate speech recognition system in a noisy environment still is a major problem. In this study, different speech samples have been recorded using a compact microphone array in noisy environments and a data set has been created by processing them through a real-time noise cancellation algorithm. A portable design of a mobile system with noise cancellation hardware and software was proposed to convert spoken words to a meaningful text. Comprehensive tests were performed on several clean, noisy and denoised speech samples to measure the speech recognition performance of different cloud systems, noise robustness of the proposed system, the effect of gender on the speech recognition performance, and the performance improvement. The experimental results show that the proposed system provides good performance even in a noisy environment. It is also inferred from the results that in order to apply speech recognition using cloud based systems on mobile devices, the noise level has to be low or real-time noise cancellation algorithms are needed. The proposed system improves speech recognition accuracy in noisy environments. Thus, the achieved performance and portable design together enable the system to be used in daily life

Subject Keywords

Speech perception., Speech processing systems., Automatic speech recognition., Mobile communication systems.

URI

http://etd.lib.metu.edu.tr/upload/12623092/index.pdf
https://hdl.handle.net/11511/28055

Collections

Graduate School of Informatics, Thesis

Suggestions

OpenMETU
Core

Wireless Body Area Network Studies for Telemedicine Applications Using IEEE 802.15.6 Standard Ozderya, Hasan Yavuz; ERDÖL, HAKAN; KAYIKÇIOĞLU, TEMEL; Yılmaz, Ali Özgür; KAYA, İSMAİL (2017-03-18) Wireless communication is becoming a part of our life at every step. But widespread use in medical applications is yet to come. We are developing a wireless communication system based on 802.15.6 MAC and 802.15.4 PHY for use in transmitting ECG data from a remote patient monitoring device which is used for home based telemedicine applications. The paper concentrates on explaining the stack program development phases of the standard IEEE 802.15.6 and its flexible access features. It is believed that the subj...
Keystroke transcription from acoustic emanations using continuous wavelet transform Özkan, Abdullah; Günel Kılıç, Banu; Acartürk, Cengiz; Department of Cybersecurity (2022-9) One of the most common methods of communication is written communication. Written communication has been found in various forms over the years and has changed shape with technical and technological developments. Today, written communication has shifted to digital media and keyboards have become one of the most frequently used entry points. This makes keyboards a critical node in the flow of information. There are several ways in which information entered through the keyboard can leak. Acoustic propagation i...
A Review of Research on Intercultural Learning through Computer-Based Digital Technologies Çiftçi, Emrullah Yasin (2016-04-01) Intercultural communication is now a crucial part of our globalizing lives; however, not everyone has an opportunity to engage in an intercultural interaction with people from different cultures. Computer-based technologies are promising in creating environments for people to communicate with people from diverse cultures. This qualitative synthesis of quantitative and qualitative research therefore aimed to investigate the literature in respect to intercultural learning through technology use. Besides repor...
Ad Hoc routing simulation and tactical picture display tool for navy Aymak, Onur; Coşar, Ahmet; Department of Computer Engineering (2004) The importance of communication is vital in wartime. The capability of having all the position information of the allied and enemy forces in a single Tactical Information Display System (TIDS), maintains a great advantage for deciding what to do before the enemy reacts. A Naval Information Distributing System (NIDS) is developed for building an effective communication infrastructure between the war ships. In the designed network, besides the mobile platforms (ships), some fixed platforms (land stations) are...
Gestures production under instructional context The role of mode of instruction Melda, Coşkun; Acartürk, Cengiz (Cognitive Science Society ; 2015-09-25) We aim at examining how communication mode influences the production of gestures under specific contextual environments. Twenty-four participants were asked to present a topic of their choice under three instructional settings: a blackboard, paper-and-pencil, and a tablet. Participants’ gestures were investigated in three groups: deictic gestures that point to entities, representational gestures that present picturable aspects of semantic content, and beat gestures that are speech-related rhythmic hand move...

Citation Formats

Y. Yurtcan, “Performance evaluation of real-time noisy speech recognition for mobile devices,” M.S. - Master of Science, Middle East Technical University, 2019.