Robotics & Automation Open Access Platform
         
 
Robust Speech Recognition and Understanding

ISBN 978-3-902613-08-0
hard cover, 460 pages
Edited by: Michael Grimm and Kristian Kroschel
Publisher: I-Tech Education and Publishing, Vienna, Austria
Publication date: June 2007
Price: 80 Euro incl. package & postage

   

About the Book

This book on Robust Speech Recognition and Understanding brings together many different aspects of current research on automatic speech recognition (ASR) and language understanding. The first four chapters address voice activity detection, which is considered an important issue for all speech recognition systems. The next chapters present several extensions to state-of-the-art HMM methods. A number of chapters then specifically address robust ASR under noisy conditions. Two chapters on the automatic recognition of a speaker’s emotional state highlight the importance of natural speech understanding and interpretation in voice-driven systems. The last chapters of the book address the application of conversational systems on robots, as well as the autonomous acquisition of vocalization skills.


 

Table of Contents

 

01 Voice Activity Detection. Fundamentals and Speech Recognition System Robustness
J. Ramirez, J. M. Gorriz and J. C. Segura

02 Novel Approaches to Speech Detection in the Processing of Continuous Audio Streams
Janez Sibert, Bostjan Vesnicer and France Mihelic

03 New Advances in Voice Activity Detection using HOS and Optimization Strategies
J.M. Gorriz, J. Ramirez and C.G. Puntonet

04 Voice and Noise Detection with AdaBoost
T. Takiguchi, N. Miyake, H. Matsuda, and Y. Ariki

05 Evolutionary speech recognition
Anne Spalanzani

06 Using Genetic Algorithm to Improve the Performance of Speech Recognition Based on Artificial Neural Network
Shing-Tai Pan and Chih-Chin Lai

07 A General Approximation-Optimization Approach to Large Margin Estimation of HMMs
Hui Jiang and Xinwei Li

08 Double Layer Architectures for Automatic Speech Recognition using HMM
Marta Casar and Jose A.R. Fonollosa

09 Audio Visual Speech Recognition and Segmentation Based on DBN Models
Dongmei Jiang, Guoyun Lv, Ilse Ravyse, Xiaoyue Jiang, Yanning Zhang, Hichem Sahli and Rongchun Zhao

10 Discrete-Mixture HMMs-based Approach for Noisy Speech Recognition
Tetsuo Kosaka, Masaharu Katoh and Masaki Kohda

11 Speech Recognition in Unknown Noisy Conditions
Ji Ming and Baochun Hou

12 Uncertainty in signal estimation and stochastic weighted Viterbi algorithm: A unified framework to address robustness in speech recognition and speaker verification
Nestor Becerra Yoma, Carlos Molina, Claudio Garreton and Fernando Huenupan

13 The Research of Noise-Robust Speech Recognition Based on Frequency Warping Wavelet
Xueying Zhang and Wenjun Meng

14 Autocorrelation-based Methods for Noise-Robust Speech Recognition
Gholamreza Farahani, Mohammad Ahadi and Mohammad Mehdi

15 Bimodal Emotion Recognition using Speech and Physiological Changes
Jonghwa Kim

16 Emotion Estimation in Speech Using a 3D Emotion Space Concept
Michael Grimm and Kristian Kroschel

17 Linearly Interpolated Hierarchical N-gram Language Models for Speech Recognition Engines
Imed Zitouni and Qiru Zhou

18 A factored language model for prosody dependent speech recognition
Ken Chen, Mark A. Hasegawa-Johnson and Jennifer S. Cole

19 Early decision making in continuous speech
Odette Scharenborg, Louis ten Bosch and Lou Boves

20 Analysis and implementation of an automated delimiter of "Quranic" verses in audio files using speech recognition techniques
Tabbal Hassan, Al-Falou Wassim and Monla Bassem

21 An Improved GA Based Modified Dynamic Neural Network for Cantonese-Digit Speech Recognition
S.H. Ling, F.H.F. Leung, K.F. Leung, H.K. Lam and H.H.C. Iu

22 Talking Robot and the Autonomous Acquisition of Vocalization and Singing Skill
Hideyuki Sawada

23 Conversation system of an everyday robot Robovie-IV
Noriaki Mitsunaga, Zenta Miyashita, Takahiro Miyashita, Hiroshi Ishiguro and Norihiro Hagita

24 Sound Localization of Elevation using Pinnae for Auditory Robots
Tomoko Shimoda, Toru Nakashima, Makoto Kumon, Ryuichi Kohzawa, Ikuro Mizumoto and Zenta Iwai

25 Speech Recognition Under Noise Conditions: Compensation Methods
Angel de la Torre, Jose C. Segura, Carmen Benitez, Javier Ramirez, Luz Garcia and Antonio J. Rubio