Loading...

Speech in Mobile and Pervasive Environments

ISBN: 978-1-119-96688-3

January 2012

352 pages

Description
This book provides a cross-disciplinary reference to speech in mobile and pervasive environments

Speech in Mobile and Pervasive Environments  addresses the issues related to speech processing on resource-constrained mobile devices. These include speech recognition in noisy environments, specialised hardware for speech recognition and synthesis, the use of context to enhance recognition and user experience, and the emerging software standards required for interoperability.  This book takes a multi-disciplinary look at these matters, while offering an insight into the opportunities and challenges of speech processing in mobile environs. In developing regions, speech-on-mobile is set to play a momentous role, socially and economically; the authors discuss how voice-based solutions and applications offer a compelling and natural solution in this setting.

Key Features

  • Provides a holistic overview of all speech technology related topics in the context of mobility
  • Brings together the latest research in a logically connected way in a single volume
  • Covers hardware, embedded recognition and synthesis, distributed speech recognition, software technologies, contextual interfaces
  • Discusses multimodal dialogue systems and their evaluation
  • Introduces speech in mobile and pervasive environments for developing regions

This book provides a comprehensive overview for beginners and experts alike. It can be used as a textbook for advanced undergraduate and postgraduate students in electrical engineering and computer science. Students, practitioners or researchers in the areas of mobile computing, speech processing, voice applications, human-computer interfaces, and information and communication technologies will also find this reference insightful. For experts in the above domains, this book complements their strengths. In addition, the book will serve as a guide to practitioners working in telecom-related industries.

About the Author

Mr Nitendra Rajput, IBM Research, New Delhi, India
Nitendra Rajput is a Research Staff Member with IBM India Reseach Lab (IRL) in New Delhi since 1998. Prior to this, he finished his Masters from Indian Institute of Technology, Bombay in Communications. At IRL, he has been working in the field of conversational systems for the last nine years. He has worked on Audio Visual Speech recognition, speech recognition systems for Indian languages. His interests are in statistical signal processing, dialog management, speech and image processing.

Mr Amit A. Nanavati, IBM Research, New Delhi, India
Amit A. Nanavati is a Research Staff Member, working in the Telecom Research Innovation Centre at IBM India Research Lab. For the last four years, he has been actively working in the area of mobile and pervasive computing. He was involved with the MDAT (Multi-device Authoring Technology) project (now a product) for adapting applications to pervasive devices. His research interests include information retrieval and constructing models for evaluation. Prior to joining IBM, he was working with Netscape for 4 years.