Multimodal Signal Processing

Theory and applications for human-computer interaction

Multimodal Signal Processing on ScienceDirect(Opens new window)
Hardbound, 352 Pages
Published: NOV-2009
ISBN 13: 978-0-12-374825-6
Imprint: ACADEMIC PRESS


Edited by
Jean-Philippe Thiran, EPFL, Lausanne, Switzerland
Ferran Marqués, Technical University of Catalonia, Spain
Hervé Bourlard, Director, IDIAP Research Institute, EPFL, Lausanne, Switzerland

Description
  • Presents state-of-art methods for multimodal signal processing, analysis, and modeling
  • Contains numerous examples of systems with different modalities combined
  • Describes advanced applications in multimodal Human-Computer Interaction (HCI) as well as in computer-based analysis and modelling of multimodal human-human communication scenes.


Multimodal signal processing is an important research and development field that processes signals and combines information from a variety of modalities – speech, vision, language, text – which significantly enhance the understanding, modelling, and performance of human-computer interaction devices or systems enhancing human-human communication. The overarching theme of this book is the application of signal processing and statistical machine learning techniques to problems arising in this multi-disciplinary field. It describes the capabilities and limitations of current technologies, and discusses the technical challenges that must be overcome to develop efficient and user-friendly multimodal interactive systems.



With contributions from the leading experts in the field, the present book should serve as a reference in multimodal signal processing for signal processing researchers, graduate students, R&D engineers, and computer engineers who are interested in this emerging field.



Audience:
Signal, acoustic, speech, image and video processing university (applied) researchers, R&D engineers, computer engineers


 
Last update: 6 Nov 2011