Multimodal Processing and Interaction: Audio, Video and Text presents high quality, state-of-the-art research ideas and results from theoretic, algorithmic and application viewpoints. This edited volume contains both state-of-the-art reviews and original contributions by leading experts in the scientific and technological field of multimedia. It grew out of a four-year collaboration among research groups participating in the European network of Excellence on Multimedia Understanding, Semantics, Computation and Learning (MUSCLE).
Multimodal Processing and Interaction: Audio, Video and Text covers a broad spectrum of novel perspectives, analytic tools, algorithms, design practices and applications in multimedia science and engineering with emphasis on multimodal integration and modality fusion. This volume also contains contributions in the area of interaction with multimedia, especially multimodal interfaces for accessing multimedia content.
Multimodal Processing and Interaction: Audio, Video and Text is designed for a professional audience composed of practitioners and researchers in industry and academia. This book is suitable for advanced-level students in computer science and engineering as well.
|Review of the State-of-the-Art|
|Cross-Modal Integration for Performance Improving in Multimedia: A Review||p. 3|
|Human-Computer Interfaces to Multimedia Content: a Review||p. 49|
|Integrated Multimedia Analysis and Recognition|
|Stochastic Models for Multimodal Video Analysis||p. 91|
|Adaptive Multimodal Fusion by Uncertainty Compensation with Application to Audio-Visual Speech Recognition||p. 111|
|Action Recognition in Multimedia Streams||p. 127|
|Surveillance Using Both Video and Audio||p. 143|
|Movie Analysis with Emphasis to Dialogue and Action Scene Detection||p. 157|
|Audiovisual Attention Modeling and Salient Event Detection||p. 179|
|Toward the Integration of Natural Language Processing and Automatic Speech Recognition: Using Morpho-Syntax and Pragmatics for Transcription||p. 201|
|Searching Multimedia Content|
|Interactive Image Retrieval Using a Hybrid Visual and Conceptual Content Representation||p. 221|
|Multimodal Analysis of Text and Audio Features for Music Information Retrieval||p. 241|
|Intelligent Search for Image Information on the Web through Text and Link Structure Analysis||p. 259|
|Interfaces to Multimedia Content|
|Design Principles for Multimodal Spoken Dialogue Systems||p. 279|
|Eye Tracking: A New Interface for Visual Exploration||p. 297|
|User Interaction for Mobile Devices||p. 311|
|Table of Contents provided by Ingram. All Rights Reserved.|
Series: Multimedia Systems and Applications
Audience: Tertiary; University or College
Number Of Pages: 374
Published: 1st August 2008
Publisher: Springer-Verlag New York Inc.
Country of Publication: US
Dimensions (cm): 23.5 x 15.5 x 2.54
Weight (kg): 0.75