Handbook of Multimodal and Spoken Dialogue Systems : Resources, Terminology and Product Evaluation - Dafydd Gibbon

Handbook of Multimodal and Spoken Dialogue Systems

Resources, Terminology and Product Evaluation

By: Dafydd Gibbon (Editor), Inge Mertins (Editor), Roger K. Moore (Editor)

Hardcover Published: 31st August 2000
ISBN: 9780792379041
Number Of Pages: 520

Other Available Editions

  • Paperback, Published: 1st March 2013

Dictation systems, read-aloud software for the blind, speech control of machinery, geographical information systems with speech input and output, and educational software with 'talking head' artificial tutorial agents are already on the market. The field is expanding rapidly, and new methods and applications emerge almost daily. But good sources of systematic information have not kept pace with the body of information needed for development and evaluation of these systems. Much of this information is widely scattered through speech and acoustic engineering, linguistics, phonetics, and experimental psychology.
The Handbook of Multimodal and Spoken Dialogue Systems presents current and developing best practice in resource creation for speech input/output software and hardware. This volume brings experts in these fields together to give detailed 'how to' information and recommendations on planning spoken dialogue systems, designing and evaluating audiovisual and multimodal systems, and evaluating consumer off-the-shelf products.
In addition to standard terminology in the field, the following topics are covered in depth:

  • How to collect high quality data for designing, training, and evaluating multimodal and speech dialogue systems;
  • How to evaluate real-life computer systems with speech input and output;
  • How to describe and model human-computer dialogue precisely and in depth.
Also included:
  • A fully searchable CD-ROM containing a hypertext version of the book in HTML format for fast look-up of specific points, convenient desktop use, and lightweight mobile reference; and
  • The first systematic medium-scale compendium of terminology with definitions.
This handbook has been especially designed for the needs of development engineers, decision-makers, researchers, and advanced level students in the fields of speech technology, multimodal interfaces, multimedia, computational linguistics, and phonetics.

Editorial Preface
Representation and annotation of dialogue
Introduction
Goals
What is meant by 'Integrated Resources'?
Limitations
A preliminary classification of dialogue corpora
Dialogue acts
Towards a dialogue typology
General coding issues
Orthography
Orthographic representation
Recommendations
Morphosyntax
Morphosyntactic (POS) annotation
Recommendations
Syntax
Syntactic annotation
Recommendations
Prosody
Prosodic annotation
Recommendations
Pragmatics
Pragmatic annotation: functional dialogue annotation
Recommendations
TEI paralinguistic features
TEI P3 DTD: base tag set for transcribed speech
A few relevant web links
Specimen Annotated Dialogue
Orthographic Transcription
Morphosyntactic annotation
Syntactic annotation
Prosodic Annotation
Pragmatic (Dialogue Act) Annotation
Combined Multi-level Annotation
Morphosyntactic annotation of corpora
English tagset
Italian DMI codes
Audio-visual and multimodal speech-based systems
Introduction
Terminology
Chapter outline
Benefits of multimodal systems
Input modalities associated with speech
Output modalities associated with speech
Taxonomies of multimodal applications
Survey of multimodal systems
Evaluation of multimodal systems
Types of evaluation
Evaluation methodologies
Specific evaluation issues
Recommendations
Speech input with facial information (audio-visual speech recognition)
Face recognition
Locating and tracking of other facial features
Automatic lipreading systems
Integration of audio and visual signals
Speech output with talking heads
Control techniques
Lip shape computation
Talking heads: audio and video output synchronisation
Speech input with modalities other than faces
Recognition of non-speech input modalities
Integration in multimodal applications
Speech output in multimedia systems
Taxonomy of output modalities
Output devices
Theoretical issues
Summary of recommendations
Technology of multimodal system components
Techniques related to face recognition systems
Synthesis module
Facial models
Building conversational agents
On-line character and handwriting recognition
Gesture recognition
Technical issues
Standards and resources for multimodal/multimedia systems
Standards and resources for monomodal processing
Towards standards for multimedia systems
Towards standards for hypermedia systems
Architectures and toolkits for multimodal integration
Notational systems
Face and audio databases
Consumer off-the-shelf (COTS) product and service evaluation
Introduction
Purpose and scope of this chapter
Introduction to speech technologies and classification
Automatic speech recognition
Text-to-speech and speech synthesis
Speaker recognition and verification
Speech understanding
Dialogue control
General remarks
Assessment methodology
Subjective assessment measures
Acoustic environment
Comparing several systems
Command and control systems
Typical systems
Typical issues
Evaluation design
Examples
Document generation
Typical systems
Typical issues
Evaluation design
Examples
Services and telephone applications
Typical systems
Typical issues
Evaluation design
Examples
Conclusion and summary of recommendations
Terminology for spoken language systems
Introduction
Terminology standards
Termbank users
Chapter outline
Terminological basics
Central notions in terminological theory
Relations between terms
The organisation of terminology
The onomasiological and semasiological perspectives
Terminological macrostructures and microstructures
Spoken Language terminology
The hybrid character of SL terminology
Toward a microstructure for SL terminology
Recommendations on termbank development
Recommendations for further reading
Relational databases
Components of a relational database
Structures in the relational model
Codd's definition of a relational database system
Query language
Software implementations
Distribution of data generation over time
Distribution of data generation over resources
Required system components
Terminology Management Systems (TMSs), databases, and interchange formats
MultiTerm
ITU Telecommunication Terminology Database: Termite
Termium - Canadian Linguistic Data Bank
Eurodicautom
Martif terminology interchange format (ISO 12200)
The Eaglet Term Database: an SL termbank
A hypergraph-based approach
Conceptual parts
Information storage
System components
Structure
Eaglet macrostructure for SL terminology
Eaglet microstructure for SL terminology
Using the Eaglet Term Database
Future work
Reference materials
Introduction
Organisations and infrastructure
Speech resources, agencies, and associations
Archives, general information
Education and conferences
"SLP at Work"
Speech interfaces
Telecommunications and broadcast
New services
SLP as a research tool
SLP procedures, tools, and formats
Annotation
Validation, evaluation
Tools and standards
Text
Technology
Alphabets
Networks
File formats
Programming
Storage
Bibliographical references
Sampa and X-Sampa phonetic symbols
The Eaglet term database
Introduction
Eaglet termbank (abridged)
List of abbreviations
Index
CD-ROM disclaimer
Table of Contents provided by Syndetics. All Rights Reserved.

ISBN: 9780792379041
ISBN-10: 0792379047
Series: Kluwer International Series in Engineering & Computer Science
Audience: General
Format: Hardcover
Language: English
Number Of Pages: 520
Published: 31st August 2000
Country of Publication: US
Dimensions (cm): 24.79 x 15.65  x 3.56
Weight (kg): 1.01
