Get Free Shipping on orders over $79
Multilingual Phoneme Models for Rapid Speech Processing System Development - Eric G. Hansen

Multilingual Phoneme Models for Rapid Speech Processing System Development

By: Eric G. Hansen

Hardcover | 22 May 2025

At a Glance

Hardcover


$43.95

or 4 interest-free payments of $10.99 with

 or 

Ships in 5 to 7 business days

Current speech recognition systems tend to be developed only for commercially viable languages. The resources needed for a typical speech recognition system include hundreds of hours of transcribed speech for acoustic models and 10 to 100 million words of text for language models; both of these requirements can be costly in time and money. The goal of this research is to facilitate rapid development of speech systems to new languages by using multilingual phoneme models to alleviate requirements for large amounts of transcribed speech. The GlobalPhone database, which contains transcribed speech from 15 languages, is used as source data to derive multilingual phoneme models. Various bootstrapping processes are used to develop an Arabic speech recognition system starting from monolingual English models, International Phonetic Association (IPA) based multilingual models, and data-driven multilingual models. The Kullback-Leibler distortion measure is used to derive datadriven phoneme clusters. It was found that multilingual bootstrapping methods outperform monolingual English bootstrapping methods on the Arabic evaluation data initially, and after three iterations of bootstrapping all systems show similar performance levels. Applications of this research are in speech recognition, word spotting, information retrieval, and speech-to-speech translation.

This work has been selected by scholars as being culturally important, and is part of the knowledge base of civilization as we know it. This work was reproduced from the original artifact, and remains as true to the original work as possible. Therefore, you will see the original copyright references, library stamps (as most of these works have been housed in our most important libraries around the world), and other notations in the work.

This work is in the public domain in the United States of America, and possibly other nations. Within the United States, you may freely copy and distribute this work, as no entity (individual or corporate) has a copyright on the body of the work.

As a reproduction of a historical artifact, this work may contain missing or blurred pages, poor pictures, errant marks, etc. Scholars believe, and we concur, that this work is important enough to be preserved, reproduced, and made generally available to the public. We appreciate your support of the preservation process, and thank you for being an important part of keeping this knowledge alive and relevant.


More in Audio Processing

Handbook of Speech Recognition - Warren Hanna
Mixing Secrets for the Small Studio : 2nd Edition - Mike Senior

RRP $101.00

$85.99

15%
OFF
An Introduction to Music Technology : 2nd edition - Dan Hosken

RRP $154.00

$113.75

26%
OFF
Small Signal Audio Design - Douglas Self
Sound and Music for the Theatre : The Art and Technique of Design - Deena Kaye
Bluetooth LE Audio : Fundamental to Recent Advances - Himanshu Bhalla

RRP $183.00

$162.75

11%
OFF
Mix It! : Understanding and Controlling the Mix Process - Ian Corbett
Collaborative Songwriting : In the Room - Adam Martin
Collaborative Songwriting : In the Room - Adam Martin