
Spoken Language Processing
Xuedong Huang, Alex Acero, Hsiao-Wuen Hon
Summary
- In-depth coverage of speech processing, speech recognition, speech synthesis, spoken language understanding, and speech interface design
- Many case studies from state-of-the-art systems, including examples from Microsoft's advanced research labs
Spoken Language Processing draws on the latest advances and techniques from multiple fields: computer science, electrical engineering, acoustics, linguistics, mathematics, psychology, and beyond. Starting with the fundamentals, it presents all this and more:
- Essential background on speech production and perception, probability and information theory, and pattern recognition
- Extracting information from the speech signal: useful representations and practical compression solutions
- Modern speech recognition techniques: hidden Markov models, acoustic and language modeling, robustness to environmental noise, search algorithms, and large-vocabulary speech recognition
- Text-to-speech: analyzing documents, pitch and duration controls, trainable synthesis, and more
- Spoken language understanding: dialog management, spoken language applications, and multimodal interfaces
To illustrate the book's methods, the authors present detailed case studies based on state-of-the-art systems, including Microsoft's Whisper speech recognizer, Whistler text-to-speech system, Dr. Who dialog system, and the MiPad handheld device. Whether you're planning, designing, building, or purchasing spoken language technology, this is the state of the art—from algorithms through business productivity.
(NOTE: Each chapter ends with Historical Perspective and Further Reading.)
1. Introduction.
I. FUNDAMENTAL THEORY.
2. Spoken Language Structure.
3. Probability, Statistics, and Information Theory.
4. Pattern Recognition.
II. SPEECH PROCESSING.
5. Digital Signal Processing.
6. Speech Signal Representations.
7. Speech Coding.
III. SPEECH RECOGNITION.
8. Hidden Markov Models.
9. Acoustic Modeling.
10. Environmental Robustness.
11. Language Modeling.
12. Basic Search Algorithms.
13. Large-Vocabulary Search Algorithms.
IV. TEXT-TO-SPEECH SYSTEMS.
14. Text and Phonetic Analysis.
15. Prosody.
16. Speech Synthesis.
V. SPOKEN LANGUAGE SYSTEMS.
17. Spoken Language Understanding.
18. Applications and User Interfaces.
Index.
Technical specifications
PRINT | |
Publisher(s) | Prentice Hall |
Author(s) | Xuedong Huang, Alex Acero, Hsiao-Wuen Hon |
Publication date | 1 June 2001 |
Pages | 980 |
Format | 18 x 24 cm |
Cover | Hardcover |
Weight | 1616 g |
Interior | Black and white |
EAN13 | 9780130226167 |