Electrical Engineering

Research > Faculty Projects

Human-Like Speech

Principal Investigator
Jeff Bilmes, Katrin Kirchhoff, Dani Byrd (University of Southern California), Shrikanth Narayanan (University of Southern California), Daniel Jurafsky (Stanford), Christopher Manning (Stanford)

Sponsor(s)
Office of Naval Research (ONR)

Award Period
05/01/2005 - 04/30/2009

Abstract
Computer recognition of speech is a crucial application for the Department of Defense and is a key challenge application for the scientific and engineering goals of the nation. The field has made enormous progress in the last twenty years by applying and extending the Hidden Markov Model (HMM) paradigm. But our most successful HMM systems are still too tied to specific domains such as recognizing carefully pronounced, read, or highly constrained speech. The HMM paradigm has not extended well to address accented or highly variable speech, nor has it been able to handle the crucial problem of recognition of natural conversational human-to-human speech. We believe the failure of HMMs on natural conversational speech is due to fundamental problems in this current paradigm. We propose a radically new approach to the speech-to-text problem, motivated by recent psychological results on how humans map speech to words, and grounded in powerful new statistical and machine-learning techniques. We propose to replace the HMM model at every level of speech processing, with an exciting new approach to acoustic, phonetic, and pronunciation modeling, a completely new discriminative model of word recognition, and advances in language modeling for natural conversational speech. These new model components are unified into a rich multi-stream architecture based on new probabilistic models.

Updates or corrections to this page should be sent to gheaton@u.washington.edu.