Research > Faculty Projects
Human-Like Speech
Principal Investigator
Jeff Bilmes, Katrin Kirchhoff, Dani Byrd (University of Southern California), Shrikanth Narayanan (University of Southern California), Daniel Jurafsky (Stanford), Christopher Manning (Stanford)
Sponsor(s)
Office of Naval Research (ONR)
Award Period
05/01/2005 - 04/30/2009
Abstract
Computer recognition of speech is a crucial application for
the Department of Defense and is a key challenge
application for the scientific and engineering goals of the
nation. The field has made enormous progress in the last
twenty years by applying and extending the Hidden Markov
Model (HMM) paradigm. But our most successful HMM systems
are still too tied to specific domains such as recognizing
carefully pronounced, read, or highly constrained speech.
The HMM paradigm has not extended well to address accented
or highly variable speech, nor has it been able to handle
the crucial problem of recognition of natural
conversational human-to-human speech. We believe the
failure of HMMs on natural conversational speech is due to
fundamental problems in this current paradigm. We propose a
radically new approach to the speech-to-text problem,
motivated by recent psychological results on how humans map
speech to words, and grounded in powerful new statistical
and machine-learning techniques. We propose to replace the
HMM model at every level of speech processing, with an
exciting new approach to acoustic, phonetic, and
pronunciation modeling, a completely new discriminative
model of word recognition, and advances in language
modeling for natural conversational speech. These new model
components are unified into a rich multi-stream
architecture based on new probabilistic models.
Updates or corrections to this page should be sent to gheaton@u.washington.edu.
