P.S. Gopalakrishnan, D. Kanevsky, et al.
ICASSP 1989
A description is presented of the authors' current research on automatic speech recognition of continuously read sentences from a naturally-occurring corpus: office correspondence. The recognition system combines features from their current isolated-word recognition system and from their previously developed continuous-speech recognition systems. It consists of an acoustic processor, an acoustic channel model, a language model, and a linguistic decoder. Some new features in the recognizer relative to the isolated-word speech recognition system include the use of a fast match to prune rapidly to a manageable number the candidates considered by the detailed match, multiple pronunciations of all function words, and modeling of interphone coarticulatory behavior. The authors recorded training and test data from a set of ten male talkers. The perplexity of the test sentences was found to be 93; none of the sentences was part of the data used to generate the language model. Preliminary (speaker-dependent) recognition results on these talkers yielded an average word error rate of 11.0%.
P.S. Gopalakrishnan, D. Kanevsky, et al.
ICASSP 1989
E. Eide, A. Aaron, et al.
SSW 2004
L.R. Bahl, S. De Gennaro, et al.
ICSLP 1998
Jerome R. Bellegarda, Edward L. Titlebaum
IEEE Transactions on Aerospace and Electronic Systems