State-dependent time warping in the trended hidden Markov model

Sun, D. X.; Deng, L.; Wu, C. F. J.

State-dependent time warping in the trended hidden Markov model

Sun, D. X.; Deng, L.; Wu, C. F. J.

1994-09

View/Open

0000269.pdf

(851KB

PDF)

Citation

Sun, D. X., Deng, L., Wu, C. F. J. (1994/09)."State-dependent time warping in the trended hidden Markov model." Signal Processing 39(3): 263-275. <http://hdl.handle.net/2027.42/31358>

Abstract

In this paper we present an algorithm for estimating state-dependent polynomial coefficients in the nonstationary-state hidden Markov model (or the trended HMM) which allows for the flexibility of linear time warping or scaling in individual model states. The need for the state-dependent time warping arises from the consideration that due to speaking rate variation and other temporal factors in speech, multiple state-segmented speech data sequences used for training a single set of polynomial coefficients often vary appreciably in their sequence lengths. The algorithm is developed based on a general framework with use of auxiliary parameters, which, of no interests in themselves, nevertheless provide an intermediate tool for achieving maximal accuracy for estimating the polynomial coefficients in the trended HMM. It is proved that the proposed estimation algorithm converges to a solution equivalent to the state-optimized maximum likelihood estimate. Effectiveness of the algorithm is demonstrated in experiments designed to fit a single trended HMM simultaneously to multiple sequences of speech data which are different renditions of the same word yet vary over a wide range in the sequence length. Speech recognition experiments have been performed based on the standard acoustic-phonetic TIMIT database. The speech recognition results demonstrate the advantages of the time-warping trended HMMs over the regular trended HMMs measured about 10 to 15% improvement in terms of the recognition rate.

Publisher

Elsevier

Other DOIs

http://dx.doi.org/10.1016/0165-1684(94)90089-2

Types

Article

Handle

https://hdl.handle.net/2027.42/31358

URI

http://www.sciencedirect.com/science/article/B6V18-48XCYN8-J8/2/932b8ad1501e3c709c3ce76290992de4

Metadata

Show full item record

Collections

Interdisciplinary and Peer-Reviewed

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.