State-dependent time warping in the trended hidden Markov model
dc.contributor.author | Sun, D. X. | en_US |
dc.contributor.author | Deng, L. | en_US |
dc.contributor.author | Wu, C. F. J. | en_US |
dc.date.accessioned | 2006-04-10T17:55:44Z | |
dc.date.available | 2006-04-10T17:55:44Z | |
dc.date.issued | 1994-09 | en_US |
dc.identifier.citation | Sun, D. X., Deng, L., Wu, C. F. J. (1994/09)."State-dependent time warping in the trended hidden Markov model." Signal Processing 39(3): 263-275. <http://hdl.handle.net/2027.42/31358> | en_US |
dc.identifier.uri | http://www.sciencedirect.com/science/article/B6V18-48XCYN8-J8/2/932b8ad1501e3c709c3ce76290992de4 | en_US |
dc.identifier.uri | https://hdl.handle.net/2027.42/31358 | |
dc.description.abstract | In this paper we present an algorithm for estimating state-dependent polynomial coefficients in the nonstationary-state hidden Markov model (or the trended HMM) which allows for the flexibility of linear time warping or scaling in individual model states. The need for the state-dependent time warping arises from the consideration that due to speaking rate variation and other temporal factors in speech, multiple state-segmented speech data sequences used for training a single set of polynomial coefficients often vary appreciably in their sequence lengths. The algorithm is developed based on a general framework with use of auxiliary parameters, which, of no interests in themselves, nevertheless provide an intermediate tool for achieving maximal accuracy for estimating the polynomial coefficients in the trended HMM. It is proved that the proposed estimation algorithm converges to a solution equivalent to the state-optimized maximum likelihood estimate. Effectiveness of the algorithm is demonstrated in experiments designed to fit a single trended HMM simultaneously to multiple sequences of speech data which are different renditions of the same word yet vary over a wide range in the sequence length. Speech recognition experiments have been performed based on the standard acoustic-phonetic TIMIT database. The speech recognition results demonstrate the advantages of the time-warping trended HMMs over the regular trended HMMs measured about 10 to 15% improvement in terms of the recognition rate. | en_US |
dc.format.extent | 871451 bytes | |
dc.format.extent | 3118 bytes | |
dc.format.mimetype | application/pdf | |
dc.format.mimetype | text/plain | |
dc.language.iso | en_US | |
dc.publisher | Elsevier | en_US |
dc.title | State-dependent time warping in the trended hidden Markov model | en_US |
dc.type | Article | en_US |
dc.rights.robots | IndexNoFollow | en_US |
dc.subject.hlbsecondlevel | Science (General) | en_US |
dc.subject.hlbsecondlevel | Education | en_US |
dc.subject.hlbtoplevel | Science | en_US |
dc.subject.hlbtoplevel | Social Sciences | en_US |
dc.description.peerreviewed | Peer Reviewed | en_US |
dc.contributor.affiliationum | University of Michigan, Ann Arbor, MI 48109-1027, USA | en_US |
dc.contributor.affiliationother | State University of New York at Stony Brook, NY 11794-3600, USA | en_US |
dc.contributor.affiliationother | Department of Electrical and Computer Engineering, University of Waterloo, Waterloo, Ontario, Canada N2L 3G1 | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/31358/1/0000269.pdf | en_US |
dc.identifier.doi | http://dx.doi.org/10.1016/0165-1684(94)90089-2 | en_US |
dc.identifier.source | Signal Processing | en_US |
dc.owningcollname | Interdisciplinary and Peer-Reviewed |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.