State-dependent time warping in the trended hidden Markov model

Sun, D. X.; Deng, L.; Wu, C. F. J.

dc.contributor.author	Sun, D. X.	en_US
dc.contributor.author	Deng, L.	en_US
dc.contributor.author	Wu, C. F. J.	en_US
dc.date.accessioned	2006-04-10T17:55:44Z
dc.date.available	2006-04-10T17:55:44Z
dc.date.issued	1994-09	en_US
dc.identifier.citation	Sun, D. X., Deng, L., Wu, C. F. J. (1994/09). "State-dependent time warping in the trended hidden Markov model." Signal Processing 39(3): 263-275. <http://hdl.handle.net/2027.42/31358>	en_US
dc.identifier.uri	http://www.sciencedirect.com/science/article/B6V18-48XCYN8-J8/2/932b8ad1501e3c709c3ce76290992de4	en_US
dc.identifier.uri	https://hdl.handle.net/2027.42/31358
dc.description.abstract	In this paper we present an algorithm for estimating state-dependent polynomial coefficients in the nonstationary-state hidden Markov model (or the trended HMM) which allows for the flexibility of linear time warping or scaling in individual model states. The need for state-dependent time warping arises because, owing to speaking-rate variation and other temporal factors in speech, multiple state-segmented speech data sequences used for training a single set of polynomial coefficients often vary appreciably in their sequence lengths. The algorithm is developed within a general framework that uses auxiliary parameters which, though of no interest in themselves, provide an intermediate tool for achieving maximal accuracy in estimating the polynomial coefficients in the trended HMM. It is proved that the proposed estimation algorithm converges to a solution equivalent to the state-optimized maximum likelihood estimate. The effectiveness of the algorithm is demonstrated in experiments designed to fit a single trended HMM simultaneously to multiple sequences of speech data that are different renditions of the same word yet vary over a wide range in sequence length. Speech recognition experiments were performed on the standard acoustic-phonetic TIMIT database. The speech recognition results demonstrate the advantages of the time-warping trended HMMs over the regular trended HMMs, with an improvement of about 10 to 15% in recognition rate.	en_US
dc.format.extent	871451 bytes
dc.format.extent	3118 bytes
dc.format.mimetype	application/pdf
dc.format.mimetype	text/plain
dc.language.iso	en_US
dc.publisher	Elsevier	en_US
dc.title	State-dependent time warping in the trended hidden Markov model	en_US
dc.type	Article	en_US
dc.rights.robots	IndexNoFollow	en_US
dc.subject.hlbsecondlevel	Science (General)	en_US
dc.subject.hlbsecondlevel	Education	en_US
dc.subject.hlbtoplevel	Science	en_US
dc.subject.hlbtoplevel	Social Sciences	en_US
dc.description.peerreviewed	Peer Reviewed	en_US
dc.contributor.affiliationum	University of Michigan, Ann Arbor, MI 48109-1027, USA	en_US
dc.contributor.affiliationother	State University of New York at Stony Brook, NY 11794-3600, USA	en_US
dc.contributor.affiliationother	Department of Electrical and Computer Engineering, University of Waterloo, Waterloo, Ontario, Canada N2L 3G1	en_US
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/31358/1/0000269.pdf	en_US
dc.identifier.doi	http://dx.doi.org/10.1016/0165-1684(94)90089-2	en_US
dc.identifier.source	Signal Processing	en_US
dc.owningcollname	Interdisciplinary and Peer-Reviewed
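
The abstract's motivation (one set of polynomial coefficients trained on segments of different lengths) can be illustrated with a small sketch. This is not the estimation algorithm from the paper, whose details the abstract does not give. It assumes a fixed linear time scaling of 1/(T - 1) per segment instead of an estimated warping, uses ordinary least squares in place of maximum likelihood, and assumes every segment has at least two frames. The names `solve`, `fit_trend`, and `sse` are hypothetical helpers introduced only for this illustration.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_trend(sequences, degree, warp=True):
    """Pooled least-squares polynomial coefficients over all sequences.

    With warp=True each frame index t is scaled by 1/(T - 1), an assumed
    fixed linear time scaling; with warp=False the raw index is used.
    """
    k = degree + 1
    ata = [[0.0] * k for _ in range(k)]  # accumulated normal-equation matrix
    aty = [0.0] * k                      # accumulated right-hand side
    for seq in sequences:
        scale = 1.0 / (len(seq) - 1) if warp else 1.0
        for t, y in enumerate(seq):
            powers = [(scale * t) ** p for p in range(k)]
            for i in range(k):
                aty[i] += powers[i] * y
                for j in range(k):
                    ata[i][j] += powers[i] * powers[j]
    return solve(ata, aty)


def sse(sequences, coeffs, warp=True):
    """Sum of squared residuals of the polynomial trend over all sequences."""
    total = 0.0
    for seq in sequences:
        scale = 1.0 / (len(seq) - 1) if warp else 1.0
        for t, y in enumerate(seq):
            pred = sum(c * (scale * t) ** p for p, c in enumerate(coeffs))
            total += (y - pred) ** 2
    return total


# Two synthetic "renditions" of the same linear trend 1 + 2u, u in [0, 1],
# sampled with 5 and 9 frames (a stand-in for speaking-rate variation).
fast = [1.0 + 2.0 * t / 4 for t in range(5)]
slow = [1.0 + 2.0 * t / 8 for t in range(9)]
data = [fast, slow]

warped = fit_trend(data, degree=1, warp=True)   # shared coefficients, scaled time
plain = fit_trend(data, degree=1, warp=False)   # shared coefficients, raw frame index
```

On this synthetic data the scaled-time fit recovers the shared trend, while the raw-index fit leaves a clearly nonzero residual because the two segments have different slopes per frame. That mismatch is the kind the abstract cites as the reason for state-dependent time warping.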

Files in this item

Name: 0000269.pdf
Size: 851.0KB
Format: PDF
