Show simple item record

State-dependent time warping in the trended hidden Markov model

dc.contributor.authorSun, D. X.en_US
dc.contributor.authorDeng, L.en_US
dc.contributor.authorWu, C. F. J.en_US
dc.date.accessioned2006-04-10T17:55:44Z
dc.date.available2006-04-10T17:55:44Z
dc.date.issued1994-09en_US
dc.identifier.citationSun, D. X., Deng, L., Wu, C. F. J. (1994/09)."State-dependent time warping in the trended hidden Markov model." Signal Processing 39(3): 263-275. <http://hdl.handle.net/2027.42/31358>en_US
dc.identifier.urihttp://www.sciencedirect.com/science/article/B6V18-48XCYN8-J8/2/932b8ad1501e3c709c3ce76290992de4en_US
dc.identifier.urihttps://hdl.handle.net/2027.42/31358
dc.description.abstractIn this paper we present an algorithm for estimating state-dependent polynomial coefficients in the nonstationary-state hidden Markov model (or the trended HMM) which allows for the flexibility of linear time warping or scaling in individual model states. The need for the state-dependent time warping arises from the consideration that due to speaking rate variation and other temporal factors in speech, multiple state-segmented speech data sequences used for training a single set of polynomial coefficients often vary appreciably in their sequence lengths. The algorithm is developed based on a general framework with use of auxiliary parameters, which, of no interests in themselves, nevertheless provide an intermediate tool for achieving maximal accuracy for estimating the polynomial coefficients in the trended HMM. It is proved that the proposed estimation algorithm converges to a solution equivalent to the state-optimized maximum likelihood estimate. Effectiveness of the algorithm is demonstrated in experiments designed to fit a single trended HMM simultaneously to multiple sequences of speech data which are different renditions of the same word yet vary over a wide range in the sequence length. Speech recognition experiments have been performed based on the standard acoustic-phonetic TIMIT database. The speech recognition results demonstrate the advantages of the time-warping trended HMMs over the regular trended HMMs measured about 10 to 15% improvement in terms of the recognition rate.en_US
dc.format.extent871451 bytes
dc.format.extent3118 bytes
dc.format.mimetypeapplication/pdf
dc.format.mimetypetext/plain
dc.language.isoen_US
dc.publisherElsevieren_US
dc.titleState-dependent time warping in the trended hidden Markov modelen_US
dc.typeArticleen_US
dc.rights.robotsIndexNoFollowen_US
dc.subject.hlbsecondlevelScience (General)en_US
dc.subject.hlbsecondlevelEducationen_US
dc.subject.hlbtoplevelScienceen_US
dc.subject.hlbtoplevelSocial Sciencesen_US
dc.description.peerreviewedPeer Revieweden_US
dc.contributor.affiliationumUniversity of Michigan, Ann Arbor, MI 48109-1027, USAen_US
dc.contributor.affiliationotherState University of New York at Stony Brook, NY 11794-3600, USAen_US
dc.contributor.affiliationotherDepartment of Electrical and Computer Engineering, University of Waterloo, Waterloo, Ontario, Canada N2L 3G1en_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/31358/1/0000269.pdfen_US
dc.identifier.doihttp://dx.doi.org/10.1016/0165-1684(94)90089-2en_US
dc.identifier.sourceSignal Processingen_US
dc.owningcollnameInterdisciplinary and Peer-Reviewed


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.