Show simple item record

Keyword Co-articulatory Boundary Segmentation of the CRM Speech Corpus

dc.contributor.authorWakefield, Gregory
dc.date.accessioned2014-08-20T14:07:23Z
dc.date.available2014-08-20T14:07:23Z
dc.date.issued2014-08-20
dc.identifier.urihttps://hdl.handle.net/2027.42/108223
dc.description.abstractcrmSegmentCallColorDigitNow_v1 contains segmentation data for the CRM speech corpus [R. Bolia, W. Nelson, M. Ericson, and B. Simpson, “A speech corpus for multitalker communications research,” Journal of the Acoustical Society of America, vol. 107, pp. 1065-1066, 2000.] Each line of the file corresponds to one of the corpus utterances and contains the ID of the Talker, the audio filename, the sample frequency, and the start and stop sample indices for the Callsign, Color, Digit, and Now words. For example, the fourth line of the data Talker: 0, Filename: 000003, SampleFreq: 44100, CallStart: 10557, CallStop: 26975, ColorStart: 40317, ColorStop: 48910, DigitStart: 52218, DigitStop: 63916, NowStart: 67223, NowStop: 79767 lists the samples indices for CallStart (10557), CallStop (26975), ..., NowStop (79767) for the file 000003.wav from Talker 0, where the first two elements in the filename indicate Callsign (0-7), the next two elements indicate Color (0-3), and the final two elements indicate Digit (0-7). The results were obtained by pre-segmenting the data using a combination of acoustic features. Fine-tuning of the segmentation was performed by a single listener (the author, G. Wakefield) with particular emphasis placed on minimizing the co-articulatory "leakage" at the segment boundaries. It should be emphasized that these particular segment boundaries, particularly for the word offsets, are often judgment calls and are typically based on whether the following phoneme can be predicted on the basis of the current segmentation.en_US
dc.description.sponsorshipOffice of Naval Research (N00014-10-1-0152, N00014-13-1-0358)en_US
dc.language.isoen_USen_US
dc.subjectSpeech Segmentation, CRM Corpus, Coordinate Response Measure Corpusen_US
dc.titleKeyword Co-articulatory Boundary Segmentation of the CRM Speech Corpusen_US
dc.typeDataseten_US
dc.subject.hlbsecondlevelElectrical Engineering
dc.subject.hlbsecondlevelComputer Science
dc.subject.hlbtoplevelEngineering
dc.contributor.affiliationumcampusAnn Arboren_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/108223/1/crmSegmentCallColorDigitNow_v1.txt
dc.description.filedescriptionDescription of crmSegmentCallColorDigitNow_v1.txt : Dataset with explanatory header
dc.owningcollnameElectrical Engineering and Computer Science, Department of (EECS)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.