Keyword Co-articulatory Boundary Segmentation of the CRM Speech Corpus
dc.contributor.author | Wakefield, Gregory | |
dc.date.accessioned | 2014-08-20T14:07:23Z | |
dc.date.available | 2014-08-20T14:07:23Z | |
dc.date.issued | 2014-08-20 | |
dc.identifier.uri | https://hdl.handle.net/2027.42/108223 | |
dc.description.abstract | crmSegmentCallColorDigitNow_v1 contains segmentation data for the CRM speech corpus [R. Bolia, W. Nelson, M. Ericson, and B. Simpson, “A speech corpus for multitalker communications research,” Journal of the Acoustical Society of America, vol. 107, pp. 1065-1066, 2000.] Each line of the file corresponds to one of the corpus utterances and contains the ID of the Talker, the audio filename, the sample frequency, and the start and stop sample indices for the Callsign, Color, Digit, and Now words. For example, the fourth line of the data Talker: 0, Filename: 000003, SampleFreq: 44100, CallStart: 10557, CallStop: 26975, ColorStart: 40317, ColorStop: 48910, DigitStart: 52218, DigitStop: 63916, NowStart: 67223, NowStop: 79767 lists the samples indices for CallStart (10557), CallStop (26975), ..., NowStop (79767) for the file 000003.wav from Talker 0, where the first two elements in the filename indicate Callsign (0-7), the next two elements indicate Color (0-3), and the final two elements indicate Digit (0-7). The results were obtained by pre-segmenting the data using a combination of acoustic features. Fine-tuning of the segmentation was performed by a single listener (the author, G. Wakefield) with particular emphasis placed on minimizing the co-articulatory "leakage" at the segment boundaries. It should be emphasized that these particular segment boundaries, particularly for the word offsets, are often judgment calls and are typically based on whether the following phoneme can be predicted on the basis of the current segmentation. | en_US |
dc.description.sponsorship | Office of Naval Research (N00014-10-1-0152, N00014-13-1-0358) | en_US |
dc.language.iso | en_US | en_US |
dc.subject | Speech Segmentation, CRM Corpus, Coordinate Response Measure Corpus | en_US |
dc.title | Keyword Co-articulatory Boundary Segmentation of the CRM Speech Corpus | en_US |
dc.type | Dataset | en_US |
dc.subject.hlbsecondlevel | Electrical Engineering | |
dc.subject.hlbsecondlevel | Computer Science | |
dc.subject.hlbtoplevel | Engineering | |
dc.contributor.affiliationumcampus | Ann Arbor | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/108223/1/crmSegmentCallColorDigitNow_v1.txt | |
dc.description.filedescription | Description of crmSegmentCallColorDigitNow_v1.txt : Dataset with explanatory header | |
dc.owningcollname | Electrical Engineering and Computer Science, Department of (EECS) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.