Keyword Co-articulatory Boundary Segmentation of the CRM Speech Corpus

Wakefield, Gregory

Keyword Co-articulatory Boundary Segmentation of the CRM Speech Corpus

dc.contributor.author	Wakefield, Gregory
dc.date.accessioned	2014-08-20T14:07:23Z
dc.date.available	2014-08-20T14:07:23Z
dc.date.issued	2014-08-20
dc.identifier.uri	https://hdl.handle.net/2027.42/108223
dc.description.abstract	crmSegmentCallColorDigitNow_v1 contains segmentation data for the CRM speech corpus [R. Bolia, W. Nelson, M. Ericson, and B. Simpson, “A speech corpus for multitalker communications research,” Journal of the Acoustical Society of America, vol. 107, pp. 1065-1066, 2000.] Each line of the file corresponds to one of the corpus utterances and contains the ID of the Talker, the audio filename, the sample frequency, and the start and stop sample indices for the Callsign, Color, Digit, and Now words. For example, the fourth line of the data Talker: 0, Filename: 000003, SampleFreq: 44100, CallStart: 10557, CallStop: 26975, ColorStart: 40317, ColorStop: 48910, DigitStart: 52218, DigitStop: 63916, NowStart: 67223, NowStop: 79767 lists the samples indices for CallStart (10557), CallStop (26975), ..., NowStop (79767) for the file 000003.wav from Talker 0, where the first two elements in the filename indicate Callsign (0-7), the next two elements indicate Color (0-3), and the final two elements indicate Digit (0-7). The results were obtained by pre-segmenting the data using a combination of acoustic features. Fine-tuning of the segmentation was performed by a single listener (the author, G. Wakefield) with particular emphasis placed on minimizing the co-articulatory "leakage" at the segment boundaries. It should be emphasized that these particular segment boundaries, particularly for the word offsets, are often judgment calls and are typically based on whether the following phoneme can be predicted on the basis of the current segmentation.	en_US
dc.description.sponsorship	Office of Naval Research (N00014-10-1-0152, N00014-13-1-0358)	en_US
dc.language.iso	en_US	en_US
dc.subject	Speech Segmentation, CRM Corpus, Coordinate Response Measure Corpus	en_US
dc.title	Keyword Co-articulatory Boundary Segmentation of the CRM Speech Corpus	en_US
dc.type	Dataset	en_US
dc.subject.hlbsecondlevel	Electrical Engineering
dc.subject.hlbsecondlevel	Computer Science
dc.subject.hlbtoplevel	Engineering
dc.contributor.affiliationumcampus	Ann Arbor	en_US
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/108223/1/crmSegmentCallColorDigitNow_v1.txt
dc.description.filedescription	Description of crmSegmentCallColorDigitNow_v1.txt : Dataset with explanatory header
dc.owningcollname	Electrical Engineering and Computer Science, Department of (EECS)

Files in this item

Name:: crmSegmentCallColorDigitNow_v1.txt
Size:: 380.6KB
Format:: Text file
Description:: Dataset with explanatory header

View/Open

Electrical Engineering and Computer Science, Department of (EECS)

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe its collections in a way that respects the people and communities who create, use, and are represented in them. We encourage you to Contact Us anonymously if you encounter harmful or problematic language in catalog records or finding aids. More information about our policies and practices is available at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.