Authors: Craig Greenberg, Alvin Martin, David Graff, Linda Brandschain, Kevin Walker.
Data source: microphone speech.
Data type: sound.
Applications: speaker identification.
LDC number: LDC2017S06.
Data: The speech recordings in this release were collected in 2009 and 2010 by LDC at its Human Subjects Collection facility in Philadelphia. This collection was part of the Mixer 6 project, which was designed to support the development of robust speaker recognition technology by providing carefully collected and audited speech from a large pool of speakers recorded simultaneously across numerous microphones. The telephone speech segments include two-channel excerpts of approximately 10 seconds and 5 minutes. There are also summed-channel excerpts in the range of 5 minutes. The microphone excerpts are 3-15 minutes in duration. As in prior evaluations, intervals of silence were not removed. The data included in this release is 8 bit ulaw with a sample rate of 8000. In addition to evaluation data, this package also consists of answer keys, trial and train files, development data and evaluation documentation.
Audio in English.
Title from resource home page (LDC website, viewed May 20, 2024).