2010 NIST language recognition evaluation : test set

Title
2010 NIST language recognition evaluation : test set / Linguistic Data Consortium.
ISBN
1585637955
Publication
[Philadelphia, PA] : [Linguistic Data Consortium], [2017]
Physical Description
1 online resource
Local Notes
Access is available to the Yale community.
Notes
Authors: Craig Greenberg, Alvin Martin, David Graff, Linda Brandschain, Kevin Walker.
Data source: microphone speech.
Data type: sound.
Applications: speaker identification.
LDC number: LDC2017S06.
Data: The speech recordings in this release were collected in 2009 and 2010 by LDC at its Human Subjects Collection facility in Philadelphia. This collection was part of the Mixer 6 project, which was designed to support the development of robust speaker recognition technology by providing carefully collected and audited speech from a large pool of speakers recorded simultaneously across numerous microphones. The telephone speech segments include two-channel excerpts of approximately 10 seconds and approximately 5 minutes, as well as summed-channel excerpts of approximately 5 minutes. The microphone excerpts are 3 to 15 minutes in duration. As in prior evaluations, intervals of silence were not removed. The data in this release is 8-bit u-law (mu-law) audio sampled at 8000 Hz. In addition to the evaluation data, the package includes answer keys, trial and train files, development data, and evaluation documentation.
Audio in English.
Title from resource home page (LDC website, viewed May 20, 2024).
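The Data note above gives the audio encoding as 8-bit u-law sampled at 8000 Hz. As a rough illustration of what that implies, the Python sketch below expands a single u-law byte to a 16-bit linear sample using the standard G.711 expansion, and estimates duration from a byte count. It assumes raw, headerless sample bytes; this record does not state the file container used in the release, and the function names are hypothetical.

```python
# Minimal sketch, not part of the catalogued release.
# Assumption: input is raw 8-bit u-law sample data with no file header.

SAMPLE_RATE_HZ = 8000  # sample rate stated in the Data note
ULAW_BIAS = 0x84       # bias constant used by G.711 u-law


def ulaw_to_pcm16(byte: int) -> int:
    """Expand one u-law byte (0-255) to a signed 16-bit linear sample."""
    u = ~byte & 0xFF                 # u-law bytes are stored bit-inverted
    sign = u & 0x80                  # top bit marks a negative sample
    exponent = (u >> 4) & 0x07       # 3-bit segment number
    mantissa = u & 0x0F              # 4-bit step within the segment
    magnitude = (((mantissa << 3) + ULAW_BIAS) << exponent) - ULAW_BIAS
    return -magnitude if sign else magnitude


def duration_seconds(num_bytes: int, channels: int = 1) -> float:
    """Duration of raw 8-bit audio: one byte per sample per channel."""
    return num_bytes / (SAMPLE_RATE_HZ * channels)


if __name__ == "__main__":
    print(ulaw_to_pcm16(0xFF))        # u-law encoding of silence
    print(duration_seconds(80_000))   # 80,000 single-channel bytes
```

Under these assumptions, a two-channel excerpt of the same byte length lasts half as long, which is why the `channels` parameter is included.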
Access and use
Access restricted by licensing agreement.
Variant and related titles
NIST language recognition evaluation test set, 2010
National Institute of Standards and Technology language recognition evaluation test set, 2010
Format
Audio / Data Sets / Online
Language
English
Added to Catalog
May 20, 2024
Contents
data/test file (contains the development data)
data/eval file (contains the evaluation data, answer keys, and trial and train files)
docs file (contains the evaluation plan, a readme, and a file table)
Genre/Form
Data sets.
Speech corpora.
Text corpora.
Sound recordings.
Also listed under
Greenberg, Craig, creator.
Linguistic Data Consortium, issuing body.