Mixer 3 speech

Title
Mixer 3 speech / Linguistic Data Consortium.
Publication
[Philadelphia, PA] : [Linguistic Data Consortium], [2023]
Physical Description
1 online resource
Local Notes
Access is available to the Yale community.
Notes
Authors: Shudong Huang, Kevin Walker, David Graff.
Data source: telephone conversations.
Data type: sound.
Applications: language identification, speaker identification.
LDC number: LDC2023S02.
Audio in Mandarin English, Mandarin Chinese, Min Nan Chinese, Amharic, Bengali, Persian, Hindi, Italian, Japanese, Georgian, Khmer, Korean, Lao, Panjabi, Western Panjabi, Russian, Spanish, Tamil, Tagalog, Thai, Tigrinya, Urdu, Uzbek, Vietnamese, and Wu Chinese.
Title from resource home page (LDC website, viewed May 15, 2023).
Access and use
Access restricted by licensing agreement.
Summary
"Mixer 3 Speech was developed by the Linguistic Data Consortium (LDC) and comprises 3,200 hours of audio recordings of conversational telephone speech involving 3,875 speakers and 26 distinct languages. This material was collected by LDC from 2005-2007 as part of the Mixer project, and recordings in this corpus were used in NIST Speaker Recognition Evaluation (SRE) and NIST Language Recognition Evaluation (LRE) corpora, including 2006 SRE and 2007 LRE. Data. The audio recordings were generated using LDC's computer telephony system capable of collecting speech from the telephone network. Recruited speakers were connected through a robot operator to carry on casual conversations lasting up to 10 minutes. Subjects fluent in languages other than English were asked to complete at least one non-English call. The documentation for this release contains information about the number of calls per subject and the number of calls per language. It also includes certain speaker demographic information, such as date of birth, level of education, native language, other language capability, place of birth, place of residence and occupation. The Mixer 3 collection contains 19,595 telephone recordings. The raw digital audio content for each call side was captured as a separate channel, then merged to be presented as a 2-channel file; the files are formatted as 8kHz, 8000 samples/second, u-law encoded NIST SPHERE files."--LDC online catalog.
Variant and related titles
Mixer three speech
Format
Audio / Data Sets / Online
Language
Multiple languages; English; Amharic; Bengali; Persian; Hindi; Italian; Japanese; Georgian; Khmer; Korean; Lao; Panjabi; Russian; Spanish; Tamil; Tagalog; Thai; Tigrinya; Urdu; Uzbek; Vietnamese
Added to Catalog
May 15, 2023
Genre/Form
Data sets.
Speech corpora.
Text corpora.
Sound recordings.
Also listed under
Huang, Shudong, creator.
Linguistic Data Consortium, issuing body.