Mixer 3 speech

Advanced Search

Basic Search

Help

AND OR NOT

Add a row

Reset

Limit results by

Books+ Search Results

Title

Mixer 3 speech / Linguistic Data Consortium.

Publication

[Philadelphia, PA] : [Linguistic Data Consortium], [2023]

Physical Description

1 online resource

Local Notes

Access is available to the Yale community.

Notes

Authors: Shudong Huang, Kevin Walker, David Graff.

Data source: telephone conversations.

Data type: sound.

Applications: language identification, speaker identification

LDC number: LDC2023S02.

Audio in Mandarin English, Mandarin Chinese, Min Nan Chinese, Amharic, Bengali, Persian, Hindi, Italian, Japanese, Georgian, Khmer, Korean, Lao, Panjabi, Western Panjabi, Russian, Spanish, Tamil, Tagalog, Thai, Tigrinya, Urdu, Uzbek, Vietnamese, and Wu Chinese

Title from resource home page (LDC website, viewed May 15, 2023).

Access and use

Access restricted by licensing agreement.

Summary

"Mixer 3 Speech was developed by the Linguistic Data Consortium (LDC) and comprises 3,200 hours of audio recordings of conversational telephone speech involving 3,875 speakers and 26 distinct languages. This material was collected by LDC from 2005-2007 as part of the Mixer project, and recordings in this corpus were used in NIST Speaker Recognition Evaluation (SRE) and NIST Language Recognition Evaluation (LRE) corpora, including 2006 SRE and 2007 LRE. Data. The audio recordings were generated using LDC's computer telephony system capable of collecting speech from the telephone network. Recruited speakers were connected through a robot operator to carry on casual conversations lasting up to 10 minutes. Subjects fluent in languages other than English were asked to complete at least one non-English call. The documentation for this release contains information about the number of calls per subject and the number of calls per language. It also includes certain speaker demographic information, such as date of birth, level of education, native language, other language capability, place of birth, place of residence and occupation. The Mixer 3 collection contains 19,595 telephone recordings. The raw digital audio content for each call side was captured as a separate channel, then merged to be presented as a 2-channel file; the files are formatted as 8kHz, 8000 samples/second, u-law encoded NIST SPHERE files."--LDC online catalog.

Variant and related titles

Mixer three speech

Format

Audio / Data Sets / Online

Language

Multiple languages; English; Amharic; Bengali; Persian; Hindi; Italian; Japanese; Georgian; Khmer; Korean; Lao; Panjabi; Russian; Spanish; Tamil; Tagalog; Thai; Tigrinya; Urdu; Uzbek; Vietnamese

Added to Catalog

May 15, 2023

Subjects

Language and languages > Data processing.

Language and languages > Discourse analysis.

Conversation > Data processing.

Telephone calls > Data processing.

Speech perception > Data processing.

Automatic speech recognition.

Genre/Form

Data sets.

Speech corpora.