Books+ Search Results

RATS speaker identification

Title
RATS speaker identification / Linguistic Data Consortium.
ISBN
1585639745
Publication
[Philadelphia, PA] : [Linguistic Data Consortium], [2021]
Physical Description
1 online resource
Local Notes
Access is available to the Yale community.
Notes
Authors: David Graff, Xiaoyi Ma, Stephanie Strassel, Kevin Walker, Karen Jones.
Data source: telephone conversations.
Data type: software, text.
Applications: speaker identification.
LDC number: LDC2021S08.
Audio in Levantine Arabic, Farsi, Dari, Pashto and Urdu.
Title from resource home page (LDC website, viewed July 25, 2022).
Access and use
Access restricted by licensing agreement.
Summary
"The source audio consists of conversational telephone speech recordings collected by LDC specifically for the RATS program from Levantine Arabic, Pashto, Urdu, Farsi and Dari native speakers. Annotations on the audio files include start time, end time, speech activity detection (SAD) label, SAD provenance, speaker ID, speaker ID provenance, language ID, and language ID provenance. The data is divided into training and development sets, each containing their own audio and annotation subdirectories.All audio files are presented as single-channel, 16-bit PCM, 16000 samples per second; lossless FLAC compression is used on all files. When uncompressed, the files have typical "MS-WAV" (RIFF) file headers.Annotation files are presented as tab-delimited, UTF-8 encoded, plain text."--LDC online catalog.
Format
Audio / Data Sets / Online
Language
Multiple languages; Arabic; Urdu; Persian
Added to Catalog
July 25, 2022
Genre/Form
Data sets.
Speech corpora.
Text corpora.
Sound recordings.
Also listed under
Graff, David, creator.
Linguistic Data Consortium, issuing body.
Citation

Available from:

Loading holdings.
Unable to load. Retry?
Loading holdings...
Unable to load. Retry?