Books+ Search Results

RATS language identification

Title
RATS language identification / Linguistic Data Consortium.
ISBN
1585638528
9781585638529
Publication
[Philadelphia, Pennsylvania] : Linguistic Data Consortium, 2017.
Physical Description
1 removable computer hard disc ; 8 x 12 x 1 cm + 1 USB connector
Local Notes
Access is available to the Yale community.
Notes
"LDC2017S20"
Applications: keyword spotting.
Authors: David Graff, Xiaoyi Ma, Stephanie Strassel, Kevin Walker, Karen Jones.
Data source: telephone conversations.
Data type: sound, text.
In South Levantine Arabic, North Levantine Arabic, Persian.
Title from index page.
Access and use
Access restricted by licensing agreement.
Summary
"Comprised of approximately 5,400 hours of Levantine Arabic, Farsi, Dari, Pashto and Urdu conversational telephone speech with annotation of speech segments. The corpus was created to provide training, development and initial test sets for the Language Identification (LID) task in the DARPA RATS (Robust Automatic Transcription of Speech) program"--Index.htm.
Variant and related titles
Robust Automatic Transcription of Speech language identification
Format
Audio / Data Sets
Language
Arabic; Persian; Pushto; Urdu
Added to Catalog
August 14, 2019
Genre/Form
Data sets.
Sound recordings.
Speech corpora.
Text corpora.
Also listed under
Linguistic Data Consortium, issuing body.
Citation

Available from:

Loading holdings.
Unable to load. Retry?
Loading holdings...
Unable to load. Retry?