GALE phase 4 Chinese broadcast news : speech

Title
GALE phase 4 Chinese broadcast news : speech / Linguistic Data Consortium.
ISBN
1585638269
9781585638260
Publication
[Philadelphia, PA] : Linguistic Data Consortium, [2017]
Physical Description
2 DVD-ROMs ; 4 3/4 in.
Local Notes
Access is available to the Yale community.
Notes
Title from disc label.
Authors: Kevin Walker, Christopher Caruso, Kazuaki Maeda, Denise DiPersio, Stephanie Strassel.
Data type(s): sound.
Data source(s): broadcast news.
Application(s): speech recognition.
"LDC2017S25"--Disc surface.
In Mandarin Chinese and Chinese.
Access and use
Access restricted by licensing agreement.
Summary
"GALE Phase 4 Chinese Broadcast News Speech was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 134 hours of Mandarin Chinese broadcast news speech collected in 2008 by LDC and Hong Kong University of Science and Technology (HKUST), Hong Kong, during Phase 4 of the DARPA GALE (Global Autonomous Language Exploitation) Program... The broadcast news recordings in this release feature news broadcasts focusing principally on current events from the following sources: China Central TV (CCTV), a national and international broadcaster in Mainland China; Phoenix TV, a Hong Kong-based satellite television station; and Voice of America (VOA), a U.S. government-funded broadcast programmer. This release contains 256 audio files presented in FLAC-compressed Waveform Audio File format (.flac), 16000 Hz single-channel 16-bit PCM. Each file was audited by a native Chinese speaker following Audit Procedure Specification Version 2.0 which is included in this release. The broadcast auditing process served three principal goals: as a check on the operation of the broadcast collection system equipment by identifying failed, incomplete or faulty recordings; as an indicator of broadcast schedule changes by identifying instances when the incorrect program was recorded; and as a guide for data selection by retaining information about a program's genre, data type and topic."--LDC online catalog.
Format
Audio / Data Sets
Language
Chinese
Added to Catalog
July 03, 2019
Genre/Form
Data sets.
Speech corpora.
Sound recordings.
Also listed under
Walker, Kevin, creator.
Linguistic Data Consortium, issuing body.