Books+ Search Results

BOLT Egyptian-English word alignment : discussion forum training

Title
BOLT Egyptian-English word alignment : discussion forum training / Linguistic Data Consortium.
ISBN
1585638811
Publication
Philadelphia : Linguistic Data Consortium, University of Pennsylvania, [2019]
Physical Description
1 CD-ROM ; 4 3/4 in.
Local Notes
Access is available to the Yale community.
Notes
"LDC2019T06."
Applications: Automatic content extraction, machine translation.
Authors: Xuansong Li, Katherine Peterson, Stephen Grimes, Stephanie Strassel.
Data source: Discussion forum.
Data type: Text.
Title from disc label.
In Egyptian Arabic, English.
Access and use
Access restricted by licensing agreement.
Summary
"This release consists of Egyptian source discussion forum threads harvested from the Internet by LDC using a combination of manual and automatic processes. The source data is released as BOLT Arabic Discussion Forums (LDC2018T10). The BOLT word alignment task was built on treebank annotation. Specifically, Egyptian source tree tokens for word alignment were automatically extracted from tree files of BOLT Egyptian Arabic Treebank annotation on the source discussion forum data harvested by LDC. Human annotators then followed LDC guidelines to link words and phrases in Arabic to those in English."--LDC online catalog.
Variant and related titles
Broad Operational Language Translation Egyptian-English word alignment : discussion forum training
Format
Books / Data Sets
Language
Arabic
Added to Catalog
July 11, 2019
Genre/Form
Data sets.
Text corpora.
Also listed under
Li, Xuansong, creator.
Linguistic Data Consortium, issuing body.
Citation

Available from:

Loading holdings.
Unable to load. Retry?
Loading holdings...
Unable to load. Retry?