Books+ Search Results

LORELEI Oromo incident language pack

Title
LORELEI Oromo incident language pack / Linguistic Data Consortium.
ISBN
158563929X
Publication
[Philadelphia, PA] : [Linguistic Data Consortium], [2020]
Physical Description
1 CD-ROM ; 3 3/4 in
Local Notes
Access is available to the Yale community.
Notes
Applications: information extraction, knowledge base population, relation extraction, knowledge representation, temporal analysis.
Authors: Jennifer Tracey, David Graff, Stephanie Strassel, Michael Arrigo, Jonathan Wright, Ann Bies.
Data source: web collection, newswire, weblogs, newsgroups, discussion forum, religious texts.
Data type: text.
LDC number: LDC2020T11.
In Oromo and English.
Title from resource home page (LDC website, viewed September 8, 2020).
Access and use
Access restricted by licensing agreement.
Summary
"Oromo is a Cushitic language spoken in Ethiopia, Kenya, Somalia and Egypt, and it is the third largest language in Africa. Data was collected in the following genres: news, social network, weblog, newsgroup, discussion forum, and reference material. Entity detection and linking annotation identified entities to be detected by systems for scoring purposes. Situation frame analysis was designed to extract basic information about needs and relevant issues for planning a disaster response effort. Also included in this release are lexical and grammatical resources as well as three tools: two to recreate original source data from the processed XML material and the other to condition text data users download from Twitter." --LDC online catalog.
Variant and related titles
Low Resource Languages for Emergent Incidents Oromo incident language pack
Oromo incident language pack
Format
Books / Data Sets
Language
Oromo; English
Added to Catalog
February 26, 2021
Genre/Form
Data sets.
Text Corpora.
Citation

Available from:

Online
Loading holdings.
Unable to load. Retry?
Loading holdings...
Unable to load. Retry?