Books+ Search Results

Machine reading phase 1 IC : training data

Title
Machine reading phase 1 IC : training data / Linguistic Data Consortium.
ISBN
1585639168
Publication
[Philadelphia, PA] : [Linguistic Data Consortium], [2020]
Physical Description
1 online resource
Local Notes
Access is available to the Yale community.
Notes
Applications: machine reading, knowledge representation, information extraction.
Authors: Heather Simpson, Stephanie Strassel, Jonathan Wright, Kira Griffitt.
Data source: newswire.
Data type: text.
LDC number: LDC2020T04.
In English.
Title from resource home page (LDC website, viewed September 1, 2020).
Access and use
Access restricted by licensing agreement.
Summary
"The data in this release constitutes the training data for the IC (Core Domain) task. The IC Use Cases tested the core domain by extracting information about about Entities (people, organizations, geopolitical entities or "GPEs") and their involvement in four types of Relations: Attack Relations (e.g. bombings), Biographical Relations (e.g. being a citizen of a country), Affiliation Relations (e.g. being a leader of an organization), and Family Relations (e.g. having a spouse) as described in newswire text. This information was then aligned with an IC Use Cases ontology that would allow automated reasoning about the extracted Entities and Relations. This release contains 248 source documents (108,960 words) from English newswire stories in English Gigaword Fourth Edition (LDC2009T13). Roughly half of those documents (116) were annotated for IC/Core Use Cases. Annotation was non-exhaustive, but an attempt was made to provide instances of all relations and their arguments where explicitly stated in a single sentence, as well as some non-explicit relations, which were marked with an "Inferred" tag by the annotator. " --LDC online catalog.
Format
Books / Data Sets / Online
Language
English
Added to Catalog
September 01, 2020
Genre/Form
Data sets.
Text corpora.
Citation

Available from:

Loading holdings.
Unable to load. Retry?
Loading holdings...
Unable to load. Retry?