Phrase detectives corpus

Title
Phrase detectives corpus / Linguistic Data Consortium ; Jon Chamberlain, Massimo Poesio, Udo Kruschwitz.
ISBN
158563798x
Publication
[Philadelphia, PA] : Linguistic Data Consortium, [2017]
Physical Description
1 online resource
Local Notes
Access is available to the Yale community.
Notes
"LDC2017T08."
Application(s): information detection, parsing, information extraction, tagging.
Data source(s): fiction, web collection
Data type(s): text.
Access and use
Access restricted by licensing agreement.
Summary
"The documents in the corpus are taken from Wikipedia articles and from narrative text in Project Gutenberg. Wikipedia articles and annotation files are presented as XML and Project Gutenberg source files are presented as plain text. All text is encoded as UTF-8. Annotations are comprised of a gold standard version created by multiple experts, as well as a set created by a large non-expert crowd (via the Phrase Detectives game). The data was annotated according to a prevalent linguistically-oriented approach for anaphora used in several tasks, including OntoNotes Release 5.0 (LDC2013T19), SemEval-2010 Task 1 Ontonotes English: Coreference Resolution in Multiple Languages (LDC2011T01) and The ARRAU Corpus of Anaphoric Information (LDC2013T22)."--LDC catalog
Format
Books / Data Sets / Online
Language
English; Spanish
Added to Catalog
February 19, 2018
Genre/Form
Data sets.
Text corpora.