Books+ Search Results

Phrase detectives corpus

Title
Phrase detectives corpus / Linguistic Data Consortium ; Jon Chamberlain, Silviu Paun, Juntao Yu, Udo Kruschwitz, Massimo Poesio.
ISBN
1585638935
9781585638932
Edition
Version 2.
Publication
[Philadelphia, PA] : Linguistic Data Consortium, [2019]
Copyright Notice Date
©2019
Physical Description
1 CD-ROM ; 4 3/4 in.
Local Notes
Access is available to the Yale community.
Notes
"LDC2019T10"
Applications: information detection, information extraction, parsing, tagging.
Data source: web collection, fiction.
Data type: text.
Tittle from disc label.
Access and use
Access restricted by licensing agreement.
Summary
"Phrase Detectives Corpus Version 2 was developed by the School of Computer Science and Electronic Engineering at the University of Essex and consists of approximately 407,000 tokens across 537 documents anaphorically-annotated by the Phrase Detectives Game, an online interactive "game-with-a-purpose" (GWAP) designed to collect data about English anaphoric coreference. This release constitutes a new version of the Phrase Detectives Corpus (LDC2017T08) that adds significantly more annotated tokens to the data set and supplies for each markable a substantial number of judgments expressed by the players and a silver label annotation based on the probabilistic aggregation method for anaphoric information. GWAPs for creating language resources are growing. In general, they employ non-monetary incentives, such as entertainment, to motivate participation and can be successful for large-scale persistent annotation efforts. Two projects that collect linguistic resources via Phrase Detectives and other similar language-oriented GWAPs are DALI (Disagreements and Language Interpretation), led by Queen Mary University of London and the University of Essex, and the LDC NIEUW (Novel Incentives and Workflows in Linguistic Data Annotation) project through its game site Lingo Boingo, in collaboration with Queen Mary University, the University of Essex and other partners." --via LDC online catalog.
Format
Books / Data Sets
Language
English
Added to Catalog
November 22, 2019
Genre/Form
Data sets.
Databases.
Text corpora.
Also listed under
Linguistic Data Consortium, issuing body.
Citation

Available from:

Loading holdings.
Unable to load. Retry?
Loading holdings...
Unable to load. Retry?