Books+ Search Results

AttImam

Title
AttImam / Linguistic Data Consortium.
ISBN
1585639877
Publication
[Philadelphia, PA] : [Linguistic Data Consortium], [2022]
Physical Description
1 online resource
Local Notes
Access is available to the Yale community.
Notes
Authors: Amal Alsaif, Tasniem Alyahya, Madawi Alotibi, Huda Almuzaini, Abeer Alqahtani.
Data source: newswire.
Data type: software, text.
Applications: discourse analysis, entity extraction, language identification.
LDC number: LDC2022T02.
In Arabic.
Title from resource home page (LDC website, viewed July 19, 2022).
Access and use
Access restricted by licensing agreement.
Summary
"The source Arabic newswire was collected by LDC from Agence France Presse articles published in 2000. Annotation was performed by two native Arabic speakers. Each file has the following four elements: Cue: the lexical anchor that connects the source with the content -- Source: the entity or the agent that owns the content -- Content: the basic element expressing the claim or the reported news -- General Features: these can include such features as attribution style (direct or indirect), determinacy (factual or non-factual), and purpose (e.g., assertion, expression). The corpus contains 532 files in UTF-8 encoded plain text. Also included are annotation guidelines and the ESNAD (Extracting Sentence Attribution in Arabic Discourse) annotation tool." --LDC online catalog.
Format
Books / Data Sets / Online
Language
Arabic
Added to Catalog
July 19, 2022
Contents
data file (contains the annotated data)
docs file (contains additional documentation, annotation guidelines, a paper about the corpus and a file table)
tools file (contains the annotation tool).
Genre/Form
Data sets.
Text corpora.
Also listed under
Alsaif, Alma, creator.
Linguistic Data Consortium, issuing body.
Citation

Available from:

Loading holdings.
Unable to load. Retry?
Loading holdings...
Unable to load. Retry?