Arabic treebank - weblog

Title
Arabic treebank - weblog / Linguistic Data Consortium.
ISBN
1585637416
9781585637416
Published
[Philadelphia, PA] : Linguistic Data Consortium, ©2016.
Physical Description
1 online resource
Local Notes
Access is available to the Yale community.
Notes
Application(s): automatic content extraction, cross-lingual information retrieval, information detection.
Author(s): Mohamed Maamouri, Ann Bies, Seth Kulick, Sondos Krouna, Dalila Tabassi, Michael Ciul.
Data source(s): weblogs.
Data type(s): text.
Language(s): Arabic, Standard Arabic.
Access and use
Access restricted by licensing agreement.
Summary
"The ongoing Penn Arabic Treebank Project (PATB) supports research in Arabic-language natural language processing and human language technology development. ... This release contains 243,117 source tokens before clitics were split, and 308,996 tree tokens after clitics were separated for treebank annotation. The source material is weblogs collected by LDC from various sources."--LDC catalog.
Format
Books / Data Sets / Online
Language
Arabic
Added to Catalog
May 31, 2017
Genre/Form
Data sets.
Text corpora.
Also listed under
Maamouri, Mohamed.
Linguistic Data Consortium.