Authors: Jennifer Tracey, Stephanie Strassel, David Graff, Jonathan Wright, Song Chen, Neville Ryant, Kira Griffitt, Dana Delgado, Michael Arrigo.
Data source: discussion forum, newswire, web collection, weblogs.
Data type: software, text.
Applications: cross-language transfer, entity extraction, information extraction, machine translation.
LDC number: LDC2023T01.
Languages: Swahili, English.