Books+ Search Results

British periodicals dataset. Collection I

British periodicals dataset. Collection I, 1681-1937.
[Ann Arbor, Michigan] : [ProQuest LLC], [between 2010 and 2018?]
Physical Description
1 online resource (approximatedly 14,000 files)
Local Notes
Access is available to the Yale community.
Title and variant titles devised by cataloger.
The XML directory contains one zip file per year (for example, 1741_0). The uncompressed folder, once expanded from the zip file, will have a different name that does *not* represent the year. Instead, it will match the 'xxxx' part of the PDF zip files (see below), such as in the PDF directory. Inside the XML folder is a year folder and a sequence number, such as 1741_0. Inside this are article-segmented XML files. It would take some work to reconnect the XML files to the PDF files programmatically. The PDF directory contains one or more zip file per year (for example, 1741_0_xxxx and 1741_1_xxxx). The numbers after the year do *not* refer to the month. Inside these archives are page-segmented PDFs that do not contain OCR (Optical Character Recognition). These are named identically to the files in the XML folder, above.
Description based on record for source database.
Access and use
Access restricted by licensing agreement and agreement to terms of use.
Dataset of articles for text data mining (TDM) from 160 British periodicals on literature, philosophy, history, science, the fine arts and the social sciences dating from 1681-1937, selected from the UMI microfilm collection Early British periodicals. The set contains data in PDF format (segmented into articles) and XML format (segmented into pages).
Variant and related titles
British periodicals online dataset. Collection I
British periodicals online dataset. Collection 1
British periodicals I dataset
Early British periodicals dataset
Archives unbound dataset collection.
Books / Data Sets / Online
Added to Catalog
April 28, 2021
System details note
System requirements: PDF viewer and XML reader.
Text corpora.
Data sets.
Also listed under
ProQuest (Firm), publisher.

Available from:

Loading holdings.
Unable to load. Retry?
Loading holdings...
Unable to load. Retry?