ODE II - Work Package 6: Data

Data sets created for the ODE II (Dutch) project.


For the first deliverable (M1), individual gzipped XML files were created. These files are available for reference but are not actively used.


For the final data application, three data sets have been created. The first two data sets are packaged as eXist apps. Because a .xar file is a zip archive, the data can be extracted by unzipping it and copying the files from the data directory within.
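
As a minimal sketch of that extraction, the following Python function opens a .xar with the standard zipfile module and copies out only the entries under the data directory. The function name, the example file name and the "data/" prefix are assumptions for illustration; check the actual directory name inside the package before relying on it.

```python
import zipfile


def extract_xar_data(xar_path, dest_dir, data_prefix="data/"):
    """Extract the entries below data_prefix from a .xar (zip) file.

    data_prefix is an assumed directory name; adjust it to match the
    layout of the package being unpacked.
    Returns the list of archive entry names that were extracted.
    """
    extracted = []
    with zipfile.ZipFile(xar_path) as xar:
        for name in xar.namelist():
            # Skip directory entries and anything outside the data directory.
            if name.startswith(data_prefix) and not name.endswith("/"):
                xar.extract(name, dest_dir)
                extracted.append(name)
    return extracted
```

A call such as extract_xar_data("some-app.xar", "extracted") (hypothetical file name) would write the files to extracted/data/ and leave the rest of the package untouched.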

The third data set is a collection of approximately 20,000 files of Dutch proceedings, packaged as a .tar.gz. A subset containing only the 20112012 data from the proceedings is also available for testing with a smaller data set.
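
To check an archive before unpacking it, the number of files it contains can be counted directly. The sketch below uses Python's standard tarfile module; the function name and the example archive name are hypothetical.

```python
import tarfile


def count_files_in_targz(archive_path):
    """Count the regular files in a .tar.gz without extracting them."""
    count = 0
    with tarfile.open(archive_path, "r:gz") as archive:
        for member in archive:
            # Directories and links are not counted, only regular files.
            if member.isfile():
                count += 1
    return count
```

For example, count_files_in_targz("proceedings.tar.gz") (hypothetical file name) returns the number of proceedings files in the archive.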

M3 / Final data sets