View Source

h2. Investigator(s)

Per Møldrup-Dalum ([email protected])

h2. Dataset

8TB of non-ordered data from the Danish Web Archive

h2. Platform

http://wiki.opf-labs.org/display/SP/SB+Hadoop+Platform

h2. Workflow

A set of file references as NFS paths were aggregated into a set of input files. There input files were, one at a time, fed to the pre-programmed Hadoop module of the [Nanite|https://github.com/openplanets/nanite] project. During the set-up phase of the project a few corrections were implemented in this module.

h2. Requirements and Policies

_Policy statements that relate to this experiment and any evaluation criteria taken from SCAPE metrics_

h2. Evaluations

{pageTree:[email protected]}