Investigator(s)
Per Møldrup-Dalum ([email protected])
Dataset
8TB of non-ordered data from the Danish Web Archive
Platform
http://wiki.opf-labs.org/display/SP/SB+Hadoop+Platform
Workflow
A set of file references as NFS paths were aggregated into a set of input files. There input files were, one at a time, fed to the pre-programmed Hadoop module of the Nanite project. During the set-up phase of the project a few corrections were implemented in this module.
Requirements and Policies
Policy statements that relate to this experiment and any evaluation criteria taken from SCAPE metrics
Evaluations
Labels:
None
Page:
EVAL-SB-WCT-04