h2. Investigator(s)
Rune Ferneke-Nielsen (SB)
h2. Dataset
[SP:SB Web Archive Data]
h2. Platform
[SP:SB Hadoop Platform]
h2. Workflow
_Description and ideally a link to a Taverna workflow._
The intention is to repeat the ONB experiment on SB content.
In addition, we would like to extract web crawler information from ARC files and add it to WARC files.
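As a rough illustration of the ARC-to-WARC step, the sketch below reads the fields of an ARC version-1 URL-record header line (URL, IP address, archive date, content type, length) and maps them to WARC/1.0 named fields. It is not the experiment's implementation. It assumes that "web crawler information" means these per-record header fields and that each ARC payload is an HTTP response; the function names are hypothetical.

```python
import uuid
from datetime import datetime, timezone


def parse_arc_header(line):
    """Split an ARC version-1 URL-record header line into named fields.

    Expected layout: URL IP-address Archive-date Content-type Archive-length
    """
    url, ip, date14, mime, length = line.strip().split(" ")
    return {"url": url, "ip": ip, "date": date14, "mime": mime, "length": int(length)}


def arc_date_to_warc(date14):
    """Convert an ARC timestamp (YYYYMMDDhhmmss, UTC) to the WARC-Date form."""
    parsed = datetime.strptime(date14, "%Y%m%d%H%M%S").replace(tzinfo=timezone.utc)
    return parsed.strftime("%Y-%m-%dT%H:%M:%SZ")


def warc_headers(arc, record_id=None):
    """Build WARC/1.0 header lines that carry over the ARC header fields.

    Assumption: the ARC payload is a full HTTP response, so the record is
    typed as 'response'. Other payloads would need a different WARC-Type.
    """
    record_id = record_id or "<urn:uuid:%s>" % uuid.uuid4()
    return [
        "WARC/1.0",
        "WARC-Type: response",
        "WARC-Record-ID: %s" % record_id,
        "WARC-Target-URI: %s" % arc["url"],
        "WARC-Date: %s" % arc_date_to_warc(arc["date"]),
        "WARC-IP-Address: %s" % arc["ip"],
        "Content-Type: application/http; msgtype=response",
        "Content-Length: %d" % arc["length"],
    ]


if __name__ == "__main__":
    # Illustrative header line only; not taken from the SB data set.
    sample = "http://example.org/ 192.0.2.1 20120101123000 text/html 1234"
    print("\r\n".join(warc_headers(parse_arc_header(sample))))
```

On the Hadoop platform, a mapper could apply such a conversion per record. Crawl-level metadata (for example crawl logs) would need separate handling that is not shown here.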
h2. Requirements and Policies
_Policy statements that relate to this experiment and any evaluation criteria taken from SCAPE metrics_
h2. Evaluations
_Links to results of the experiment using the evaluation template._
{pageTree:[email protected]}