| *Description* | The Austrian National Library uses representative datasets from its web archive: \\
\- event-based selective crawls: sites harvested frequently during an event, e.g. the EU election 2009 and the Olympic Games 2010, \\
\- domain crawls 2009 from about 1 million domains. \\
\\
The web archive data is available in the ARC.GZ format.\\
The size of the ARC.GZ data set is 1377 GB. \\
\\
The metadata log file produced during the crawl process is available as a text file and has a size of 197 GB. \\ |
| *Licensing* | Sample only available to SCAPE partners. \\ |
