Version 1 by Becky McGuinness
on Nov 19, 2013 09:48.

compared with
Current by Becky McGuinness
on Nov 19, 2013 09:50.

Changes (1)

The Hadoop job is first developed and tested locally using a small data set; it will later be executed on a Hadoop cluster using a representative real-world web archive data sample.


{anchor:Digital Books}

h3. Digital Books: Quality Assurance, text mining (OCR Quality)