compared with
Current by Niels Bjarke Reimer
on Jul 19, 2013 12:16.

Key
This line was removed.
This word was removed. This word was added.
This line was added.

Changes (6)

View Page History
| *Scalability Challenge* \\ | _Large amounts of data (200Tbytes \+). For simple characterisation not much CPU is required but a lot of I/O is needed._ \\
Some of the files are rather large (8Gbytes) - could be a problem for some characterisation tools (not problematic for tools that only reads header information and magic bytes) \\ |
| *[Issue champion|SP:Responsibilities of the roles described on these pages]* | [Bjarne Andersen|https://portal.ait.ac.at/sites/Scape/_layouts/userdisp.aspx?ID=8] [Gry Elstrøm|https://portal.ait.ac.at/sites/Scape/_layouts/userdisp.aspx?ID=65] (SB) |
| *Other interested parties* \\ | |
| *Possible Solution approaches* | Should be simple. \\
| *Lessons Learned* | \\ |
| *Training Needs* | \\ |
| *Datasets* | [WAV with Danish Radio broadcasts, ripped audio CD’s, and SB in-house audio digitization|SP:WAV with Danish Radio broadcasts, ripped audio CD’s, and SB in-house audio digitization (WAVfiles)]\\ |
| *Solutions* | [SO06 Use Ffprobe to characterise Wav]\\
[SO25 Rosetta v3.0 Implementation Integrated with DROID 6|http://wiki.opf-labs.org/display/SP/SO25+Rosetta+v3.0+Implementation+Integrated+with+DROID+6]\\ |
| *Solutions* | [SO06 Use Ffprobe to characterise audio+video|SP:SO06 Use Ffprobe to characterise audio+video]\\
\\
[SO06 Use Ffprobe to characterise audio+video|SO06 Use Ffprobe to characterise audio+video][SP:SO25 Rosetta v3.0 Implementation Integrated with DROID 6, JHOVE1, NLNZ tool and more...]\\ |

h1. Evaluation

| *Objectives* | _Which scape objectives does this issues and a future solution relate to? e.g. scaleability, rubustness, reliability, coverage, preciseness, automation_ |
| *Success criteria* | _Describe the success criteria for solving this issue - what are you able to do? - what does the world look like?_ |
| *Automatic measures* | _What automated measures would you like the solution to give to evaluate the solution for this specific issue? which measures are important?_ \\
_If possible specify very specific measures and your goal - e.g._ \\
_ \* process 50 documents per second_ \\
_ \* handle 80Gb files without crashing_ \\
_ \* identify 99.5% of the content correctly_ \\ |
| *Manual assessment* | _Apart from automated measures that you would like to get do you foresee any necessary manual assessment to evaluate the solution of this issue?_ \\
_If possible specify measures and your goal - e.g._ \\
_ \* Solution installable with basic linux system administration skills_ \\
_ \* User interface understandable by non developer curators_ \\ |
| *Objectives* | This is about scaleability and functionality |
| *Success criteria* | We will have a workflow that can process WAV (and BWF) files - also larger files up to 10Gb \\ |
| *Automatic measures* | 1. Support for both WAV and BWF \\
2. Support for larger files - up to 10Gb \\
3. Process 2Tbytes of sample content in less than 24 hours \\
4. 100% of the files are identified correctly \\
5. 100% of the files gets useful and correct characterisation output \\ |
| *Manual assessment* | 1. Sample checking of the generated characterisation output \\ |
| *Actual evaluations* | links to acutual evaluations of this Issue/Scenario |