|One line summary|| Tool that uses Apache Tika to parse and compare documents.
|Detailed description|| AQDC is a Spring MVC Framework based Web application that wraps Apache Tika to provide a quick analysis of two documents (typically the original and its migration).
There is a simple Web form containing two fields (the original and the migrated file). On submit, these files are uploaded to the Web app and parsed by Tika's "AutoDetectParser".
There is no error checking, so don't be surprised to see HTTP 500 errors from time to time! :-)
The parser is hooked up to a couple of output handlers, notably text and XHTML generators, along with some basic (and usually wrong! :-)) language identification.
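As a rough illustration of the parsing step described above, here is a minimal sketch of running Tika's AutoDetectParser once and collecting both plain text and XHTML from the same pass. It is not taken from the AQDC source: the class name `ParseSketch`, the `Parsed` holder and the use of `TeeContentHandler` to fan out to both generators are assumptions made for the example, and language identification is left out. It assumes the Tika parsers bundle is on the classpath.

```java
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;

import org.apache.tika.metadata.Metadata;
import org.apache.tika.parser.AutoDetectParser;
import org.apache.tika.parser.ParseContext;
import org.apache.tika.sax.BodyContentHandler;
import org.apache.tika.sax.TeeContentHandler;
import org.apache.tika.sax.ToXMLContentHandler;

public class ParseSketch {

    /** Hypothetical holder for the outputs of one parse (not an AQDC class). */
    public static final class Parsed {
        public final String text;
        public final String xhtml;
        public final Metadata metadata;

        Parsed(String text, String xhtml, Metadata metadata) {
            this.text = text;
            this.xhtml = xhtml;
            this.metadata = metadata;
        }
    }

    /** Parses one stream, sending the same SAX events to a text and an XHTML handler. */
    public static Parsed parse(InputStream in) throws Exception {
        BodyContentHandler text = new BodyContentHandler(-1); // -1 disables the write limit
        ToXMLContentHandler xhtml = new ToXMLContentHandler();
        Metadata metadata = new Metadata();

        // AutoDetectParser detects the file type and delegates to a matching parser.
        new AutoDetectParser().parse(
                in, new TeeContentHandler(text, xhtml), metadata, new ParseContext());

        return new Parsed(text.toString(), xhtml.toString(), metadata);
    }

    public static void main(String[] args) throws Exception {
        // Assumed usage: args[0] is the original file, args[1] the migrated one.
        for (String path : args) {
            try (InputStream in = Files.newInputStream(Paths.get(path))) {
                Parsed p = parse(in);
                System.out.println(path + ": " + p.text.length()
                        + " characters of text, detected type "
                        + p.metadata.get(Metadata.CONTENT_TYPE));
            }
        }
    }
}
```

Unlike the Web app as described, this sketch simply lets exceptions propagate, so a file Tika cannot read would end the run with a stack trace.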
Armed with the text the Web app then performs a couple of checks:
|Solution champion|| Peter Cliff
|Tool (link)|| There is a self-contained (Jetty) server that can be downloaded from GitHub and run on your own machine.