|Purpose||Tool for the web pages comparison based on structural and visual approach. Research challenge for this tool is the learning algorithm based on frequency.|
| Source Code Repository
|| As Is
|Debian Package|| not available yet
Our system is based on: (1) a combination of structural and visual comparison methods embedded in a statistical discriminative model, (2) a visual similarity measure designed for Web pages that improves change detection, (3) a supervised feature selection method adapted to Web archiving. We train a Support Vector Machine model with vectors of similarity scores between successive versions of pages. The trained model then determines whether two versions, defined by their vector of similarity scores, are similar or not. Experiments on real Web archives validate our approach.
Link to any RSS feed that is updated when new releases occur, if any, e.g:
Link to any RSS feed that is updated when issue or code updates occur, if any, e.g: