MarcAlizer


Summary

Purpose: Tool for comparing web pages using a combined structural and visual approach. The research challenge for this tool is its frequency-based learning algorithm.
Homepage:
Source Code Repository: not available yet
License: As Is
Debian Package: not available yet

Description

Our system is based on: (1) a combination of structural and visual comparison methods embedded in a statistical discriminative model, (2) a visual similarity measure designed for Web pages that improves change detection, and (3) a supervised feature selection method adapted to Web archiving. We train a Support Vector Machine (SVM) model on vectors of similarity scores computed between successive versions of pages. The trained model then decides whether two versions, each pair represented by its vector of similarity scores, are similar or not. Experiments on real Web archives validate our approach.
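
To illustrate the classification step described above, here is a minimal sketch in Python. It is not MarcAlizer's implementation: it assumes a plain linear SVM trained by sub-gradient descent on the hinge loss, and it assumes each pair of versions is reduced to just two scores (one structural, one visual). The function names, the training data, and the hyperparameters are all hypothetical and chosen only for illustration.

```python
# Hypothetical training data (not from MarcAlizer): each vector holds
# (structural_similarity, visual_similarity) for two successive versions
# of a page; label +1 means "similar", -1 means "dissimilar".
TRAINING_PAIRS = [
    ((0.90, 0.95), 1),
    ((0.85, 0.90), 1),
    ((0.95, 0.80), 1),
    ((0.80, 0.85), 1),
    ((0.20, 0.30), -1),
    ((0.10, 0.25), -1),
    ((0.30, 0.15), -1),
    ((0.35, 0.40), -1),
]


def train_linear_svm(samples, epochs=500, lr=0.05, lam=0.01):
    """Train a linear SVM with sub-gradient descent on the hinge loss.

    Returns (weights, bias). All hyperparameters are illustrative defaults.
    """
    dim = len(samples[0][0])
    w = [0.0] * dim
    b = 0.0
    for _ in range(epochs):
        for x, y in samples:
            margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
            if margin < 1:
                # Sample violates the margin: move towards it, with L2 shrinkage.
                w = [wi + lr * (y * xi - lam * wi) for wi, xi in zip(w, x)]
                b += lr * y
            else:
                # Margin satisfied: apply only the regularization shrinkage.
                w = [wi - lr * lam * wi for wi in w]
    return w, b


def predict(model, x):
    """Return +1 (versions similar) or -1 (versions dissimilar)."""
    w, b = model
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if score >= 0 else -1


model = train_linear_svm(TRAINING_PAIRS)
```

Calling `predict(model, (0.9, 0.9))` classifies a pair of versions whose structural and visual scores are both high, and `predict(model, (0.15, 0.2))` classifies a pair whose scores are both low. A real deployment would use more similarity features and, as the description notes, a supervised feature selection step before training.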

User Experiences

SO18 Comparing two web page versions for web archiving

News Feeds

Release Feed


Activity Feed
