Matchbox

Version 3 by Roman Graf
on Feb 07, 2013 14:49.

compared with
Current by Roman Graf
on Feb 07, 2013 14:51.

Key
This line was removed.
This word was removed. This word was added.
This line was added.

Changes (12)

View Page History
| Homepage \\ | [https://github.com/openplanets/scape/tree/master/pc-qa-matchbox] |
| Source Code Repository \\ | [https://github.com/openplanets/scape/tree/master/pc-qa-matchbox] |
| License \\ | Open source \\ |
| Debian Package | [http://packages.debian.org/stable/package] |


The image processing algorithm can be described in 4 steps:
# 1. Document feature extraction

* Interest point detection (applying Scale Invariant Feature Transform (SIFT) keypoint extraction)
* Derivation of local feature descriptors (invariant to geometrical or radiometrical distortions)

# 2. Learning visual dictionary

* Clustering method applied to all SIFT descriptors of all images using k-means algorithm
* Run over collection and collect local descriptors in a visual dictionary using Bag-Of-Words (BoW) algorithm

# 3. Create visual histogram for each image document
# 4. Detect similar images based on visual histogram and local descriptors. Evaluate similarity score -- pair-wise comparison of corresponding keyword frequency histograms for all documents. Conduct structural similarity analysis applying Sturctural SIMilarity (SSIM) approach (1 means identical and 0 means very different)

* Rotate
* Scale
FindDuplicates script can be invoked from command line. For standard usage two parameters are required: path to the collection documents and ‘all’.
scape/pc-qa-matchbox/Python# python2.7 FindDuplicates.py \-h
usage: FindDuplicates.py [-h] [\--threads THREADS|--threads THREADS] [\--sdk SDK|--sdk SDK] [\--precluster PRECLUSTER|--precluster PRECLUSTER] [\--clahe CLAHE|--clahe CLAHE] [\--config CONFIG|--config CONFIG] [\--featdir FEATDIR|--featdir FEATDIR] [\--bowsize BOWSIZE|--bowsize BOWSIZE] [\--csv|--csv] [-v] dir _all,extract,compare,train,bowhist,clean_
[\--precluster PRECLUSTER|--precluster PRECLUSTER] [\--clahe CLAHE|--clahe CLAHE]
[\--config CONFIG|--config CONFIG] [\--featdir FEATDIR|--featdir FEATDIR]
[\--bowsize BOWSIZE|--bowsize BOWSIZE] [\--csv|--csv] [-v]
dir _all,extract,compare,train,bowhist,clean_