Sparse Quantization for Patch Description

Xavier Boix, Michael Gygli, Gemma Roig, Luc Van Gool; The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013, pp. 2842-2849


The representation of local image patches is crucial for the good performance and efficiency of many vision tasks. Patch descriptors have been designed to generalize towards diverse variations, depending on the application, as well as the desired compromise between accuracy and efficiency. We present a novel formulation of patch description, that serves such issues well. Sparse quantization lies at its heart. This allows for efficient encodings, leading to powerful, novel binary descriptors, yet also to the generalization of existing descriptors like SIFT or BRIEF. We demonstrate the capabilities of our formulation for both keypoint matching and image classification. Our binary descriptors achieve state-of-the-art results for two keypoint matching benchmarks, namely those by Brown [6] and Mikolajczyk [18]. For image classification, we propose new descriptors that perform similar to SIFT on Caltech101 [10] and PASCAL VOC07 [9].

Related Material

author = {Boix, Xavier and Gygli, Michael and Roig, Gemma and Van Gool, Luc},
title = {Sparse Quantization for Patch Description},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2013}