Semantics-Preserving Hashing for Cross-View Retrieval

Zijia Lin, Guiguang Ding, Mingqing Hu, Jianmin Wang; The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, pp. 3864-3872


With benefits of low storage costs and high query speeds, hashing methods are widely researched for efficiently retrieving large-scale data, which commonly contains multiple views, e.g. a news report with images, videos and texts. In this paper, we study the problem of cross-view retrieval and propose an effective Semantics-Preserving Hashing method, termed SePH. Given semantic affinities of training data as supervised information, SePH transforms them into a probability distribution and approximates it with to-be-learnt hash codes in Hamming space via minimizing the Kullback-Leibler divergence. Then kernel logistic regression with a sampling strategy is utilized to learn the nonlinear projections from features in each view to the learnt hash codes. And for any unseen instance, predicted hash codes and their corresponding output probabilities from observed views are utilized to determine its unified hash code, using a novel probabilistic approach. Extensive experiments conducted on three benchmark datasets well demonstrate the effectiveness and reasonableness of SePH.

Related Material

author = {Lin, Zijia and Ding, Guiguang and Hu, Mingqing and Wang, Jianmin},
title = {Semantics-Preserving Hashing for Cross-View Retrieval},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2015}