Joint Multi-Feature Spatial Context for Scene Recognition on the Semantic Manifold

Xinhang Song, Shuqiang Jiang, Luis Herranz; The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, pp. 1312-1320

Abstract


In the semantic multinomial framework patches and images are modeled as points in a semantic probability simplex. Patch theme models are learned resorting to weak supervision via image labels, which leads the problem of scene categories co-occurring in this semantic space. Fortunately, each category has its own co-occurrence patterns that are consistent across the images in that category. Thus, discovering and modeling these patterns is critical to improve the recognition performance in this representation. In this paper, we observe that not only global co-occurrences at the image-level are important, but also different regions have different category co-occurrence patterns. We exploit local contextual relations to address the problem of discovering consistent co-occurrence patterns and removing noisy ones. Our hypothesis is that a less noisy semantic representation, would greatly help the classifier to model consistent co-occurrences and discriminate better between scene categories. An important advantage of modeling features in a semantic space, is that this space is feature independent. Thus, we can combine multiple features and spatial neighbors in the same common space, and formulate the problem as minimizing a context-dependent energy. Experimental results show that exploiting different types of contextual relations consistently improves the recognition accuracy. In particular, larger datasets benefit more from the proposed method, leading to very competitive performance.

Related Material


[pdf]
[bibtex]
@InProceedings{Song_2015_CVPR,
author = {Song, Xinhang and Jiang, Shuqiang and Herranz, Luis},
title = {Joint Multi-Feature Spatial Context for Scene Recognition on the Semantic Manifold},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2015}
}