Learning Important Spatial Pooling Regions for Scene Classification

Di Lin, Cewu Lu, Renjie Liao, Jiaya Jia; The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014, pp. 3726-3733

Abstract


We address the false response influence problem when learning and applying discriminative parts to construct the mid-level representation in scene classification. It is often caused by the complexity of latent image structure when convolving part filters with input images. This problem makes mid-level representation, even after pooling, not distinct enough to classify input data correctly to categories. Our solution is to learn important spatial pooling regions along with their appearance. The experiments show that this new framework suppresses false response and produces improved results on several datasets, including MIT-Indoor, 15-Scene, and UIUC 8-Sport. When combined with global image features, our method achieves state-of-the-art performance on these datasets.

Related Material


[pdf]
[bibtex]
@InProceedings{Lin_2014_CVPR,
author = {Lin, Di and Lu, Cewu and Liao, Renjie and Jia, Jiaya},
title = {Learning Important Spatial Pooling Regions for Scene Classification},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2014}
}