A Linear Approach to Matching Cuboids in RGBD Images

Hao Jiang, Jianxiong Xiao; The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013, pp. 2171-2178

Abstract


We propose a novel linear method to match cuboids in indoor scenes using RGBD images from Kinect. Beyond depth maps, these cuboids reveal important structures of a scene. Instead of directly fitting cuboids to 3D data, we first construct cuboid candidates using superpixel pairs on a RGBD image, and then we optimize the configuration of the cuboids to satisfy the global structure constraints. The optimal configuration has low local matching costs, small object intersection and occlusion, and the cuboids tend to project to a large region in the image; the number of cuboids is optimized simultaneously. We formulate the multiple cuboid matching problem as a mixed integer linear program and solve the optimization efficiently with a branch and bound method. The optimization guarantees the global optimal solution. Our experiments on the Kinect RGBD images of a variety of indoor scenes show that our proposed method is efficient, accurate and robust against object appearance variations, occlusions and strong clutter.

Related Material


[pdf]
[bibtex]
@InProceedings{Jiang_2013_CVPR,
author = {Jiang, Hao and Xiao, Jianxiong},
title = {A Linear Approach to Matching Cuboids in RGBD Images},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2013}
}