Holistic 3D Scene Understanding From a Single Geo-Tagged Image

Shenlong Wang, Sanja Fidler, Raquel Urtasun; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, pp. 3964-3972

Abstract


In this paper we are interested in exploiting geographic priors to help outdoor scene understanding. Towards this goal we propose a holistic approach that reasons jointly about 3D object detection, pose estimation, semantic segmentation as well as depth reconstruction from a single image. Our approach takes advantage of large-scale crowd-sourced maps to generate dense geographic, geometric and semantic priors by rendering the 3D world. We demonstrate the effectiveness of our holistic model on the challenging KITTI dataset, and show significant improvements over the baselines in all metrics and tasks.

Related Material


[pdf]
[bibtex]
@InProceedings{Wang_2015_CVPR,
author = {Wang, Shenlong and Fidler, Sanja and Urtasun, Raquel},
title = {Holistic 3D Scene Understanding From a Single Geo-Tagged Image},
booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2015}
}