Robust Multi-resolution Pedestrian Detection in Traffic Scenes

Junjie Yan, Xucong Zhang, Zhen Lei, Shengcai Liao, Stan Z. Li; The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013, pp. 3033-3040


The serious performance decline with decreasing resolution is the major bottleneck for current pedestrian detection techniques [14, 23]. In this paper, we take pedestrian detection in different resolutions as different but related problems, and propose a Multi-Task model to jointly consider their commonness and differences. The model contains resolution aware transformations to map pedestrians in different resolutions to a common space, where a shared detector is constructed to distinguish pedestrians from background. For model learning, we present a coordinate descent procedure to learn the resolution aware transformations and deformable part model (DPM) based detector iteratively. In traffic scenes, there are many false positives located around vehicles, therefore, we further build a context model to suppress them according to the pedestrian-vehicle relationship. The context model can be learned automatically even when the vehicle annotations are not available. Our method reduces the mean miss rate to 60% for pedestrians taller than 30 pixels on the Caltech Pedestrian Benchmark, which noticeably outperforms previous state-of-the-art (71%).

Related Material

author = {Yan, Junjie and Zhang, Xucong and Lei, Zhen and Liao, Shengcai and Li, Stan Z.},
title = {Robust Multi-resolution Pedestrian Detection in Traffic Scenes},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2013}