Fast Action Proposals for Human Action Detection and Search

Gang Yu, Junsong Yuan; The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, pp. 1302-1311

Abstract


In this paper we target at generating generic action proposals in unconstrained videos. Each action proposal corresponds to a temporal series of spatial bounding boxes, i.e., a spatio-temporal video tube, which has a good potential to locate one human action. Assuming each action is performed by a human with meaningful motion, both appearance and motion cues are utilized to measure the actionness of the video tubes. After picking those spatiotemporal paths of high actionness scores, our action proposal generation is formulated as a maximum set coverage problem, where greedy search is performed to select a set of action proposals that can maximize the overall actionness score. Compared with existing action proposal approaches, our action proposals do not rely on video segmentation and can be generated in nearly real-time. Experimental results on two challenging datasets, MSRII and UCF 101, validate the superior performance of our action proposals as well as competitive results on action detection and search.

Related Material


[pdf]
[bibtex]
@InProceedings{Yu_2015_CVPR,
author = {Yu, Gang and Yuan, Junsong},
title = {Fast Action Proposals for Human Action Detection and Search},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2015}
}