SOLD: Sub-Optimal Low-rank Decomposition for Efficient Video Segmentation

Chenglong Li, Liang Lin, Wangmeng Zuo, Shuicheng Yan, Jin Tang; The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, pp. 5519-5527


This paper investigates how to perform robust and efficient unsupervised video segmentation while suppressing the effects of data noises and/or corruptions. We propose a general algorithm, called Sub-Optimal Low-rank Decomposition (SOLD), which pursues the low-rank representation for video segmentation. Given the supervoxels affinity matrix of an observed video sequence, SOLD seeks a sub-optimal solution by making the matrix rank explicitly determined. In particular, the affinity matrix with the rank fixed can be decomposed into two sub-matrices of low rank, and then we iteratively optimize them with closed-form solutions. Moreover, we incorporate a discriminative replication prior into our framework based on the obervation that small-size video patterns tend to recur frequently within the same object. The video can be segmented into several spatio-temporal regions by applying the Normalized-Cut (NCut) algorithm with the solved low-rank representation. To process the streaming videos, we apply our algorithm sequentially over a batch of frames over time, in which we also develop several temporal consistent constraints improving the robustness. Extensive experiments on the public benchmarks demonstrate superior performance of our framework over other state-of-the-art approaches.

Related Material

author = {Li, Chenglong and Lin, Liang and Zuo, Wangmeng and Yan, Shuicheng and Tang, Jin},
title = {SOLD: Sub-Optimal Low-rank Decomposition for Efficient Video Segmentation},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2015}