Martin Bäuml Publications Projects Datasets

Contextual Constraints for Person Retrieval in Camera Networks

M. Bäuml, M. Tapaswi, A. Schumann, R. Stiefelhagen
International Conference on Advanced Video and Signal-based Surveillance (AVSS), Beijing, China, September 2012
[paper] [bib]

Abstract

We use contextual constraints for person retrieval in camera networks. We start by formulating a set of general positive and negative constraints on the identities of person tracks in camera networks, such as a person cannot appear twice in the same frame. We then show how these constraints can be used to improve person retrieval. First, we use the constraints to obtain training data in an unsupervised way to learn a general metric that is better suited to discriminate between different people than the Euclidean distance. Second, starting from an initial query track, we enhance the query-set using the constraints to obtain additional positive and negative samples for the query. Third, we formulate the person retrieval task as an energy minimization problem, integrate track scores and constraints in a common framework and jointly optimize the retrieval over all interconnected tracks. We evaluate our approach on the CAVIAR dataset and achieve 22% relative performance improvement in terms of mean average precision over standard retrieval where each track is treated independently.

Data set

The following package contains the person tracks and labels as used in this paper.

PersonRetrieval_CAVIAR_AVSS2012_v1.0.tar.gz (1MB)