Publications

Stable Multi-Target Tracking in Real-Time Surveillance Video

B Benfold and I D Reid
Proc Computer Vision and Pattern Recognition (CVPR), Colorado Springs, June 2011

Links to Authors: bbenfold ian

Abstract

The majority of existing pedestrian trackers concentrate on maintaining the identities of targets, however systems for remote biometric analysis or activity recognition in surveillance video often require stable bounding-boxes around pedestrians rather than approximate locations. We present a multi-target tracking system that is designed specifically for the provision of stable and accurate head location estimates. By performing data association over a sliding window of frames, we are able to correct many data association errors and fill in gaps where observations are missed. The approach is multi-threaded and combines asynchronous HOG detections with simultaneous KLT tracking and Markov-Chain Monte-Carlo Data Association (MCMCDA) to provide guaranteed real-time tracking in high definition video. Where previous approaches have used ad-hoc models for data association, we use a more principled approach based on a Minimal Description Length (MDL) objective which accurately models the affinity between observations. We demonstrate by qualitative and quantitative evaluation that the system is capable of providing precise location estimates for large crowds of pedestrians in real-time. To facilitate future performance comparisons, we make a new dataset with hand annotated ground truth head locations publicly available.

Links

PDF: File (8.7MB)
BIB: citation

Additional Material

Video

The video demonstrates the MCMCDA based head tracking system, which runs at 25fps on 1920x1080 video using a standard desktop computer. The system is capable of obtaining stable head images and is robust to temporary occlusions.

Dataset

The Town Centre video and ground truth data can be found on the project page

Copyright Notice

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

Active Vision Laboratory

Department of Engineering Science

University of Oxford