Document Type

Conference Paper

Publication Date


Publication Source

Proceedings ofSPIE 9407, Video Surveillance and Transportation Imaging Applications 2015


We propose a real time person identification algorithm for surveillance based scenarios from low-resolution streaming video, based on mid-level features extracted from the joint distribution of various types of human actions and human poses.

The proposed algorithm uses the combination of an auto-encoder based action association framework which produces per-frame probability estimates of the action being performed, and a pose recognition framework which gives per-frame body part locations.

The main focus in this manuscript is to effectively combine these per-frame action probability estimates and pose trajectories from a short temporal window to obtain mid-level features. We demonstrate that these mid-level features captures the variation in the action performed with respect to an individual and can be used to distinguish one person from the next. Preliminary analysis on the KTH action dataset where each sequence is annotated with a specific person and a specific action is provided and shows some interesting results which verify this concept.

Inclusive pages

94070P-1 to 94070P-7



Document Version

Published Version


This document is provided for download in compliance with the publisher's policy on self-archiving. Permission documentation is on file.



Society of Photo-optical Instrumentation Engineers

Place of Publication

San Francisco, CA