You are here

Person Tracking and Monitoring

Main activities

Non-invasive technologies for monitoring complex environments have applications in diverse domains such as security and surveillance, traffic analysis, assisted living, marketing research, sports analysis etc. This macro-activity is concerned with the development of computer vision technologies for dynamic scene understanding in such application contexts.

Our activities focus on real time tracking of multiple people using multiple cameras to observe a space from different viewpoints.

We develop methods for estimating the head orientation of people from multiple distant cameras integrated with real-time tracking.

Person re-identification consists in matching observations of individuals across disjoint views in a network of surveillance cameras (this task is some times also referred to as multi-camera single person tracking).

We combine real-time tracking with acoustic source localization techniques into an integrated solution for audio-visual monitoring in smart spaces.


  • M. Rusci, D. Rossi, M. Lecca, M. Gottardi, L. Benini, E. Farella
    Energy-efficient design of an always-on smart visual trigger
    IEEE International Smart Cities Conference (ISC2), September 2016, Trento, Italy
  • N. Conci, F.G.B. De Natale, S. Messelodi, C.M. Modena, M. Verza, R. Fioravanti
    An integrated framework for video surveillance in complex environments
    IEEE International Smart Cities Conference (ISC2), September 2016, Trento, Italy
  • S. Messelodi, C.M. Modena
    Boosting Fisher Vector based Scoring Functions for Person Re-Identification
    Image and Vision Computing, Vol. 44, pp 44-58, 2015


TEV4ANALYTICS - The project develops solutions for identity-preserving tracking in multi-camera environment and tools to facilitate their deployment. Main applications are with real-time people monitoring and behaviour analytics in indoor spaces.

EIT TIK - The Interaction Toolkit creates a living repository of some of key enabling interaction technologies and methods in Smart Spaces to accelerate development in other activities, carrier projects or spin-offs.

VIDEO SURVEILLANCE PLATFORM - Building a flexible video surveillance platform to collect data and detect events, designed to assist security operators in their decisions.

Platform for stereo surveillance - The idea is to build a flexible video surveillance platform to collect data and detect events by means of algorithms embedded in high performance stereo cameras.

ACUBE - Ambient Aware Assistance develops technologies for monitoring complex environments that can be applied in areas such as assisted living homes to help personnel, as well as to support the independence and safety of users.

MY-E-DIRECTOR 2012 - Real-Time Context-Aware and Personalized Media Streaming Environments for Large Scale Broadcasting Applications, is an FP7-ICT Project. The user becomes the director in personalized tailored sports broadcasting.

PUMALAB - Multimodal Monitoring and Behavior Analysis is to advance the vision-based and audio-visual monitoring of people and their behavior, and to enable the inference of attention patterns as well as of physically observable social signals.

NETCARITY - A NETworked multisensor system for elderly people: health CARe, safety and securITY in home environment - is an EC FP6 project that proposes a new integrated paradigm for supporting independence and engagement in elderly people living alone at their own home place.

CHIL - Computers in the Human Interaction Loop is a FP6 IST project. Explore and create environments in which computers serve humans who focus on interacting with other humans as opposed to having to attend to and being preoccupied by the machines themselves.

PEACH - Personal Experience with Active Cultural Heritage is about designing and developing tools with which to put personal experience and the formative aspect of cultural heritage appreciation on the foreground, while combining the latter with educational entertainment.


SmarTrack is a patented multi-camera person tracker. It computes the ground location of people utilizing a coarse shape-plus-color signature, and is designed to work effectively in multi-person scenario where frequent and persistent occlusions occur among the persons.