A tutorial on deep and unsupervised feature learning for activity recognition
published: Aug. 24, 2011, recorded: June 2011, views: 12184
Recognition of human activity from video data is a challenging problem that has received increasing attention from the computer vision community in recent years. Currently, the best-performing methods for this task are based on engineered descriptors with explicit local geometric cues and other heuristics. Until very recently, learning played little role before the classification stage, by which point much of the information in the input has already been lost. It has been shown that learning features in a supervised, unsupervised, or semi-supervised setting can improve performance in other vision tasks, but most of this work has concentrated on static images rather than video. In this tutorial, we will review a number of recently proposed methods that attempt to learn low- and mid-level features for use in activity recognition. These include deep and unsupervised feature learning methods such as convolutional networks, convolutional deep belief networks, and other approaches that learn sparse, overcomplete representations.
Download slides: gesturerecognition2011_taylor_tutorial_01.pdf (23.9 MB)