People In Motion: Pose, Action and Communication
published: Oct. 9, 2012, recorded: September 2012, views: 3616
This talk gives an overview of research in the Image and Video Computing Group at Boston University on tracking, analysis, recognition and retrieval of images and video based on humans and their actions. First, efficient methods for inferring human pose will be presented. Linearly augmented tree models are proposed that enable efficient scale- and rotation-invariant matching. In another approach, articulated pose estimation with loopy graph models is made efficient via a branch-and-bound strategy that finds the globally optimal pose. Second, methods for learning human action models from Web images and video will be presented; these methods require no human intervention other than supplying the action keywords used to form text queries to Web image and video search engines. A Multiple Instance Learning framework that exploits properties of the scene, objects, and humans in video is also proposed. Third, work towards automatic recognition and retrieval of American Sign Language (ASL) in video databases will be presented. The goal is to enable users to search ASL video content simply by video-recording a query sign and relying on computer-based sign recognition for lookup.