Towards On-the-fly Large Scale Video Search
published: July 30, 2014, recorded: September 2013, views: 4706
Report a problem or upload filesIf you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
We would like to be able to find anything in an image or video dataset. The talk will describe our progress on visual search for finding people, specific objects and categories in large scale video datasets. The novelty is that the item of interest can be specified at run time by a text query, and a discriminative classifier for that item is then learnt on-the-fly using images downloaded from Google Image search. We will compare state of the art encoding methods for the problem, and discuss the choices in achieving the best trade-off between three important performance measures for a realtime system of this kind, namely: (i) accuracy, (ii) memory footprint, and (iii) speed. We will also describe steps to achieving `total recall'. There will be demonstrations on a large scale video dataset of BBC broadcasts. This is joint work with Relja Arandjelovic, Ken Chatfield and Omkar Parkhi.
Link this pageWould you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !