Neighbourhood Components Analysis and Metric Learning
Description
Say you want to do K-Nearest Neighbour classification. Besides
selecting K, you also have to chose a distance function, in order to
define ”nearest”. I’ll talk about a method for learning – from the
data itself – a distance measure to be used in KNN classification. The
learning algorithm, Neighbourhood Components Analysis (NCA) directly
maximizes a stochastic variant of the leave-one-out KNN score
on the training set. Of course, the resulting classification model is
non-parametric, making no assumptions about the shape of the class
distributions or the boundaries between them. I will also discuss an
variant of the method which is a generalization of Fisher’s discriminant
and defines a convex optimization problem by trying to collapse
all examples in the same class to a single point and trying to push
examples in other classes infinitely far away. By approximating the
metric with a low rank matrix, these learning algorithms, can also be
used to obtain a low-dimensional linear embedding of the original input
features allowing that can be used for data visualization and very
fast classification in high dimensions.
| Slides | |
| 0:00 | Learning Quadratic Metrics For Classification |
| 0:42 | DistanceMetric Learning |
| 2:43 | Basic Classifiers Perform Annoyingly Well |
| 3:25 | Instance/Memory Based Classification |
| 5:00 | Problems with Semi-Parametric Classification |
| 5:43 | Link with Feature Extraction/Data Transformation |
| 6:54 | Cross Validation for Metric Learning? |
| 8:12 | Cross-Validation Performance is Hard to Optimize |
| 9:20 | Stochastic Neighbour Selection |
| 11:17 | Expected Leave-One-Out Error |
| 13:07 | Quadratic Metrics Ì Linear TransformsQuadratic Metrics - Linear Transforms |
| 14:55 | Optimizing Expected Performance |
| 17:54 | Neighbourhood Components Analysis |
| 19:27 | Scale of Transformation A is also learned |
| 20:28 | Low RankMetric -Nonsquare A |
| 23:32 | Illustration: Concentric Rings |
| 25:27 | Face Data |
| 26:23 | Related Objective Functions |
| 28:16 | Geometric Intuition-Class Collapsing |
| 29:11 | Maximally Collapsing Metrics |
| 30:54 | MCMLearning is a Convex Optimization Problem |
| 32:05 | Relationship to Fisher’s Discriminant |
| 33:22 | Learning Low-Rank CollapsingMetrics |
| 34:55 | Results |
| 36:27 | Results |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
SEE ALSO:
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !





