event thumbnail image
The 25th International Conference on Machine Learning (ICML 2008)

Nearest Hyperdisk Methods for High-Dimensional Classification

author: Bill Triggs, Laboratoire Jean Kuntzmann

Description

In high-dimensional classification problems it is infeasible to include enough training samples to cover the class regions densely. Irregularities in the resulting sparse sample distributions cause local classifiers such as Nearest Neighbors (NN) and kernel methods to have irregular decision boundaries. One solution is to "fill in the holes" by building a convex model of the region spanned by the training samples of each class and classifying examples based on their distances to these approximate models. Methods of this kind based on affine and convex hulls and bounding hyperspheres have already been studied. Here we propose a method based on the bounding hyperdisk of each class -- the intersection of the affine hull and the smallest bounding hypersphere of its training samples. We argue that in many cases hyperdisks are preferable to affine and convex hulls and hyperspheres: they bound the classes more tightly than affine hulls or hyperspheres while avoiding much of the sample overfitting and computational complexity that is inherent in high-dimensional convex hulls. We show that the hyperdisk method can be kernelized to provide nonlinear classifiers based on non-Euclidean distance metrics. Experiments on several classification problems show promising results.

You might be experiencing some problems with Your Video player.
Slides
0:00 Nearest Hyperdisk Methods for High-Dimensional Classification
0:12 Context
0:59 “Hole” Artifacts
2:35 General Strategy
3:27 Model 1 - Convex Hull of Training Samples
5:26 Model 2 - Affine Hull of Training Samples
6:03 Model 3 - Bounding Hypersphere of Training Samples
7:03 Our Model - Bounding Hyperdisk of Training Samples
7:42 Computing the Bounding Hyperdisk - 1
8:12 Computing the Bounding Hyperdisk - 2
8:33 Kernelizing the Bounding Hyperdisk Model
11:36 Experiments - Toy “4 Disks” Dataset
12:06 Linear Methods on “4 Disks” Dataset
13:04 Experiments - Toy “4 Disks” Dataset
13:21 Linear Methods on “4 Disks” Dataset
14:36 Experiments - Toy “4 Disks” Dataset
15:02 Linear Methods on “4 Disks” Dataset
15:05 Linear Methods on COIL-100 Object Dataset
15:34 Linear Methods on UIUC Birds Dataset
16:15 Kernelized Methods on UCI Datasets
16:44 Summary
17:23 Bonus (CVPR’08) - Using Convex Models to Find Discriminant Directions
18:24 Formulation
18:50 Experiments: Extended Yale-B Face Dataset - 1
18:54 Experiments: Extended Yale-B Face Dataset - 2
19:18 PASCAL Visual Object Challenge 2006 Dataset
19:30 UIUC Birds Dataset
19:42 - Questions
22:34 - Questions

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: