event thumbnail image
The 25th International Conference on Machine Learning (ICML 2008)

An Empirical Evaluation of Supervised Learning in High Dimensions

author: Nikos Karampatziakis, Department of Computer Science, Cornell University

Description

In this paper we perform an empirical evaluation of supervised learning methods on high dimensional data. We evaluate learning performance on three metrics: accuracy, AUC, and squared loss. We also study the effect of increasing dimensionality on the relative performance of the learning algorithms. Our findings are consistent with previous studies for problems of relatively low dimension, but suggest that as dimensionality increases the relative performance of the various learning algorithms changes. To our surprise, the methods that seem best able to learn from high dimensional data are random forests and neural nets.

You might be experiencing some problems with Your Video player.
Slides
0:00 An Empirical Evaluation of Supervised Learning in High Dimensions
0:10 Previous Empirical Comparisons - 1
1:04 Previous Empirical Comparisons - 2
1:12 Previous Empirical Comparisons - 3
1:22 Previous Empirical Comparisons - 4
1:40 Motivation
2:27 Outline
2:40 Datasets
3:05 Learning Algorithms - 1
3:14 Learning Algorithms - 2
3:17 Learning Algorithms - 3
3:19 Learning Algorithms - 4
3:21 Learning Algorithms - 5
3:24 Learning Algorithms - 6
3:27 Learning Algorithms - 7
3:39 Learning Algorithms - 8
3:43 Learning Algorithms - 9
3:46 Learning Algorithms - 10
4:07 Performance Metrics - 1
4:47 Performance Metrics - 2
5:08 Calibration
6:22 Small Difficulty - 1
6:28 Small Difficulty - 2
6:51 Small Difficulty - 3
7:16 Standardization
7:50 Summary of Methodology
8:06 Scale of the Study
8:35 Implementation Tricks
9:57 Caveats
11:05 Average Over All Three Metrics - 1
11:08 Average Over All Three Metrics - 2
12:21 Average Over All Three Metrics - 3
12:38 Average Over All Three Metrics - 4
13:19 Average Over All Three Metrics - 5
13:49 Average Over All Three Metrics - 6
14:00 Trends: Moving Average - 1
14:38 Trends: Moving Average - 2
15:14 Trends: Moving Average - 3
15:49 Trends: Moving Average - 4
16:13 Trends: Cumulative Performance - 1
16:54 Trends: Cumulative Performance - 2
17:18 Trends: Cumulative Performance - 3
17:49 Trends: Cumulative Performance - 4
18:08 Conclusions
19:07 Acknowledgments
19:42 - Questions

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Reviews and comments:

Comment1 athlion, September 21, 2008 at 8:21 p.m.:

I am unable to view this lecture in neither Windows XP, nor MacOS X. I can watch other lectures but not this one!

Write your own review or comment:

make sure you have javascript enabled or clear this field: