An Empirical Evaluation of Supervised Learning in High Dimensions
author:
Nikos Karampatziakis,
Department of Computer Science, Cornell University
Description
In this paper we perform an empirical evaluation of supervised learning methods on high dimensional data. We evaluate learning performance on three metrics: accuracy, AUC, and squared loss. We also study the effect of increasing dimensionality on the relative performance of the learning algorithms. Our findings are consistent with previous studies for problems of relatively low dimension, but suggest that as dimensionality increases the relative performance of the various learning algorithms changes. To our surprise, the methods that seem best able to learn from high dimensional data are random forests and neural nets.
You might be experiencing some problems with Your Video player.
| Slides | |
| 0:00 | An Empirical Evaluation of Supervised Learning in High Dimensions |
| 0:10 | Previous Empirical Comparisons - 1 |
| 1:04 | Previous Empirical Comparisons - 2 |
| 1:12 | Previous Empirical Comparisons - 3 |
| 1:22 | Previous Empirical Comparisons - 4 |
| 1:40 | Motivation |
| 2:27 | Outline |
| 2:40 | Datasets |
| 3:05 | Learning Algorithms - 1 |
| 3:14 | Learning Algorithms - 2 |
| 3:17 | Learning Algorithms - 3 |
| 3:19 | Learning Algorithms - 4 |
| 3:21 | Learning Algorithms - 5 |
| 3:24 | Learning Algorithms - 6 |
| 3:27 | Learning Algorithms - 7 |
| 3:39 | Learning Algorithms - 8 |
| 3:43 | Learning Algorithms - 9 |
| 3:46 | Learning Algorithms - 10 |
| 4:07 | Performance Metrics - 1 |
| 4:47 | Performance Metrics - 2 |
| 5:08 | Calibration |
| 6:22 | Small Difficulty - 1 |
| 6:28 | Small Difficulty - 2 |
| 6:51 | Small Difficulty - 3 |
| 7:16 | Standardization |
| 7:50 | Summary of Methodology |
| 8:06 | Scale of the Study |
| 8:35 | Implementation Tricks |
| 9:57 | Caveats |
| 11:05 | Average Over All Three Metrics - 1 |
| 11:08 | Average Over All Three Metrics - 2 |
| 12:21 | Average Over All Three Metrics - 3 |
| 12:38 | Average Over All Three Metrics - 4 |
| 13:19 | Average Over All Three Metrics - 5 |
| 13:49 | Average Over All Three Metrics - 6 |
| 14:00 | Trends: Moving Average - 1 |
| 14:38 | Trends: Moving Average - 2 |
| 15:14 | Trends: Moving Average - 3 |
| 15:49 | Trends: Moving Average - 4 |
| 16:13 | Trends: Cumulative Performance - 1 |
| 16:54 | Trends: Cumulative Performance - 2 |
| 17:18 | Trends: Cumulative Performance - 3 |
| 17:49 | Trends: Cumulative Performance - 4 |
| 18:08 | Conclusions |
| 19:07 | Acknowledgments |
| 19:42 | - Questions |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
Visitors who watched this lecture also watched...
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !






I am unable to view this lecture in neither Windows XP, nor MacOS X. I can watch other lectures but not this one!