event thumbnail image
Tutorials

From Trees to Forests and Rule Sets - A Unified Overview of Ensemble Methods

author: Giovanni Seni, Santa Clara University
author: John Elder, Elder Research, Inc.

Description

Ensemble methods are one of the most influential developments in Machine Learning over the past decade. They perform extremely well in a variety of problem domains, have desirable statistical properties, and scale well computationally. By combining competing models into a committee, they can strengthen “weak” learning procedures.

This tutorial explains two recent developments with ensemble methods:
Importance Sampling reveals “classic” ensemble methods (bagging, random forests, and boosting) to be special cases of a single algorithm. This unified view clarifies the properties of these methods and suggests ways to improve their accuracy and speed.
Rule Ensembles are linear rule models derived from decision tree ensembles. While maintaining (and often improving) the accuracy of the tree ensemble, the rule-based model is much more interpretable.

This tutorial is aimed at both novice and advanced data mining researchers and practitioners especially in Engineering, Statistics, and Computer Science. Users with little exposure to ensemble methods will gain a clear overview of each method. Advanced practitioners already employing ensembles will gain insight into this breakthrough way to create next-generation models.

John Elder's lecture:
In a Nutshell, Examples & Timeline
Predictive Learning
Decision Trees
Giovanni Seni's lecture:
Model Selection (Bias-Variance Tradeoff , Regularization via shrinkage)
Ensemble Learning & Importance Sampling (ISLE)
Generic Ensemble Generation
Bagging, Random Forest, AdaBoost, MART
Rule Ensembles
Interpretation

You might be experiencing some problems with Your Video player.
Slides
0:00 From Trees to Forests and Rule Sets - A Unified Overview of Ensemble Methods
1:14 Tutorial Goals
2:00 Overview
4:09 Overview (2)
4:54 Ensemble Methods in a Nutshell
6:58 Ensemble Methods In a Nutshell (2)
8:20 Examples - Data Mining Products
8:57 Examples - Algorithm Response Surfaces
11:07 Examples - Relative Performance
13:12 Examples - “Bundling” Improves Performance
14:32 Examples - Relative Performance
14:54 Examples - “Bundling” Improves Performance
15:03 Examples - Credit Scoring Performace
18:38 Timeline
20:31 Timeline (2)
24:19 Overview - Predictive Learning
24:23 Predictive Learning - Example
25:20 Predictive Learning - Example (2)
26:33 Predictive Learning - Example (3)
27:55 Predictive Learning - Procedure Summary
29:05 Predictive Learning - Procedure Summary (2)
30:07 Overview - Decision Trees
30:09 Decision Trees - Induction Overview
30:50 Decision Trees - Induction Overview (2)
31:18 Decision Trees - Induction Overview (3)
32:09 Decision Trees - Growing Algorithm
33:40 Decision Trees - Desirable Data Mining Properties
34:36 Decision Trees - Desirable Data Mining Properties (2)
35:40 Decision Trees - Limitations
36:03 Decision Trees - Limitations (2)
36:31 Decision Trees - Limitations (3)
37:42 Overview - Model Selection
40:49 Model Selection - What is the “right” size of a tree?
42:48 Model Selection - Bias−Variance Decomposition
46:19 Model Selection - Bias−Variance Schematic
48:53 Model Selection - Bias−Variance Tradeoff
50:31 Model Selection - Tree Pruning
53:03 Model Selection - Tree Pruning (2)
55:10 Model Selection - Cross−Validation
56:43 Model Selection - Cross−Validation (2)
57:47 Model Selection - Cross−Validation (3)
58:01 - Questions

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

 Watch videos:   (click on thumbnail to launch)

Watch Part 1
Part 1 0:59:31
Flash video Windows Media video

!NOW PLAYING
Watch Part 2
Part 2 1:45:08
Flash video Windows Media video

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: