Carnegie Mellon Machine Learning Lunch seminar

Feature Selection via Block-Regularized Regression

author: Seyoung Kim, Carnegie Mellon University

Description

Identifying co-varying causal elements in a very high-dimensional feature space with internal structure, e.g., a space with as many as millions of linearly ordered features, as one typically encounters in problems such as whole-genome association (WGA) mapping, remains an open problem in statistical learning. We propose a block-regularized regression model for sparse variable selection in a high-dimensional space where the covariates are linearly ordered and possibly subject to local statistical linkages (e.g., block structures) due to spatial or temporal proximity of the features.

Our goal is to identify a small subset of relevant covariates that do not merely come from random positions in the ordering but are grouped as contiguous blocks within a large number of ordered covariates. Following a typical linear regression framework between the features and the response, our proposed model employs a sparsity-enforcing Laplacian prior for the regression coefficients, augmented by a first-order Markov process along the feature sequence that "activates" the regression coefficients in a coupled fashion. We describe a sampling-based learning algorithm and demonstrate the performance of our method on simulated and biological data for marker identification under WGA.
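
To make the idea of coefficients "activated" in contiguous blocks more concrete, the following is a minimal generative sketch in Python, not the model or the sampling algorithm presented in the talk. It assumes a two-state (inactive/active) first-order Markov chain over the feature ordering, coefficients that are exactly zero when inactive and Laplace-distributed when active, independent binary features, and Gaussian noise. All function names, transition probabilities, and scale values are hypothetical choices made for illustration.

```python
import random


def sample_activation_chain(n_features, p_stay_on, p_turn_on, rng):
    """Draw 0/1 relevance indicators from a first-order Markov chain.

    Assumption: the chain starts in the inactive state. A high p_stay_on
    and a low p_turn_on yield a few long runs of active features.
    """
    states = [0]
    for _ in range(1, n_features):
        p_on = p_stay_on if states[-1] == 1 else p_turn_on
        states.append(1 if rng.random() < p_on else 0)
    return states


def sample_laplace(scale, rng):
    """Draw from a zero-mean Laplace distribution with the given scale.

    The difference of two i.i.d. exponential draws with mean `scale`
    follows a Laplace(0, scale) distribution.
    """
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def contiguous_blocks(active):
    """Return (start, end) index pairs, inclusive, of runs of active features."""
    blocks, start = [], None
    for j, a in enumerate(active):
        if a and start is None:
            start = j
        elif not a and start is not None:
            blocks.append((start, j - 1))
            start = None
    if start is not None:
        blocks.append((start, len(active) - 1))
    return blocks


def simulate(n_samples, n_features, seed=0):
    """Simulate a response from block-structured sparse coefficients."""
    rng = random.Random(seed)
    # Hypothetical transition probabilities, chosen only for illustration.
    active = sample_activation_chain(n_features, 0.9, 0.02, rng)
    # Assumption: inactive coefficients are exactly zero.
    beta = [sample_laplace(1.0, rng) if a else 0.0 for a in active]
    # Assumption: independent binary features (real markers are correlated).
    X = [[rng.choice((0, 1)) for _ in range(n_features)]
         for _ in range(n_samples)]
    y = [sum(x * b for x, b in zip(row, beta)) + rng.gauss(0.0, 0.1)
         for row in X]
    return active, beta, X, y


if __name__ == "__main__":
    active, beta, X, y = simulate(n_samples=50, n_features=300)
    print("active blocks (start, end):", contiguous_blocks(active))
```

Because the chain tends to remain in its current state, the nonzero coefficients appear in runs rather than at isolated positions, which is the kind of block structure the abstract describes. Inference in the reverse direction (recovering the active blocks from the features and the response) is what the sampling-based learning algorithm in the talk addresses and is not attempted here.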

Slides
0:00 Feature Selection via Block Regularized Regression
0:17 Block-Regularized Regression
0:58 Sparse Regression (1)
1:31 Sparse Regression (2)
2:12 Sparse Regression (3)
3:28 Single Nucleotide Polymorphism (SNP)
4:02 Association Mapping
5:20 Association Mapping as Regression
6:04 Sparse Regression
6:28 Recombination
7:20 After Many Generations with Recombination (1)
7:46 After Many Generations with Recombination (2)
8:16 Variable Selection Methods for Association Mapping
8:41 Bayesian Variable Selection (1)
9:04 Bayesian Variable Selection (2)
9:32 Bayesian Variable Selection (3)
10:06 Block-regularized Regression with Markov Chain Prior (1)
10:23 Block-regularized Regression with Markov Chain Prior (2)
11:11 Block-regularized Regression with Markov Chain Prior (3)
11:56 Block-regularized Regression
12:47 Learning with MCMC
13:09 Experiments
14:01 Simulations (1)
14:37 Simulations (2)
14:57 Simulations (3)
15:01 Simulations (4)
15:04 Simulations (5)
15:20 Posterior Probabilities for Being Relevant
16:21 Precision and Recall (1)
17:26 Precision and Recall (2)
17:43 Precision and Recall (3)
17:53 Mouse Data (BROAD institute)
18:05 Conclusions
