event thumbnail image
Solomonovi seminarji

Structured Output Prediction of Enzyme Function via Reaction Kernels

author: Juho Rousu, University of Helsinki

Description

Enzyme function prediction is an important problem in post-genomic bioinformatics. There are two general methods for solving the problem: transfer of annotation from a similar, already annotated protein, and machine learning approaches that treat the problem as classification against a fixed taxonomy, such as Gene Ontology or the EC hierarchy. These methods are suitable in cases where the function has been previously characterized and included in the taxonomy. However, given a new function that is not previously described, existing approaches arguably do not offer adequate support for the human expert.

In this presentation, we I will present a structured output learning approach, where the enzyme function, an enzymatic reaction, is described in fine-grained fashion with so called reaction kernels which allow interpolation and extrapolation in the output (reaction) space. A structured output model is learned to predict enzymatic reactions from sequence motifs. We bring forward several choices for constructing reaction kernels and experiment with them in the remote homology case where the functions in the test set have not been seen in the training phase. Our experiments demonstrate the viability of our approach.

You might be experiencing some problems with Your Video player.
Slides
0:00 Structured Output Prediction of Enzyme Function via Reaction Kernels
1:39 Structured Output Prediction
4:36 Example: sequence-to-sequence learning
6:01 Example: Hierachical Multilabel Classification
7:01 Prediction Enzyme Function -1
9:56 Prediction Enzyme Function -2
11:56 Structured Output Prediction with Kernels
16:06 The "standard" optimization problem
21:58 Max-Margin Regression, MMR
25:27 Towards simple(r) structured prediction
26:51 Structured prediction via Kernel density estimation
30:05 Tensor product feature map
32:41 Input feature maps 1: string kernels
34:24 Input features 2: Bag of conserved resitudes - GTG
37:17 Output features 1: Embedding of a hierarchy
38:18 Classification hierarchies for protein function
39:06 Output features 2: Reactant Matching kernel
41:39 Leveraging the kernel trick
43:39 Experimental setup
46:22 Importance of the polynomial kernel
47:56 Predicting EC codes not seen in training
51:07 Conclusions
55:35 Structured Output Prediction of Enzyme Function via Reaction Kernels

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Reviews and comments:

Comment1 suhail, October 17, 2009 at 3:07 p.m.:

Structured Output Prediction of Enzyme Function via Reaction Kernels
author: Juho Rousu, University of Helsinki
Description

Enzyme function prediction is an important problem in post-genomic bioinformatics. There are two general methods for solving the problem: transfer of annotation from a similar, already annotated protein, and machine learning approaches that treat the problem as classification against a fixed taxonomy, such as Gene Ontology or the EC hierarchy. These methods are suitable in cases where the function has been previously characterized and included in the taxonomy. However, given a new function that is not previously described, existing approaches arguably do not offer adequate support for the human expert.

In this presentation, we I will present a structured output learning approach, where the enzyme function, an enzymatic reaction, is described in fine-grained fashion with so called reaction kernels which allow interpolation and extrapolation in the output (reaction) space. A structured output model is learned to predict enzymatic reactions from sequence motifs. We bring forward several choices for constructing reaction kernels and experiment with them in the remote homology case where the functions in the test set have not been seen in the training phase. Our experiments demonstrate the viability of our approach.

Write your own review or comment:

make sure you have javascript enabled or clear this field: