Structured Output Prediction of Enzyme Function via Reaction Kernels
Description
Enzyme function prediction is an important problem in post-genomic
bioinformatics. There are two general methods for solving the problem:
transfer of annotation from a similar, already annotated protein, and
machine learning approaches that treat the problem as classification
against a fixed taxonomy, such as Gene Ontology or the EC hierarchy.
These methods are suitable in cases where the function has been
previously characterized and included in the taxonomy. However, given a
new function that is not previously described, existing approaches
arguably do not offer adequate support for the human expert. In this presentation, we I will present a structured output learning
approach, where the enzyme function, an enzymatic reaction, is described
in fine-grained fashion with so called reaction kernels which allow
interpolation and extrapolation in the output (reaction) space. A
structured output model is learned to predict enzymatic reactions from
sequence motifs. We bring forward several choices for constructing
reaction kernels and experiment with them in the remote homology case
where the functions in the test set have not been seen in the training
phase. Our experiments demonstrate the viability of our approach.
| Slides | |
| 0:00 | Structured Output Prediction of Enzyme Function via Reaction Kernels |
| 1:39 | Structured Output Prediction |
| 4:36 | Example: sequence-to-sequence learning |
| 6:01 | Example: Hierachical Multilabel Classification |
| 7:01 | Prediction Enzyme Function -1 |
| 9:56 | Prediction Enzyme Function -2 |
| 11:56 | Structured Output Prediction with Kernels |
| 16:06 | The "standard" optimization problem |
| 21:58 | Max-Margin Regression, MMR |
| 25:27 | Towards simple(r) structured prediction |
| 26:51 | Structured prediction via Kernel density estimation |
| 30:05 | Tensor product feature map |
| 32:41 | Input feature maps 1: string kernels |
| 34:24 | Input features 2: Bag of conserved resitudes - GTG |
| 37:17 | Output features 1: Embedding of a hierarchy |
| 38:18 | Classification hierarchies for protein function |
| 39:06 | Output features 2: Reactant Matching kernel |
| 41:39 | Leveraging the kernel trick |
| 43:39 | Experimental setup |
| 46:22 | Importance of the polynomial kernel |
| 47:56 | Predicting EC codes not seen in training |
| 51:07 | Conclusions |
| 55:35 | Structured Output Prediction of Enzyme Function via Reaction Kernels |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
SEE ALSO:
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !





Structured Output Prediction of Enzyme Function via Reaction Kernels
author: Juho Rousu, University of Helsinki
Description
Enzyme function prediction is an important problem in post-genomic bioinformatics. There are two general methods for solving the problem: transfer of annotation from a similar, already annotated protein, and machine learning approaches that treat the problem as classification against a fixed taxonomy, such as Gene Ontology or the EC hierarchy. These methods are suitable in cases where the function has been previously characterized and included in the taxonomy. However, given a new function that is not previously described, existing approaches arguably do not offer adequate support for the human expert.
In this presentation, we I will present a structured output learning approach, where the enzyme function, an enzymatic reaction, is described in fine-grained fashion with so called reaction kernels which allow interpolation and extrapolation in the output (reaction) space. A structured output model is learned to predict enzymatic reactions from sequence motifs. We bring forward several choices for constructing reaction kernels and experiment with them in the remote homology case where the functions in the test set have not been seen in the training phase. Our experiments demonstrate the viability of our approach.