event thumbnail image
Machine Learning Summer School 2006 - Taipei

Protein Subcellular Localization Prediction Based on Compartment-Specific Biological Features

author: Chia-Yu Su, Institute of Information Science, Academia Sinica

Description

Prediction of subcellular localization of proteins is important for genome annotation, protein function prediction, and drug discovery. We present a prediction method for Gram-negative bacteria that uses ten one-versus-one support vector machine (SVM) classifiers, where compartment-specific biological features are selected as input to each SVM classifier. The final prediction of localization sites is determined by inte-grating the results from ten binary classifiers using a combination of majority votes and a probabilistic method. The overall accuracy reaches 91.4%, which is 1.6% better than the state-of-the-art system, in a ten-fold cross-validation evaluation on a bench-mark data set. We demonstrate that feature selection guided by biological knowledge and insights in one-versus-one SVM classifiers can lead to a significant improvement in the prediction performance. Our model is also used to produce highly accurate prediction of 92.8% overall accuracy for proteins of dual localizations.

You might be experiencing some problems with Your Video player.
Slides
0:05 Protein Subcellular Localization Prediction
Based on Support Vector Machines
1:25 Outline
2:19 Protein Subcellular Localization (PSL)
Prediction
3:15 Importance of PSL Prediction
4:20 Current PSL Prediction for Gram-
Negative Bacteria
5:47 Outline
6:06 Multiclass Classification in SVM
6:45 Outline
6:54 Multiclass Classification by 1-v-r SVM
8:02 General Biological Features for PSL
Prediction
8:21 1. Amino Acid Composition
2. Dipeptide Composition
9:14 3. Secondary Structure Elements
10:16 Training and Testing in SVM
11:11 Gram-Negative Bacteria Data Set
11:30 Performance Evaluation
12:06 Results of 1-v-r SVM Model
12:41 Outline
12:54 Multiclass Classification by 1-v-1 SVM
14:27 Accuracy and Feature Combination
15:16 Outline
15:38 Compartment-Specific Biological Features
15:58 Compartment-Specific Features in
Bacterial Secretory Pathways
17:15 More Compartment-Specific Biological
Features
17:32 Compartment-Specific Biological Features
18:26 System Architecture of PSL101
19:28 Feature Selection
20:32 Accuracy and Feature Combination
21:15 Outline
21:35 Refined Encoding Schemes to Encode
Protein Structures (SSE)?
23:07 A New Encoding Scheme (EC2) for SSE
24:14 Accuracy and Feature Combination
25:00 Outline
25:05 Conclusion
26:53 People
27:34 Thank You!
27:40 Questions?

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment: