Averaging Support Vector Machines for Processing Large Data Sets
published: Sept. 1, 2008, recorded: July 2008, views: 3385
Report a problem or upload filesIf you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
The handling of large data sets by support vector machines (SVMs)(Vapnik, 1998) employing a nonlinear kernel suffers from the non-linear scaling of the numerical solution techniques for the underlying optimisation problem. This is in particular valid if the kernel matrix cannot be stored in the main memory anymore and therefore the evaluation of the kernel on given data points needs to be recomputed again and again. We investigate a simple approach to allow the processing of larger data sets: We separate the large data set into a number of smaller ones, each small enough to allow the caching of the kernel matrix, and learn a support vector machine for each of these data sets. For the evaluation on data points we then just simply average the results of the different SVMs.
Link this pageWould you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !