The 7th International Symposium on Intelligent Data Analysis

A Support Vector Machine Approach to Dutch Part-of-Speech Tagging

author: Mannes Poel, Twente University

Description

Part-of-Speech tagging, the assignment of Parts-of-Speech to words in a given context of use, is a basic technique in many systems that handle natural languages. This paper describes a method for supervised training of a Part-of-Speech tagger using a committee of Support Vector Machines on a large corpus of annotated transcriptions of spoken Dutch. Special attention is paid to decomposing the large data set into parts for common, uncommon and unknown words. This decomposition not only solves the space problems caused by the amount of data but also improves tagging time. The resulting tagger reaches an accuracy of 97.54%, and its tagging speed is reasonably good.
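
The decomposition into common, uncommon and unknown words can be pictured as a routing step that decides which member of the committee handles each token. The following is a minimal sketch of that idea only, not the method from the paper: the frequency threshold, the function names `partition_vocabulary` and `route`, and the use of raw training counts as the criterion are all assumptions made for illustration.

```python
from collections import Counter


def partition_vocabulary(training_tokens, common_threshold=50):
    """Split the training vocabulary into common and uncommon words.

    common_threshold is a hypothetical cut-off; the abstract does not
    state how the groups were actually defined.
    """
    counts = Counter(training_tokens)
    common = {w for w, c in counts.items() if c >= common_threshold}
    uncommon = set(counts) - common
    return common, uncommon


def route(word, common, uncommon):
    """Name the group (and hence the classifier) that should tag this word."""
    if word in common:
        return "common"
    if word in uncommon:
        return "uncommon"
    # Words never seen during training fall through to the unknown-word group.
    return "unknown"


# Tiny usage example with a deliberately low threshold.
tokens = ["de", "de", "de", "fiets"]
common, uncommon = partition_vocabulary(tokens, common_threshold=2)
groups = [route(w, common, uncommon) for w in ["de", "fiets", "zonnepaneel"]]
```

Under this kind of split, each classifier is trained on only part of the data, which is one way the memory and tagging-time benefits mentioned above could arise.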

Slides
0:00 A Support Vector Machine Approach to Dutch Part of Speech Tagging
0:33 Outline
1:20 CGN: Corpus Spoken Dutch
2:28 Part of Speech Tags
3:38 Tag Set
4:15 Part-of-Speech Tagging
4:47 Goal & Challenge
5:33 Design of the SVM Tagger
7:30 Decomposing the SVM
9:43 Initial Committee of SVM’s
10:36 Training and Test Data
11:15 Initial Evaluation on Validation Set
12:31 Compound Analysis pt 1
13:34 Compound Analysis pt 2
14:10 The Final Committee of SVM’s
14:49 Overall Performance
15:28 More Detailed Performance Analysis
16:09 Conclusions pt 1
16:27 Conclusions pt 2
20:49 Design of the SVM Tagger (a)
