A Support Vector Machine Approach to Dutch Part-of-Speech Tagging
author:
Mannes Poel,
Twente University
Description
Part-of-Speech tagging, the assignment of Parts-of-Speech
to the words in a given context of use, is a basic technique in many
systems that handle natural languages. This paper describes a method
for supervised training of a Part-of-Speech tagger using a committee of
Support Vector Machines on a large corpus of annotated transcriptions
of spoken Dutch. Special attention is paid to the decomposition of the
large data set into parts for common, uncommon and unknown words.
This does not only solve the space problems caused by the amount of
data, it also improves the tagging time. The performance of the resulting
tagger in terms of accuracy is 97.54 %, which is quite good, where the
speed of the tagger is reasonably good.
You might be experiencing some problems with Your Video player.
| Slides | |
| 0:00 | A Support Vector Machine Approach to Dutch Part of Speech Tagging |
| 0:33 | Outline |
| 1:20 | CGN: Corpus Spoken Dutch |
| 2:28 | Part of Speech Tags |
| 3:38 | Tag Set |
| 4:15 | Part-of-Speech Tagging |
| 4:47 | Goal & Challenge |
| 5:33 | Design of the SVM Tagger |
| 7:30 | Decomposing the SVM |
| 9:43 | Initial Committee of SVM’s |
| 10:36 | Training and Test Data |
| 11:15 | Initial Evaluation on Validation Set |
| 12:31 | Compound Analysis pt 1 |
| 13:34 | Compound Analysis pt 2 |
| 14:10 | The Final Committee of SVM’s |
| 14:49 | Overall Performance |
| 15:28 | More Detailed Performance Analysis |
| 16:09 | Conclusions pt 1 |
| 16:27 | Conclusions pt 2 |
| 20:49 | Design of the SVM Tagger (a) |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
Visitors who watched this lecture also watched...
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !





