A Joint Segmenting and Labeling Approach for Chinese Lexical Analysis

author: Xinhao Wang, National Laboratory On Machine Perception, Peking University
author: Jiazhong Nie, National Laboratory On Machine Perception, Peking University
author: Dingsheng Luo, National Laboratory On Machine Perception, Peking University
author: Xihong Wu, National Laboratory On Machine Perception, Peking University
published: Oct. 10, 2008, recorded: September 2008




This paper introduces an approach that jointly performs a cascade of segmentation and labeling subtasks for Chinese lexical analysis: word segmentation, named entity recognition, and part-of-speech tagging. Unlike the traditional pipeline approach, the cascaded subtasks are carried out simultaneously in a single step, so error propagation between stages can be avoided and information can be shared across the multi-level subtasks. The approach is built on Weighted Finite State Transducers (WFSTs). Within the unified WFST framework, the model for each subtask is represented as a transducer, and these models are then combined into a single one. One-pass decoding over the combined model then yields the jointly optimal outputs for all levels. The experimental results show that the joint processing approach significantly outperforms the traditional pipeline method.
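The difference between pipeline and joint decoding can be illustrated with a toy example. The sketch below is not the authors' WFST system: it replaces transducer composition with explicit enumeration of candidate analyses, and every cost, segment, and tag in it is invented for illustration. It assumes additive costs (lower is better), which mirrors how path weights combine in the tropical semiring commonly used with WFSTs.

```python
# Toy illustration only: all segments, tags, and costs are hypothetical.
# Costs stand in for negative log-probabilities (lower is better).

# Cost of each candidate segmentation of the placeholder string "ABC".
SEG_COST = {
    ("AB", "C"): 1.0,
    ("A", "BC"): 1.5,
}

# Cost of each candidate tag sequence, given a segmentation.
TAG_COST = {
    ("AB", "C"): {("NN", "VV"): 3.0, ("NN", "NN"): 3.5},
    ("A", "BC"): {("NN", "NN"): 0.5, ("VV", "NN"): 2.0},
}


def pipeline_decode():
    """Commit to the best segmentation first, then tag that segmentation."""
    seg = min(SEG_COST, key=SEG_COST.get)
    tags = min(TAG_COST[seg], key=TAG_COST[seg].get)
    return seg, tags, SEG_COST[seg] + TAG_COST[seg][tags]


def joint_decode():
    """Score every (segmentation, tagging) pair by its summed cost."""
    candidates = [
        (seg, tags, SEG_COST[seg] + tag_cost)
        for seg in SEG_COST
        for tags, tag_cost in TAG_COST[seg].items()
    ]
    return min(candidates, key=lambda cand: cand[2])


if __name__ == "__main__":
    print("pipeline:", pipeline_decode())
    print("joint:   ", joint_decode())
```

With these invented numbers, the pipeline commits to the cheaper segmentation and cannot recover when tagging turns out to be expensive for it, whereas the joint search accepts a slightly worse segmentation in exchange for a much better tagging. A WFST-based system would reach the same kind of joint optimum through composition and a shortest-path search rather than by enumerating candidates.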

See Also:

Download slides: ecmlpkdd08_wang_ajsa_01.ppt (762.5 KB)
