Linguistic annotation of social media corpora: To what extent do we have to adapt existing encoding standards and tag sets?

Published on 2015-12-21
1974 views

Presentation

00:00 Linguistic annotation of CMC and social media corpora
04:31 CMC and corpus linguistics
07:15 Layers of describing data in corpora - 1
08:39 Layers of describing data in corpora - 2
08:47 Layers of describing data in corpora - 3
09:49 CMC ‘macro-’ and ‘microstructures’ - 1
11:01 CMC ‘macro-’ and ‘microstructures’ - 2
11:43 CMC ‘macro-’ and ‘microstructures’ - 3
12:16 CMC ‘macro-’ and ‘microstructures’ - 4
13:02 ChatCorpus2CLARIN: Project background - 1
14:14 The corpus
15:43 Other corpora / data sets in the project focus
17:15 ChatCorpus2CLARIN: Project background - 2
17:39 ChatCorpus2CLARIN: Project background - 3
18:26 Ways to handle the lack of standards for CMC corpora - 1
21:49 Ways to handle the lack of standards for CMC corpora - 2
23:27 Representation of structural information on the macro and micro level of CMC genres - 1
25:05 Representation of structural information on the macro and micro level of CMC genres - 2
26:44 Representation of structural information on the macro and micro level of CMC genres - 3
29:20 Post
34:36 Modeling thread and logfile structures
36:17 Schema drafts of the TEI-SIG on CMC http://wiki.
37:16 TEI Special Interest Group (SIG) on CMC
38:17 Part-of-speech annotations for the microlevel of CMC posts (using NLP tools & tag sets)
41:30 The problem - 1
42:58 The problem - 2
43:22 Designing a basic PoS tag set for German CMC
44:37 STTS 2.0: A basic PoS tag set for German CMC - 1
46:19 STTS 2.0: A basic PoS tag set for German CMC - 2
46:45 STTS 2.0: A basic PoS tag set for German CMC - 3
46:51 STTS 2.0: A basic PoS tag set for German CMC - 4
47:17 STTS 2.0: A basic PoS tag set for German CMC - 5
48:56 Tag set and annotation guidelines @EmpiriST2015
49:55 PoS annotation of the CLARIN-D project: workflow
50:58 Manual post-processing of PoS tagging results with OrthoNormal
51:37 Vision (1): The CLARIN-D chat corpus as a showcase
52:45 Vision (2): Building a “community of best practices”
54:25 To what extent do we have to adapt existing encoding standards and tag sets?