Linguistic annotation of social media corpora: To what extent do we have to adapt existing encoding standards and tag sets? thumbnail
slide-image
Pause
Mute
Subtitles not available
Playback speed
0.25
0.5
0.75
1
1.25
1.5
1.75
2
Full screen

Linguistic annotation of social media corpora: To what extent do we have to adapt existing encoding standards and tag sets?

Published on Dec 21, 20151962 Views

Related categories

Chapter list

Linguistic annotation of CMC and social media corpora00:00
CMC and corpus linguistics04:31
Layers of describing data in corpora - 107:15
Layers of describing data in corpora - 208:39
Layers of describing data in corpora - 308:47
CMC ‘macro-’and ‘microstructures’ - 109:49
CMC ‘macro-’and ‘microstructures’ - 211:01
CMC ‘macro-’and ‘microstructures’ - 311:43
CMC ‘macro-’and ‘microstructures’ - 412:16
ChatCorpus2CLARIN: Project background - 113:02
The corpus14:14
Other corpora / data sets in the project focus15:43
ChatCorpus2CLARIN: Project background - 217:15
ChatCorpus2CLARIN: Project background - 317:39
Ways to handle the lack of standards for CMC corpora - 118:26
Ways to handle the lack of standards for CMC corpora - 221:49
Representation of structural information on the macro and micro level of CMC genres - 123:27
Representation of structural information on the macro and micro level of CMC genres - 225:05
Representation of structural information on the macro and micro level of CMC genres - 326:44
Post29:20
Modeling thread and logfile structures34:36
Schema drafts of the TEI-SIG on CMC http://wiki.36:17
TEI Special Interest Group (SIG) on CMC37:16
Part-of-speech annotations for the microlevel of CMC posts (using NLP tools & tag sets)38:17
The problem - 141:30
The problem - 242:58
Designing a basic PoS tag set for German CMC43:22
STTS 2.0: A basic PoS tag set for German CMC - 144:37
STTS 2.0: A basic PoS tag set for German CMC - 246:19
STTS 2.0: A basic PoS tag set for German CMC - 346:45
STTS 2.0: A basic PoS tag set for German CMC - 446:51
STTS 2.0: A basic PoS tag set for German CMC - 547:17
Tag set and annotation guidelines @EmpiriST201548:56
PoS annotation of the CLARIN-D project: workflow49:55
Manual post-processing of PoS tagging results with OrthoNormal50:58
Vision (1): The CLARIN-D chat corpus as a showcase51:37
Vision (2): Building a “community of best pactices”52:45
To what extent do we have to adapt existing encoding standards and tag sets?54:25