SlovParl 2.0: The Collection of Slovene Parliamentary Debates from the Period of Secession

author: Tomaž Erjavec, Odsek za tehnologije znanja, Institut "Jožef Stefan"
published: May 30, 2018,   recorded: May 2018,   views: 7
released under terms of: Creative Commons Attribution (CC-BY)
Categories

Slides

Related content

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.
  Bibliography

Description

The paper describes the process of acquisition, up-translation, encoding, and annotation of the collection of the parliamentary debates from the Assembly of the Republic of Slovenia from 1990-1992, covering the period before, during, and after Slovenia became an independent country in 1991. The entire collection, comprising 232 sessions, 58,813 speeches and 10.8 million words was uniformly encoded in accordance with the Text Encoding Initiative (TEI) Guidelines, using the TEI module for drama texts. The corpus contains extensive meta-data about the speakers, a typology of sessions etc. and structural and editorial annotations. The corpus was also converted to use the spoken corpus module of TEI, and from this encoding automatically part-of-speech tagged and lemmatised. The corpus is maintained on GitHub and its major versions archived in the CLARIN.SI repository and available for analysis under its KonText and noSketchEngine concordancers, offering an invaluable resource for historians studying this watershed period of Slovenian history.

See Also:

Download slides icon Download slides: parlaCLARIN2018_erjavec_slov_parl_01.pdf (931.2 KB)


Help icon Streaming Video Help

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: