event thumbnail image
Solomonovi seminarji

Social-media blog tagging: Metadata or “just more content” ?

author: Bettina Berendt, Humboldt-Universität zu Berlin

Description

The authoring of tags -- unlike the authoring of traditional metadata -- is highly popular among users. This harbours unprecedented opportunities for organizing content. However, tags are still poorly understood. What do they ''mean'', in what senses are they similar to or different from metadata? Different tags support different communities, but how exactly do they reflect the plurality of opinions,what is the relation to individual differences in authoring and reading? In this paper, we offer a definition and empirical evidence for the claim that ''tags are not metadata, but just more content''. The analysis rests on a multi-annotator classification of a blog corpus using the WordNet domain labels system (WND), the development of a system of text-classification methods using WordNet and WND, and a quantitative and qualitative comparative analysis of these classifications. We argue that the notion of a ''gold standard'' may be meaningless in social media, and we outline possible consequences for labelling and search-engine development.

You might be experiencing some problems with Your Video player.
Slides
0:00 Social-media blog tagging: Metadata or “just more content” ?
0:04 Acknowledgements
0:10 Agenda
0:16 Agenda - Blogs and other social media
0:22 Blogs
0:28 Blogs and other social media („Web 2.0“)
0:34 Blogs and other social media ,and their activity focus
0:41 Blogs and other social media, and some of their origins in older media
0:47 Blogs: Publication bordering on communication
0:53 Blogs and other social media: Where tagging (= adding keywords) is most prominent
0:59 Agenda - What blog tags (should) say: “executive summary“ of this talk
1:04 Tags in blogs (I): What does a tag tell us in terms of what a blog is about?
1:11 Tags in blogs (II): However ...
1:44 Agenda - Tag functions
3:36 1 million new tags every month
5:36 Tag functions (Golder & Huberman, 2006) in del.icio.us
8:04 Tag functions mirror standard metadata elements (here: Dublin Core)
9:08 Do we expect tags / metadata to add to content?
10:57 How do author tags relate to content?
11:01 Agenda - Empirical study (1)
11:05 Empirical study – Part I: Overview
11:40 Data
13:40 Classification taxonomy: WordNet Domains (Magnini & Cavaglia, 2000) –excerpt
14:47 Human annotations (“reader tags“)
16:27 Automated classification
19:10 WordNet: words -> senses
19:34 WordNet Domains as the classification hierarchy: senses -> domains
21:06 Study I: Results
22:59 Agenda - Empirical study (2)
23:11 Part II: Do automated methods err in the same way? What about automated-method – annotator suitability?
24:03 Similarity between methods
26:02 Combining automated methods
28:01 Tags provide additional information
29:35 A closer look at agreement
30:18 Tags don't give all the information – go on reading!
31:26 Interpretation requires knowledge of blogspace authoring conventions
32:30 Agenda - Some remarks on method
32:45 Searching for the basic level: Coarsening to hierarchy level 2
35:26 News are easier for content classification: Comparison classification blog corpus – Reuters RCV1
38:15 Summary
40:08 Outlook
57:08 Sources

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment: