Social-media blog tagging: Metadata or “just more content” ?
Description
The authoring of tags -- unlike the authoring of traditional metadata -- is highly
popular among users. This harbours unprecedented opportunities for organizing
content. However, tags are still poorly understood. What do they ''mean'',
in what senses are they similar to or different from metadata?
Different tags support different communities, but how exactly
do they reflect the plurality of opinions,what is the relation to individual
differences in authoring and reading? In this paper, we offer a definition and
empirical evidence for the claim that ''tags are not metadata, but just more content''.
The analysis rests on a multi-annotator classification of a blog corpus using
the WordNet domain labels system (WND), the development of a system
of text-classification methods using WordNet and WND, and a quantitative
and qualitative comparative analysis of these classifications. We argue
that the notion of a ''gold standard'' may be meaningless in social media,
and we outline possible consequences for labelling and search-engine development.
| Slides | |
| 0:00 | Social-media blog tagging: Metadata or “just more content” ? |
| 0:04 | Acknowledgements |
| 0:10 | Agenda |
| 0:16 | Agenda - Blogs and other social media |
| 0:22 | Blogs |
| 0:28 | Blogs and other social media („Web 2.0“) |
| 0:34 | Blogs and other social media ,and their activity focus |
| 0:41 | Blogs and other social media, and some of their origins in older media |
| 0:47 | Blogs: Publication bordering on communication |
| 0:53 | Blogs and other social media: Where tagging (= adding keywords) is most prominent |
| 0:59 | Agenda - What blog tags (should) say: “executive summary“ of this talk |
| 1:04 | Tags in blogs (I): What does a tag tell us in terms of what a blog is about? |
| 1:11 | Tags in blogs (II): However ... |
| 1:44 | Agenda - Tag functions |
| 3:36 | 1 million new tags every month |
| 5:36 | Tag functions (Golder & Huberman, 2006) in del.icio.us |
| 8:04 | Tag functions mirror standard metadata elements (here: Dublin Core) |
| 9:08 | Do we expect tags / metadata to add to content? |
| 10:57 | How do author tags relate to content? |
| 11:01 | Agenda - Empirical study (1) |
| 11:05 | Empirical study – Part I: Overview |
| 11:40 | Data |
| 13:40 | Classification taxonomy: WordNet Domains (Magnini & Cavaglia, 2000) –excerpt |
| 14:47 | Human annotations (“reader tags“) |
| 16:27 | Automated classification |
| 19:10 | WordNet: words -> senses |
| 19:34 | WordNet Domains as the classification hierarchy: senses -> domains |
| 21:06 | Study I: Results |
| 22:59 | Agenda - Empirical study (2) |
| 23:11 | Part II: Do automated methods err in the same way? What about automated-method – annotator suitability? |
| 24:03 | Similarity between methods |
| 26:02 | Combining automated methods |
| 28:01 | Tags provide additional information |
| 29:35 | A closer look at agreement |
| 30:18 | Tags don't give all the information – go on reading! |
| 31:26 | Interpretation requires knowledge of blogspace authoring conventions |
| 32:30 | Agenda - Some remarks on method |
| 32:45 | Searching for the basic level: Coarsening to hierarchy level 2 |
| 35:26 | News are easier for content classification: Comparison classification blog corpus – Reuters RCV1 |
| 38:15 | Summary |
| 40:08 | Outlook |
| 57:08 | Sources |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !




