Collection, storage and analysis of online teenage talk: assets and challenges

author: Reinhild Vandekerckhove, University of Antwerp
published: June 6, 2017,   recorded: May 2017,   views: 1085
released under terms of: Creative Commons Attribution (CC-BY)


Related Open Educational Resources

Related content

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.


I will address a range of issues based on 10 years of experience with sociolinguistic research on informal computer-mediated communication (CMC) produced by youngsters. Starting from the two main datasets we are currently working with (corpus 2007-2013 and corpus 2015-2016), I’ll discuss some challenges with respect to gathering data on the social profile of the informants and some ethical issues. Next, attention will be devoted to the consequences of the size and (often imbalanced) composition of CMC-corpora for the data processing. In order to illustrate the challenges of the genre I'll briefly deal with a specific methodological issue: whether or not to operationalize the occurrence of CMC-features as binary or ordinal variables. Finally, while large corpora generally trigger (and necessitate) quantitative data processing, I want to stress that supplementary qualitative research may be indispensable if we do not want to get alienated from CMC-pragmatics.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: