Influence and Correlation in Social Networks

author: Mohammad Mahdian, Yahoo! Research
published: Aug. 25, 2008,   recorded: July 2008,   views: 10466


Related Open Educational Resources

Related content

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.


In many online social systems, social ties between users play an important role in dictating users' behavior. One of the ways this can happen is through social influence, the phenomenon that the actions of a user can induce his/her friends to behave in a similar way. In systems where social influence exists, ideas, modes of behavior, or new technologies can diffuse through the network like an epidemic. Therefore, identifying and understanding social influence is of tremendous interest from both an analysis (e.g., predicting the future of the system) and a design (e.g., designing viral marketing strategies) point of view.

In this talk, I will give a general overview of models for diffusion in social network, and then discuss the problem of identifying social influence in the data. This is a difficult task in general, since there are many other factors such as homophily or unobserved confounding variables that can induce statistical correlation between the actions of friends in a social network. Thus, distinguishing influence from those other factors is essentially the problem of distinguishing correlation from causality, a notoriously hard problem. Despite this, I will show how in an environment where the time stamp of the actions are observable, we can design simple statistical tests that distinguish between models of social influence and those that replicate the aforementioned sources of social correlation. I will sketch the proof of a theoretical justification of one of the tests, and present simulation results on randomly generated data and real tagging data from Flickr. The results exhibit that while there is significant social correlation in tagging behavior on this system, this correlation cannot be attributed to social influence.

See Also:

Download slides icon Download slides: mlg08_mahdian_icsn_01.pdf (1.2 MB)

Download slides icon Download slides: mlg08_mahdian_icsn_01.ppt (1.8 MB)

Help icon Streaming Video Help

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Reviews and comments:

Comment1 RR, September 19, 2008 at 7:27 a.m.:

The video for this doesn't work.

Comment2 tayfun, May 25, 2009 at 9:35 a.m.:

About the obesity study talked in the presentation, is it really "having an obese friend increases chance of obesity" or "obese people befriend other obese people"? I believe it's important to study which causes which, because if you don't then you might arrive at the wrong results.


Comment3 maria lovely, January 14, 2021 at 6:40 a.m.:

very nice thanks for sharing

Comment4 alta, January 14, 2021 at 6:41 a.m.:

amaziing thanks for sharing such a nice post

Write your own review or comment:

make sure you have javascript enabled or clear this field: