Dealing with structured and unstructured data at Facebook

author: Lars Backstrom, Facebook
introducer: Marko Grobelnik, Artificial Intelligence Laboratory, Jožef Stefan Institute
published: July 7, 2011,   recorded: May 2011,   views: 1760
Categories

Slides

Related content

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.
  Bibliography

Description

Facebook has undergone tremendous growth in the last five years. Here we will start by looking at some basic statistics and trends that have accompanied this growth. We'll then dive into two different topics. First, we will look at a general trend to make data more structured at Facebook. Having more structured data makes it easier to manage, understand, and leverage it. I will briefly discuss the tools (Hive) that have been built to enable the massive-scale data analysis that goes on at Facebook on a daily basis. In the second part of the talk, I will dive into the details of one of the systems that has contributed to the growth of Facebook: People You May Know. This system generates a significant number of the friend connections on Facebook, and by using increasingly sophisticated machine learning techniques, we have been able to make large improvements to the ranking used by the system since its original launch.

See Also:

Download slides icon Download slides: eswc2011_backstrom_facebook_01.pdf (6.8 MB)


Help icon Streaming Video Help

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: