Cost-effective Outbreak Detection in Networks
Description
Which blogs should we read to avoid missing important information? Where should we place sensors in a water distribution network to quickly detect contaminants? These seemingly different problems share common structure: Outbreak detection can be modeled as a problem of selecting nodes (blogs, sensor locations, ...) in a network, in order to detect the spreading of a virus or information as quickly as possible. We present a general methodology for near optimal sensor placement in these and related problems. We demonstrate that many realistic outbreak detection objectives (e.g., detection likelihood, population affected) exhibit the property of ``submodularity''. We exploit submodularity to develop an efficient algorithm that scales to large problems, provably achieving near optimal placements, while being 700 times faster than a simple greedy algorithm. We evaluate our approach on several large real-world problems, including a model of a water distribution network, and real blog data. We also show how the approach leads to deeper insights in both applications, answering multicriteria trade-off, cost-sensitivity and generalization questions. Joint work with: Andreas Krause, Carlos Guestrin, Christos Faloutsos, Jeanne VanBriesen and Natalie Glance Recepient of best student paper award at ACM SIGKDD '07 conference.
| Slides | |
| 0:00 | Cost-effective Outbreak Detection in Networks |
| 0:42 | Diffusion in Social Networks - Page 1 |
| 1:04 | Diffusion in Social Networks - Page 2 |
| 1:40 | Empirical Studies of Diffusion |
| 2:34 | Diffusion in Networks |
| 3:14 | Scenario 1: Water Network |
| 3:45 | Scenario 2: Online media |
| 4:16 | Cascade Detection: General Problem |
| 5:18 | Two Parts to the Problem |
| 6:46 | Problem Setting |
| 7:37 | Structure of the Problem |
| 8:31 | Analysis |
| 9:21 | An Approximation Result |
| 11:08 | Reward functions: Submodularity |
| 12:18 | Reward Functions are Submodular |
| 13:01 | Background: Submodular functions |
| 14:43 | Towards a New Algorithm |
| 15:51 | Benefit‐Cost: More Problems |
| 16:55 | Solution: CELF Algorithm |
| 18:00 | How good is the solution? |
| 18:47 | Scaling up CELF algorithm |
| 19:34 | Scaling up CELF - page 1 |
| 20:07 | Scaling up CELF - page 2 |
| 21:59 | Experiments: 2 Case Studies |
| 22:51 | Case study 1: Cascades in Blogs |
| 23:27 | Diffusion in Blogs |
| 25:37 | Q1: Blogs: Solution Quality |
| 27:04 | Q2: Blogs: Cost of a Blog - part 1 |
| 27:49 | Q2: Blogs: Cost of a Blog - part 2 |
| 29:10 | Q4: Blogs: Heuristic Selection |
| 31:03 | Blogs: Generalization to Future |
| 34:01 | Q5: Blogs: Scalability |
| 35:16 | Case study 2: Water Network |
| 37:01 | Water: Solution Quality |
| 37:22 | Water: Heuristic Placement |
| 38:46 | Water: Placement Visualization |
| 39:41 | Water: Algorithm Scalability |
| 40:05 | Results of BWSN competition |
| 41:27 | Other results |
| 42:56 | Conclusion |
| 43:30 | Conclusion and Connections |
| 43:55 | Topic Diffusion: what blogs to read? |
| 45:29 | Diffusion in Blogs |
| 47:18 | Water: Placement Visualization |
| 48:08 | Further Connections |
| 48:45 | References |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
SEE ALSO:
Link this page
Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !





