Identifying Temporal Patterns and Key Players in Document Collections
Description
We consider the problem of analyzing the development of a document collection over time without requiring meaningful citation data. Given a collection of timestamped documents, we formulate and explore the following two questions. First, what are the main topics and how do these topics develop over time? Second, to gain insight into the dynamics driving this development, what are the documents and who are the authors that are most influential in this process? Unlike prior work in citation analysis, we propose methods addressing these questions without requiring the availability of citation data. The methods use only the text of the documents as input. Consequentially, they are applicable to a much wider range of document collections (email, blogs, etc.), most of which lack meaningful citation data. We evaluate our methods on the proceedings of the Neural Information Processing Systems (NIPS) conference. Even with the preliminary methods that we implemented, the results show that the methods are effective and that addressing the questions based on the text alone is feasible. In fact, the text-based methods sometimes even identify influential papers that are missed by citation analysis.
| Slides | |
| 0:01 | Identifying Temporal Patterns and Key Players in Document Collections |
| 0:55 | Introduction and Goals |
| 1:11 | Key Contribution |
| 1:37 | Temporal Cluster Histograms: Goals |
| 2:03 | Temporal Cluster Histograms: Method |
| 2:45 | Temporal Cluster Histograms: Results |
| 4:02 | Related Work (Topics) |
| 4:25 | Influential Documents: Goals |
| 4:52 | Related Work (Influence) |
| 5:10 | Influential Documents: Method |
| 6:36 | Influential Documents: Results |
| 8:50 | Influential Authors: Goals |
| 9:03 | Influential Authors: Method |
| 9:36 | Influential Authors: Results |
| 10:45 | Conclusions |
| 11:15 | Future Directions |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !





