数据挖掘方面重要会议的最佳paper集合,后续将陆续分析一下内容:
主要有KDD、SIGMOD、VLDB、ICML、SIGIR
| KDD (Data Mining) |
||
| 2013 |
Simple and Deterministic Matrix Sketching |
Edo Liberty, Yahoo! Research |
| 2012 |
Searching and Mining Trillions of Time Series Subsequences under Dynamic Time Warping |
Thanawin Rakthanmanon, University of California Riverside; et al. |
| 2011 |
Leakage in Data Mining: Formulation, Detection, and Avoidance |
Shachar Kaufman, Tel-Aviv University; et al. |
| 2010 |
Large linear classification when data cannot fit in memory |
Hsiang-Fu Yu, National Taiwan University; et al. |
| Connecting the dots between news articles |
Dafna Shahaf & Carlos Guestrin, Carnegie Mellon University |
|
| 2009 |
Collaborative Filtering with Temporal Dynamics |
Yehuda Koren, Yahoo! Research |
| 2008 |
Fastanova: an efficient algorithm for genome-wide association study |
Xiang Zhang, University of North Carolina at Chapel Hill; et al. |
| 2007 |
Predictive discrete latent factor models for large scale dyadic data |
Deepak Agarwal & Srujana Merugu, Yahoo! Research |
| 2006 |
Training linear SVMs in linear time |
Thorsten Joachims, Cornell University |
| 2005 |
Graphs over time: densification laws, shrinking diameters and possible explanations |
Jure Leskovec, Carnegie Mellon University; et al. |
| 2004 |
A probabilistic framework for semi-supervised clustering |
Sugato Basu, University of Texas at Austin; et al. |
| 2003 |
Maximizing the spread of influence through a social network |
David Kempe, Cornell University; et al. |
| 2002 |
Pattern discovery in sequences under a Markov assumption |
Darya Chudova & Padhraic Smyth, University of California Irvine |
| 2001 |
Robust space transformations for distance-based operations |
Edwin M. Knorr, University of British Columbia; et al. |
| 2000 |
Hancock: a language for extracting signatures from data streams |
Corinna Cortes, AT&T Laboratories; et al. |
| 1999 |
MetaCost: a general method for making classifiers cost-sensitive |
Pedro Domingos, Universidade Técnica de Lisboa |
| 1998 |
Occam's Two Razors: The Sharp and the Blunt |
Pedro Domingos, Universidade Técnica de Lisboa |
| 1997 |
Analysis and Visualization of Classifier Performance: Comparison under Imprecise Class and Cost Di... |
Foster Provost & Tom Fawcett, NYNEX Science and Technology |
| SIGMOD (Databases) |
||||
| 2013 |
Massive Graph Triangulation |
Xiaocheng Hu, The Chinese University of Hong Kong; et al. |
||
| 2012 |
High-Performance Complex Event Processing over XML Streams |
Barzan Mozafari, Massachusetts Institute of Technology; et al. |
||
| 2011 |
Entangled Queries: Enabling Declarative Data-Driven Coordination |
Nitin Gupta, Cornell University; et al. |
||
| 2010 |
FAST: fast architecture sensitive tree search on modern CPUs and GPUs |
Changkyu Kim, Intel; et al. |
||
| 2009 |
Generating example data for dataflow programs |
Christopher Olston, Yahoo! Research; et al. |
||
| 2008 |
Serializable isolation for snapshot databases |
Michael J. Cahill, University of Sydney; et al. |
||
| Scalable Network Distance Browsing in Spatial Databases |
Hanan Samet, University of Maryland; et al. |
|||
| 2007 |
Compiling mappings to bridge applications and databases |
Sergey Melnik, Microsoft Research; et al. |
||
| Scalable Approximate Query Processing with the DBO Engine |
Christopher Jermaine, University of Florida; et al. |
|||
| 2006 |
To search or to crawl : towards a query optimizer for text-centric tasks |
Panagiotis G. Ipeirotis, New York University; et al. |
||
| 2004 |
Indexing spatio-temporal trajectories with Che -->
| |||