Sciweavers

KDD
2005
ACM
Variable latent semantic indexing
Anirban Dasgupta, Ravi Kumar, Prabhakar Raghavan, ...
KDD
2005
ACM
Scalable discovery of hidden emails from large folders
The popularity of email has triggered researchers to look for ways to help users better organize the enormous amount of information stored in their email folders. One challenge th...
Giuseppe Carenini, Raymond T. Ng, Xiaodong Zhou
KDD
2005
ACM
Parallel mining of closed sequential patterns
Discovery of sequential patterns is an essential data mining task with broad applications. Among several variations of sequential patterns, closed sequential pattern is the most u...
Shengnan Cong, Jiawei Han, David A. Padua
KDD
2005
ACM
Fast window correlations over uncooperative time series
Richard Cole, Dennis Shasha, Xiaojian Zhao
KDD
2005
ACM
Towards exploratory test instance specific algorithms for high dimensional classification
In an interactive classification application, a user may find it more valuable to develop a diagnostic decision support method which can reveal significant classification behavior...
Charu C. Aggarwal
KDD
2005
ACM
Model-based overlapping clustering
While the vast majority of clustering algorithms are partitional, many real-world datasets have inherently overlapping clusters. Several approaches to finding overlapping clusters...
Arindam Banerjee, Chase Krumpelman, Joydeep Ghosh,...
KDD
2006
ACM
Adaptive Website Design Using Caching Algorithms
Visitors enter a website through a variety of means, including web searches, links from other sites, and personal bookmarks. In some cases the first page loaded satisfies the visi...
Justin Brickell, Inderjit S. Dhillon, Dharmendra S...
KDD
2006
ACM
Understanding Content Reuse on the Web: Static and Dynamic Analyses
In this paper we present static and dynamic studies of duplicate and near-duplicate documents in the Web. The static and dynamic studies involve the analysis of similar c...
Ricardo A. Baeza-Yates, Álvaro R. Pereira J...
KDD
2006
ACM
A Random-Walk Based Scoring Algorithm Applied to Recommender Engines
Recommender systems are an emerging technology that helps consumers find interesting products and useful resources. A recommender system makes personalized product suggestions by e...
Augusto Pucci, Marco Gori, Marco Maggini
KDD
2006
ACM
How to Define Searching Sessions on Web Search Engines
We investigate three methods for defining a session on Web search engines. We examine 2,465,145 interactions from 534,507 Web searchers. We compare defining sessions using: 1) Int...
Bernard J. Jansen, Amanda Spink, Vinish Kathuria