Sciweavers

403 search results - page 23 / 81
» Testing the cluster hypothesis in distributed information re...
Sort
View
CCGRID
2006
IEEE
16 years 2 days ago
ReCon: A Fast and Reliable Replica Retrieval Service for the Data Grid
The Data Grid provides a scalable infrastructure for storage resources and data distribution management. It also supports a variety of scientific applications that require access...
XiaoLi Zhou, Eunsung Kim, Jai Wug Kim, Heon Young ...
SIGIR
2002
ACM
15 years 5 months ago
Document clustering with cluster refinement and model selection capabilities
In this paper, we propose a document clustering method that strives to achieve: (1) a high accuracy of document clustering, and (2) the capability of estimating the number of clus...
Xin Liu, Yihong Gong, Wei Xu, Shenghuo Zhu
WWW
2005
ACM
16 years 6 months ago
Clustering for probabilistic model estimation for CF
Based on the type of collaborative objects, a collaborative filtering (CF) system falls into one of two categories: item-based CF and user-based CF. Clustering is the basic idea i...
Qing Li, Byeong Man Kim, Sung-Hyon Myaeng
AUSDM
2006
Springer
112views Data Mining» more  AUSDM 2006»
15 years 9 months ago
Accuracy Estimation With Clustered Dataset
If the dataset available to machine learning results from cluster sampling (e.g. patients from a sample of hospital wards), the usual cross-validation error rate estimate can lead...
Ricco Rakotomalala, Jean-Hugues Chauchat, Fran&cce...
MM
2009
ACM
197views Multimedia» more  MM 2009»
16 years 15 days ago
Visual summaries of popular landmarks from community photo collections
We present a novel data-driven algorithm that leverages online image repositories such as Flickr for automatically generating tourist maps. Our hypothesis is that, given a large e...
Wei-Chao Chen, Agathe Battestini, Natasha Gelfand,...