Sciweavers

12766 search results - page 413 / 2554
» collective
Sort
View
WWW
2001
ACM
16 years 7 months ago
Building a distributed full-text index for the Web
We identify crucial design issues in building a distributed inverted index for a large collection of web pages. We introduce a novel pipelining technique for structuring the core ...
Sergey Melnik, Sriram Raghavan, Beverly Yang, Hect...
196
Voted
KDD
2007
ACM
143views Data Mining» more  KDD 2007»
16 years 7 months ago
Mining Research Communities in Bibliographical Data
Abstract. Extracting information from very large collections of structured, semistructured or even unstructured data can be a considerable challenge when much of the hidden informa...
Osmar R. Zaïane, Jiyang Chen, Randy Goebel
KDD
2006
ACM
143views Data Mining» more  KDD 2006»
16 years 7 months ago
Mining long-term search history to improve search accuracy
Long-term search history contains rich information about a user's search preferences. In this paper, we study statistical language modeling based methods to mine contextual i...
Bin Tan, Xuehua Shen, ChengXiang Zhai
OSDI
2008
ACM
16 years 7 months ago
Hunting for Problems with Artemis
Artemis is a modular application designed for analyzing and troubleshooting the performance of large clusters running datacenter services. Artemis is composed of four modules: (1)...
Gabriela F. Cretu-Ciocarlie, Mihai Budiu, Mois&eac...
258
Voted
SIGMOD
2008
ACM
115views Database» more  SIGMOD 2008»
16 years 7 months ago
Query answering techniques on uncertain and probabilistic data: tutorial summary
Uncertain data are inherent in some important applications, such as environmental surveillance, market analysis, and quantitative economics research. Due to the importance of thos...
Jian Pei, Ming Hua, Yufei Tao, Xuemin Lin