Search Sciweavers | Sciweavers

5107 search results - page 334 / 1022

» Data Mining and Information Retrieval

139

click to vote

WWW
2008
ACM

109views Internet Technology» more WWW 2008»

Recrawl scheduling based on information longevity

16 years 7 months ago

Download www2008.org

It is crucial for a web crawler to distinguish between ephemeral and persistent content. Ephemeral content (e.g., quote of the day) is usually not worth crawling, because by the t...

Christopher Olston, Sandeep Pandey

claim paper

Read More »

181

click to vote

ICDM
2010
IEEE

164views Data Mining» more ICDM 2010»

Improved Consistent Sampling, Weighted Minhash and L1 Sketching

15 years 4 months ago

Download static.googleusercontent.com

Abstract--We propose a new Consistent Weighted Sampling method, where the probability of drawing identical samples for a pair of inputs is equal to their Jaccard similarity. Our me...

Sergey Ioffe

claim paper

Read More »

213

click to vote

WWW
2011
ACM

282views Internet Technology» more WWW 2011»

Improving recommendation for long-tail queries via templates

15 years 1 months ago

Download www.www2011india.com

The ability to aggregate huge volumes of queries over a large population of users allows search engines to build precise models for a variety of query-assistance features such as ...

Idan Szpektor, Aristides Gionis, Yoelle Maarek

claim paper

Read More »

153

click to vote

CIKM
2005
Springer

96views Information Technology» more CIKM 2005»

On the estimation of frequent itemsets for data streams: theory and experiments

16 years 8 days ago

Download www.lirmm.fr

In this paper, we devise a method for the estimation of the true support of itemsets on data streams, with the objective to maximize one chosen criterion among {precision, recall}...

Pierre-Alain Laur, Richard Nock, Jean-Emile Sympho...

claim paper

Read More »

164

click to vote

AICOM
2005

165views more AICOM 2005»

Integration of hospital data using agent technologies - A case study

15 years 6 months ago

Download cintesis.med.up.pt

Data retrieval and its integration is one of the major problems that face large and complex health organizations. This is especially relevant when patient information is produced i...

Ricardo João Cruz Correia, Pedro Manuel Vie...

claim paper

Read More »

« Prev « First page 334 / 1022 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers