Search Sciweavers | Sciweavers

3152 search results - page 398 / 631

» Retrieval of Partial Documents

WWW
2009
ACM

Internet Technology

User-centric content freshness metrics for search engines

Download www2009.org

In order to return relevant search results, a search engine must keep its local repository synchronized to the Web, but it is usually impossible to attain perfect freshness. Hence...

Ali Dasdan, Xinh Huynh

WWW
2007
ACM

Internet Technology

Detecting near-duplicates for web crawling

Download infolab.stanford.edu

Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...

Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma

KDD
2002
ACM

Data Mining

Topic-conditioned novelty detection

Download www.cs.cmu.edu

Automated detection of the first document reporting each new event in temporally-sequenced streams of documents is an open challenge. In this paper we propose a new approach which...

Yiming Yang, Jian Zhang, Jaime G. Carbonell, Chun ...

SIGMOD
2008
ACM

Database

Building query optimizers for information extraction: the SQoUT project

Download www1.cs.columbia.edu

Text documents often embed data that is structured in nature. This structured data is increasingly exposed using information extraction systems, which generate structured relation...

Alpa Jain, Panagiotis G. Ipeirotis, Luis Gravano

EDBT
2004
ACM

Database

Content-Based Routing of Path Queries in Peer-to-Peer Systems

Download www-db.deis.unibo.it

Peer-to-peer (P2P) systems are gaining increasing popularity as a scalable means to share data among a large number of autonomous nodes. In this paper, we consider the case in whic...

Georgia Koloniari, Evaggelia Pitoura
