Search Sciweavers | Sciweavers

2190 search results - page 316 / 438

» Unweaving a web of documents

221

click to vote

SIGIR
2008
ACM

176views Information Technology» more SIGIR 2008»

SpotSigs: robust and efficient near duplicate detection in large web collections

15 years 6 months ago

Download ilpubs.stanford.edu

Motivated by our work with political scientists who need to manually analyze large Web archives of news sites, we present SpotSigs, a new algorithm for extracting and matching sig...

Martin Theobald, Jonathan Siddharth, Andreas Paepc...

claim paper

Read More »

141

click to vote

WWW
2004
ACM

147views Internet Technology» more WWW 2004»

Building a companion website in the semantic web

16 years 7 months ago

Download www.iw3c2.org

A problem facing many textbook authors (including one of the authors of this paper) is the inevitable delay between new advances in the subject area and their incorporation in a n...

Timothy Miles-Board, Christopher Bailey, Wendy Hal...

claim paper

Read More »

147

click to vote

CIKM
2009
Springer

139views Information Technology» more CIKM 2009»

On the feasibility of multi-site web search engines

16 years 1 months ago

Download research.yahoo.com

Web search engines are often implemented as centralized systems. Designing and implementing a Web search engine in a distributed environment is a challenging engineering task that...

Ricardo A. Baeza-Yates, Aristides Gionis, Flavio J...

claim paper

Read More »

205

click to vote

SEMWEB
2009
Springer

229views Internet Technology» more SEMWEB 2009»

Populating the Semantic Web by Macro-reading Internet Text

16 years 1 months ago

Download rtw.ml.cmu.edu

A key question regarding the future of the semantic web is “how will we acquire structured information to populate the semantic web on a vast scale?” One approach is to enter t...

Tom M. Mitchell, Justin Betteridge, Andrew Carlson...

claim paper

Read More »

203

click to vote

WWW
2009
ACM

189views Internet Technology» more WWW 2009»

Extracting data records from the web using tag path clustering

15 years 11 months ago

Download www2009.org

Fully automatic methods that extract lists of objects from the Web have been studied extensively. Record extraction, the ﬁrst step of this object extraction process, identiﬁes...

Gengxin Miao, Jun'ichi Tatemura, Wang-Pin Hsiung, ...

claim paper

Read More »

« Prev « First page 316 / 438 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers