Sciweavers

» On the Instability of Web Search Engines
SIGIR
2006
ACM
16 years 9 days ago
Finding near-duplicate web pages: a large-scale evaluation of algorithms
Broder et al.’s [3] shingling algorithm and Charikar’s [4] random projection based approach are considered “state-of-the-art” algorithms for finding near-duplicate web pag...
Monika Rauch Henzinger
ACL
2010
15 years 4 months ago
Speech-Driven Access to the Deep Web on Mobile Devices
The Deep Web is the collection of information repositories that are not indexed by search engines. These repositories are typically accessible through web forms and contain dynami...
Taniya Mishra, Srinivas Bangalore
DC
2001
15 years 7 months ago
Metadata Interoperability and Meta-search on the Web
Several initiatives for establishing standards for metadata models are being carried out at the moment, but everyone focuses on their own requirements when defining metadata attri...
Enric Peig, Jaime Delgado, Ismael Pérez
WWW
2010
ACM
16 years 1 month ago
New-web search with microblog annotations
Web search engines discover indexable documents by recursively ‘crawling’ from a seed URL. Their rankings take into account link popularity. While this works well, it introduc...
Tom Rowlands, David Hawking, Ramesh Sankaranarayan...
SIGIR
2010
ACM
15 years 10 months ago
The importance of anchor text for ad hoc search revisited
It is generally believed that propagated anchor text is very important for effective Web search as offered by the commercial search engines. “Google Bombs” are a notable illus...
Marijn Koolen, Jaap Kamps