Sciweavers

1161 search results - page 103 / 233
» Efficient search engine measurements
Sort
View
PVLDB
2008
124views more  PVLDB 2008»
15 years 5 months ago
Google's Deep Web crawl
The Deep Web, i.e., content hidden behind HTML forms, has long been acknowledged as a significant gap in search engine coverage. Since it represents a large portion of the structu...
Jayant Madhavan, David Ko, Lucja Kot, Vignesh Gana...
WWW
2009
ACM
16 years 7 months ago
Is there anything worth finding on the semantic web?
There has recently been an upsurge of interest in the possibilities of combining structured data and ad-hoc information retrieval from traditional hypertext. In this experiment, w...
Harry Halpin
WWW
2004
ACM
16 years 7 months ago
Lessons from a Gnutella-web gateway
We present a gateway between the WWW and the Gnutella peer-topeer network that permits searchers on one side to be able to search and retrieve files on the other side of the gatew...
Brian D. Davison, Wei Zhang, Baoning Wu
WWW
2007
ACM
16 years 7 months ago
Designing efficient sampling techniques to detect webpage updates
Due to resource constraints, Web archiving systems and search engines usually have difficulties keeping the entire local repository synchronized with the Web. We advance the state...
Qingzhao Tan, Ziming Zhuang, Prasenjit Mitra, C. L...
EDBT
2004
ACM
133views Database» more  EDBT 2004»
16 years 6 months ago
HOPI: An Efficient Connection Index for Complex XML Document Collections
In this paper we present HOPI, a new connection index for XML documents based on the concept of the 2?hop cover of a directed graph introduced by Cohen et al. In contrast to most o...
Ralf Schenkel, Anja Theobald, Gerhard Weikum