Sciweavers

» Information Retrieval from the Web: An Interactive Paradigm
WWW
2007
ACM
16 years 7 months ago
Detecting near-duplicates for web crawling
Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...
Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma
WWW
2008
ACM
16 years 7 months ago
Genealogical trees on the web: a search engine user perspective
This paper presents an extensive study about the evolution of textual content on the Web, which shows how some new pages are created from scratch while others are created using al...
Ricardo A. Baeza-Yates, Álvaro R. Pereira J...
CIKM
2009
Springer
16 years 1 month ago
Vetting the links of the web
Many web links mislead human surfers and automated crawlers because they point to changed content, out-of-date information, or invalid URLs. It is a particular problem for large, ...
Na Dai, Brian D. Davison
CIKM
2009
Springer
15 years 4 months ago
Interactive relevance feedback with graded relevance and sentence extraction: simulated user experiments
Research on relevance feedback (RFB) in information retrieval (IR) has given mixed results. Success in RFB seems to depend on the searcher's willingness to provide feedback a...
Kalervo Järvelin
WWW
2001
ACM
16 years 7 months ago
Media Browser: An Example of Metadata-Based Browsing
Current methods for finding relevant content, especially in media-rich web environments, suggest that metadata is critical for accurate and efficient information retrieval. We des...
Alison Lennon, Daniel Lloyd-Jones, Ernest Wan