Sciweavers

2190 search results - page 268 / 438
» Unweaving a web of documents
CIKM
2003
Springer
15 years 12 months ago
Using titles and category names from editor-driven taxonomies for automatic evaluation
Evaluation of IR systems has always been difficult because of the need for manually assessed relevance judgments. The advent of large editor-driven taxonomies on the web opens the...
Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury...
WWW
2004
ACM
16 years 7 months ago
Time-based contextualized-news browser (T-CNB)
We propose a new way of browsing contextualized-news articles. Our prototype browser system is called a Time-based Contextualized-News Browser (T-CNB). The T-CNB concurrently and a...
Akiyo Nadamoto, Katsumi Tanaka
WWW
2010
ACM
16 years 1 month ago
CETR: content extraction via tag ratios
We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...
Tim Weninger, William H. Hsu, Jiawei Han
ADAPTIVE
2007
Springer
16 years 26 days ago
Adaptive Focused Crawling
The large amount of available information on the Web makes it hard for users to locate resources about particular topics of interest. Traditional search tools, e.g., search engines...
Alessandro Micarelli, Fabio Gasparetti
EXPDB
2006
ACM
16 years 19 days ago
A Reproducible Benchmark for P2P Retrieval
With the growing popularity of information retrieval (IR) in distributed systems and in particular P2P Web search, a huge number of protocols and prototypes have been introduced i...
Thomas Neumann, Matthias Bender, Sebastian Michel,...