Sciweavers

» Unweaving a web of documents
SIGIR
1998
ACM
15 years 10 months ago
Improved Algorithms for Topic Distillation in a Hyperlinked Environment
This paper addresses the problem of topic distillation on the World Wide Web: given a typical user query, find quality documents related to the query topic. Connectivity...
Krishna Bharat, Monika Rauch Henzinger
DOCENG
2007
ACM
15 years 10 months ago
Structure and content analysis for HTML medical articles: a hidden Markov model approach
We describe ongoing research on segmenting and labeling HTML medical journal articles. In contrast to existing approaches in which HTML tags usually serve as strong indicators, we...
Jie Zou, Daniel X. Le, George R. Thoma
DOCENG
2007
ACM
15 years 10 months ago
XML version detection
The problem of version detection is critical in many important application scenarios, including software clone identification, Web page ranking, plagiarism detection, and peer-to-...
Deise de Brum Saccol, Nina Edelweiss, Renata de Ma...
FLAIRS
2008
15 years 9 months ago
QueSTS: A Query Specific Text Summarization System
Effective extraction of query relevant information present within documents on the web is a nontrivial task. In this paper we present our system called QueSTS, which does the abov...
M. Sravanthi, C. Ravindranath Chowdary, P. Sreeniv...
ECIR
2006
Springer
15 years 8 months ago
Improving Quality of Search Results Clustering with Approximate Matrix Factorisations
In this paper we show how approximate matrix factorisations can be used to organise document summaries returned by a search engine into meaningful thematic categories. We...
Stanislaw Osinski