Sciweavers

2082 search results - page 227 / 417
» Query by document
Sort
View
WWW
2007
ACM
16 years 7 months ago
Detecting near-duplicates for web crawling
Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...
Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma
CIKM
2008
Springer
15 years 8 months ago
A system for finding biological entities that satisfy certain conditions from texts
Finding biological entities (such as genes or proteins) that satisfy certain conditions from texts is an important and challenging task in biomedical information retrieval and tex...
Wei Zhou, Clement T. Yu, Weiyi Meng
DEXA
2003
Springer
115views Database» more  DEXA 2003»
15 years 11 months ago
Storing and Querying XML Data in the Nested Relational Sequence Database System
Abstract. We developed the Nested Relational Sequence Database System (NRSD System), which is built upon the Nested Relational Sequence Model (NRSM). The NRSM eliminates a substant...
Ho Lam Lau, Wilfred Ng
JCDL
2009
ACM
103views Education» more  JCDL 2009»
15 years 11 months ago
Query parameters for harvesting digital video and associated contextual information
Video is increasingly important to digital libraries and archives as both primary content and as context for the primary objects in collections. Services like YouTube not only off...
Gary Marchionini, Chirag Shah, Christopher A. Lee,...
FQAS
2009
Springer
142views Database» more  FQAS 2009»
15 years 11 months ago
On the Selection of the Best Retrieval Result Per Query - An Alternative Approach to Data Fusion
Some recent works have shown that the “perfect” selection of the best IR system per query could lead to a significant improvement on the retrieval performance. Motivated by thi...
Antonio Juárez-González, Manuel Mont...