Sciweavers

» Measuring Information Understanding in Large Document Collec...
ECIR
2008
Springer
15 years 7 months ago
Clustering Template Based Web Documents
More and more documents on the World Wide Web are based on templates. On a technical level this causes those documents to have a quite similar source code and DOM tree structure. G...
Thomas Gottron
DIAL
2004
IEEE
15 years 10 months ago
A General System for the Retrieval of Document Images from Digital Libraries
Large collections of scanned documents (books and journals) are now available in Digital Libraries. The most common method for retrieving relevant information from these collectio...
Simone Marinai, Emanuele Marino, Francesca Cesarin...
ECIR
2011
Springer
14 years 9 months ago
Dynamic Two-Stage Image Retrieval from Large Multimodal Databases
Content-based image retrieval (CBIR) with global features is notoriously noisy, especially for image queries with low percentages of relevant images in a collection. More...
Avi Arampatzis, Konstantinos Zagoris, Savvas A. Ch...
SIGIR
2009
ACM
16 years 24 days ago
Building enriched document representations using aggregated anchor text
It is well known that anchor text plays a critical role in a variety of search tasks performed over hypertextual domains, including enterprise search, wiki search, and web search....
Donald Metzler, Jasmine Novak, Hang Cui, Srihari R...
TKDE
1998
15 years 6 months ago
Performance Analysis of Three Text-Join Algorithms
When a multidatabase system contains textual database systems (i.e., information retrieval systems), queries against the global schema of the multidatabase system may contain a ...
Weiyi Meng, Clement T. Yu, Wei Wang 0010, Naphtali...