We have studied the automatic construction of a multilingual citation index by collecting PostScript and PDF files from the Internet. We propose a method to identify duplicate bibl...
Information retrieval systems can be partitioned into two main classes: large-scale systems that make use of an inverted index or some other auxiliary data structure, intended for...
In this paper, an effective content-based visual image retrieval system is presented. This system consists of two main components: visual content extraction and indexing, and quer...
In this paper we present and discuss the system we developed for the search task of TRECVID 2002 and its evaluation in an interactive search task. To do this we will look at ...
Georgina Gaughan, Alan F. Smeaton, Cathal Gurrin, ...
The effects of out-of-vocabulary (OOV) items in spoken document retrieval (SDR) are investigated. Several sets of transcriptions were created for the TREC-8 SDR task using a speec...
Philip C. Woodland, Sue E. Johnson, P. Jourlin, Ka...