This paper addresses the problem of identifying collection-dependent stop-words to reduce the size of inverted files. We present four methods to automatically recognise s...
Annotation of digitized pages from historical document collections is very important to research on automatic extraction of text blocks, lines, and handwriting recognition. We hav...
preferred abstracting and indexing databases to full text. Librarians and information professionals want the choice to be able to purchase subject orientated packages of electronic...
In this paper we present the Photo Pyramid, a device with a graspable interface to retrieve and navigate through digital photo collections. The user selects a set of photos by at...
Nishchal Deshpande, A. Panas, A. Bondaryeva, N. Ki...
This paper describes experiments in the automatic construction of lexicons that would be useful in searching large document collections for text fragments that address a specific ...