In order to produce, a good summary, one has to identify the most relevant portions of a given text. We describe in this t)at)er a method for automatically training tel)it, signat...
In this paper, we study the use of support vector machine in text categorization. Unlike other machine learning techniques, it allows easy incorporation of new documents into an e...
We describe a joint probabilistic model for modeling the contents and inter-connectivity of document collections such as sets of web pages or research paper archives. The model is...
: The Perfect benchmarks are a collection of scientific and engineering application-level programs that have been widely used to compare the performance of many different computer ...
We report on our approaches and methods for the ImageCLEF 2010 Wikipedia image retrieval task. A distinctive feature of this year's image collection is that images are associ...