The retrieval of similar documents from large scale datasets has been the one of the main concerns in knowledge management environments, such as plagiarism detection, news impact a...
Felipe Bravo-Marquez, Gaston L'Huillier, Sebasti&a...
We present VideoPot, a desktop video search system designed for digital video files stored on personal computers. The core metric of this system is based on video-indexing techno...
Hidenobu Nagata, Dan Mikami, Shozo Azuma, Masashi ...
The lack of a large scale Chinese test collection is an obstacle to the Chinese information retrieval development. In order to address this issue, we built such a collection compos...
In this paper, we present a new method for learning to finding translations and transliterations on the Web for a given term. The approach involves using a small set of terms and ...
Joseph Z. Chang, Jason S. Chang, Jyh-Shing Roger J...
Web-page classification is much more difficult than pure-text classification due to a large variety of noisy information embedded in Web pages. In this paper, we propose a new Web...