Ranking for multilingual information retrieval (MLIR) is a task to rank documents of different languages solely based on their relevancy to the query regardless of query’s langu...
Abstract. Automatic plagiarism detection considering a reference corpus compares a suspicious text to a set of original documents in order to relate the plagiarised fragments to th...
Scanned document images are nowadays becoming available in increasingly higher resolutions. Meanwhile, the variations in image quality within typical document collections increase...
Iuliu Konya Konya, Christoph Seibert, Stefan Eicke...
A meta-search engine propagates user queries to its participant search engines following a server selection strategy. To facilitate server selection, the metasearch engine must ke...
We propose an approach to restore severely degraded
document images using a probabilistic context model. Un-
like traditional approaches that use previously learned
prior models...
Jyotirmoy Banerjee, Anoop M. Namboodiri, C. V. Jaw...