Stemming can improve retrieval accuracy, but stemmers are language-specific. Character n-gram tokenization achieves many of the benefits of stemming in a language independent way,...
In image retrieval, global features related to color or texture are commonly used to describe the image. The use of interest points in contentbased image retrieval allows image ind...
This paper presents a summarization model based on the Universal Networking Language (UNL), which is a conceptual language for representing texts sentence by sentence, using seman...
Camilla Brandel Martins, Lucia Helena Machado Rino
The explosive growth in the biomedical literature has made it difficult for researchers to keep up with advancements, even in their own narrow specializations. In addition, this c...
Abstract: In this paper we describe a flexible, portable and languageindependent infrastructure for setting up large monolingual language corpora. The approach is based on collecti...
Christian Biemann, Stefan Bordag, Gerhard Heyer, U...