Abstract. Geographic place names are often semantically highly ambiguous. For example, there are 491 places in Finland sharing the same name "Isosaari" (great island) tha...
Tomi Kauppinen, Riikka Henriksson, Reetta Sinkkil...
The problem of dividing a sequence of values into segments occurs in database systems, information retrieval, and knowledge management. The challenge is to select a finite number ...
This paper compares several indexing methods for person names extracted from text, developed for an information retrieval system with requirements for fast approximate matching of...
Identification of transliterations is aimed at enriching multilingual lexicons and improving performance in various Natural Language Processing (NLP) applications including Cross ...
Clustering algorithms play an important role in data analysis and information retrieval. How to obtain a clustering for a large set of high-dimensional data suitable for database ap...