The indexation of documents is a critical step of the information retrieval process and is often a manual task which highly depends on the indexer’s knowledge. We propose to imp...
High availability of information in medium and large-sized enterprises may naturally benefit from a wide-area replication of data. We discuss recent developments of replication arc...
We analyze the persistence of information on the web, looking at the percentage of invalid URLs contained in academic articles within the CiteSeer (ResearchIndex) database. The nu...
Steve Lawrence, Frans Coetzee, Gary William Flake,...
We employ Automorphology, an MDL-based algorithm that determines the suffixes present in a language-sample with no prior knowledge of the language in question, and describe our exp...
John A. Goldsmith, Derrick Higgins, Svetlana Sogla...
Abstract. A Conceptual Information System consists of a database together with conceptual hierarchies. The management system TOSCANA visualizes arbitrary combinations of conceptual...