This paper presents a large-scale system for the recognition and semantic disambiguation of named entities based on information extracted from a large encyclopedic collection and ...
One of the main interests in the Web Information Retrieval research area is the identification of the user interests and needs so the search engines and tools can help the users t...
We examine the suitability of RDF, RDF Schema (as simple ontology language), and RDF repository Sesame, for providing the backend to a prospective domain-specific web search tool, ...
Abstract. Search engines often employ techniques for determining syntactic similarity of Web pages. Such a tool allows them to avoid returning multiple copies of essentially the sa...
We study methods to initialize or bias different clustering methods using prior information about the "importance" of a keyword w.r.t. the whole document collection or a...