Most search systems for querying large document collections---for example, web search engines---are based on well-understood information retrieval principles
The paper presents a conceptual solution and an implementation for acquiring, modeling, publishing, retrieving, reusing and maintaining knowledge within XWiki, an open source coll...
Web track results are presented. A software project, IRTools, is described. IRTools is intended to enable information retrieval (IR) experimentation by incorporating methods for m...
Semantic similarity measures play important roles in information retrieval and Natural Language Processing. Previous work in semantic web-related applications such as community mi...
Web spam is behavior that attempts to deceive search engine ranking algorithms. TrustRank is a recent algorithm that can combat web spam. However, TrustRank is vulnerable in the s...