Sciweavers

782 search results - page 60 / 157
» A measure theoretic approach to information retrieval
Sort
View
CLEF
2010
Springer
15 years 7 months ago
External and Intrinsic Plagiarism Detection Using a Cross-Lingual Retrieval and Segmentation System - Lab Report for PAN at CLEF
We present our hybrid system for the PAN challenge at CLEF 2010. Our system performs plagiarism detection for translated and non-translated externally as well as intrinsically plag...
Markus Muhr, Roman Kern, Mario Zechner, Michael Gr...
KDD
2002
ACM
148views Data Mining» more  KDD 2002»
16 years 6 months ago
Discovering informative content blocks from Web documents
In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...
Shian-Hua Lin, Jan-Ming Ho
SDM
2008
SIAM
135views Data Mining» more  SDM 2008»
15 years 7 months ago
A Spamicity Approach to Web Spam Detection
Web spam, which refers to any deliberate actions bringing to selected web pages an unjustifiable favorable relevance or importance, is one of the major obstacles for high quality ...
Bin Zhou 0002, Jian Pei, ZhaoHui Tang
ICC
2007
IEEE
16 years 16 days ago
Scheduling Feed Retrieval
— The popularity of RSS and similar feed formats is growing fast. This paper gives an overview of the standards and implementations in this field, and analyzes whether they allo...
Ward van Wanrooij, Aiko Pras
JCDL
2010
ACM
259views Education» more  JCDL 2010»
15 years 11 months ago
Exploiting time-based synonyms in searching document archives
Query expansion of named entities can be employed in order to increase the retrieval effectiveness. A peculiarity of named entities compared to other vocabulary terms is that they...
Nattiya Kanhabua, Kjetil Nørvåg