Sciweavers

3699 search results - page 430 / 740
» Clustering with Qualitative Information
Sort
View
SIGIR
2010
ACM
15 years 10 months ago
Where to start filtering redundancy?: a cluster-based approach
Novelty detection is a difficult task, particularly at sentence level. Most of the approaches proposed in the past consist of re-ordering all sentences following their novelty sco...
Ronald T. Fernández, Javier Parapar, David ...
CIKM
2006
Springer
15 years 10 months ago
A fast and robust method for web page template detection and removal
The widespread use of templates on the Web is considered harmful for two main reasons. Not only do they compromise the relevance judgment of many web IR and web mining methods suc...
Karane Vieira, Altigran Soares da Silva, Nick Pint...
ECIR
2004
Springer
15 years 8 months ago
Performance Analysis of Distributed Architectures to Index One Terabyte of Text
We simulate different architectures of a distributed Information Retrieval system on a very large Web collection, in order to work out the optimal setting for a particular set of r...
Fidel Cacheda, Vassilis Plachouras, Iadh Ounis
DASFAA
2009
IEEE
118views Database» more  DASFAA 2009»
15 years 7 months ago
Detecting Aggregate Incongruities in XML
The problem of identifying deviating patterns in XML repositories has important applications in data cleaning, fraud detection, and stock market analysis. Current methods determine...
Wynne Hsu, Qiangfeng Peter Lau, Mong-Li Lee
ICIP
2006
IEEE
16 years 8 months ago
Acoustic Range Image Segmentation by Effective Mean Shift
Image perception in underwater environment is a difficult task for a human operator, and data segmentation becomes a crucial step toward an higher level interpretation and recogni...
Umberto Castellani, Marco Cristani, Vittorio Murin...