Sciweavers

» Amharic-English Information Retrieval
INCDM
2010
Springer
Data Mining
15 years 8 months ago
Web-Site Boundary Detection
Defining the boundaries of a web-site, for (say) archiving or information retrieval purposes, is an important but complicated task. In this paper a web-page clustering approach to...
Ayesh Alshukri, Frans Coenen, Michele Zito
KDD
2010
ACM
Data Mining
15 years 8 months ago
Mass estimation and its applications
This paper introduces mass estimation—a base modelling mechanism in data mining. It provides the theoretical basis of mass and an efficient method to estimate mass. We show that...
Kai Ming Ting, Guang-Tong Zhou, Fei Tony Liu, Jame...
DAS
2008
Springer
15 years 8 months ago
An End-to-End Administrative Document Analysis System
This paper presents an end-to-end administrative document analysis system. This system uses case-based reasoning in order to process documents from known and unknown classes. For ...
Hatem Hamza, Yolande Belaïd, Abdel Belaï...
DBISP2P
2008
Springer
Database
15 years 8 months ago
Exploiting Distribution Skew for Scalable P2P Text Clustering
K-Means clustering is widely used in information retrieval and data mining. Distributed K-Means variants have already been proposed, but none of the past algorithms scales to large...
Odysseas Papapetrou, Wolf Siberski, Fabian Leitrit...
DOCENG
2008
ACM
15 years 8 months ago
Identifying and expanding titles in web texts
In this paper, we present an analysis based on linguistic and typographic features that allows for the identification of titles in web documents. We focus in particular on procedu...
Clémentine Adam, Estelle Delpech, Patrick S...