Sciweavers

5575 search results - page 724 / 1115
» Information Extraction
Sort
View
CEAS
2006
Springer
15 years 10 months ago
Introducing the Webb Spam Corpus: Using Email Spam to Identify Web Spam Automatically
Just as email spam has negatively impacted the user messaging experience, the rise of Web spam is threatening to severely degrade the quality of information on the World Wide Web....
Steve Webb, James Caverlee, Calton Pu
DAS
2006
Springer
15 years 10 months ago
Document Logical Structure Analysis Based on Perceptive Cycles
This paper describes a Neural Network (NN) approach for logical document structure extraction. In this NN architecture, called Transparent Neural Network (TNN), the document struct...
Yves Rangoni, Abdel Belaïd
DAS
2006
Springer
15 years 10 months ago
Bangla/English Script Identification Based on Analysis of Connected Component Profiles
Script identification is required for a multilingual OCR system. In this paper, we present a novel and efficient technique for Bangla/English script identification with application...
Lijun Zhou, Yue Lu, Chew Lim Tan
EDBTW
2006
Springer
15 years 10 months ago
NaviMoz: Mining Navigational Patterns in Portal Catalogs
Abstract. Portal Catalogs is a popular means of searching for information on the Web. They provide querying and browsing capabilities on data organized in a hierarchy, on a categor...
Eleni G. Christodoulou, Theodore Dalamagas, Timos ...
201
Voted
AIIA
2001
Springer
15 years 10 months ago
A New Machine Learning Approach to Fingerprint Classification
We present new fingerprint classification algorithms based on two machine learning approaches: support vector machines (SVMs), and recursive neural networks (RNNs). RNNs are traine...
Yuan Yao, Gian Luca Marcialis, Massimiliano Pontil...