Sciweavers

» A Method for Web Information Extraction
DIS
2007
Springer
16 years 27 days ago
Unsupervised Spam Detection Based on String Alienness Measures
We propose an unsupervised method for detecting spam documents from Web page data, based on equivalence relations on strings. We propose 3 measures for quantifying the alienness (...
Kazuyuki Narisawa, Hideo Bannai, Kohei Hatano, Mas...
CAISE
2004
Springer
16 years 4 days ago
A knowledge-based approach to ontology learning and semantic annotation
The so-called Semantic Web vision will certainly benefit from automatic semantic annotation of words in documents. We present a method, called structural semantic interconnections ...
Roberto Navigli, Paola Velardi
CANDT
2009
15 years 10 months ago
Measuring self-focus bias in community-maintained knowledge repositories
Self-focus is a novel way of understanding a type of bias in community-maintained Web 2.0 graph structures. It goes beyond previous measures of topical coverage bias by encapsulat...
Brent Hecht, Darren Gergle
DIS
2001
Springer
15 years 11 months ago
Eliminating Useless Parts in Semi-structured Documents Using Alternation Counts
We propose a preprocessing method for Web mining which, given semi-structured documents with the same structure and style, distinguishes useless parts and non-useless parts in each...
Daisuke Ikeda, Yasuhiro Yamada, Sachio Hirokawa
ACL
2011
14 years 10 months ago
Fine-Grained Class Label Markup of Search Queries
We develop a novel approach to the semantic analysis of short text segments and demonstrate its utility on a large corpus of Web search queries. Extracting meaning from short text...
Joseph Reisinger, Marius Pasca