Sciweavers

489 search results - page 51 / 98
» Effective techniques for automatic extraction of Web publica...
Sort
View
WWW
2006
ACM
16 years 6 months ago
Detecting spam web pages through content analysis
In this paper, we continue our investigations of "web spam": the injection of artificially-created pages into the web in order to influence the results from search engin...
Alexandros Ntoulas, Marc Najork, Mark Manasse, Den...
KDD
2012
ACM
212views Data Mining» more  KDD 2012»
13 years 8 months ago
Harnessing the wisdom of the crowds for accurate web page clipping
Clipping Web pages, namely extracting the informative clips (areas) from Web pages, has many applications, such as Web printing and e-reading on small handheld devices. Although m...
Lei Zhang, Linpeng Tang, Ping Luo, Enhong Chen, Li...
ICDE
2007
IEEE
173views Database» more  ICDE 2007»
16 years 7 months ago
Annotating Structured Data of the Deep Web
An increasing number of databases have become Web accessible through HTML form-based search interfaces. The data units returned from the underlying database are usually encoded in...
Yiyao Lu, Hai He, Hongkun Zhao, Weiyi Meng, Clemen...
IJCAI
2001
15 years 7 months ago
Keyword Spices: A New Method for Building Domain-Specific Web Search Engines
This paper presents a new method for building domain-specific web search engines. Previous methods eliminate irrelevant documents from the pages accessed using heuristics based on...
Satoshi Oyama, Takashi Kokubo, Toru Ishida, Teruhi...
AI
2007
Springer
16 years 11 days ago
Learning the Semantic Meaning of a Concept from the Web
Many researchers have used text classification method in solving the ontology mapping problem. Their mapping results heavily depend on the availability of quality exemplars used as...
Yang Yu, Yun Peng