Sciweavers

1834 search results - page 67 / 367
» Web Mining in Search Engines
Sort
View
KDD
2007
ACM
192views Data Mining» more  KDD 2007»
16 years 6 months ago
Active exploration for learning rankings from clickthrough data
We address the task of learning rankings of documents from search engine logs of user behavior. Previous work on this problem has relied on passively collected clickthrough data. ...
Filip Radlinski, Thorsten Joachims
WWW
2006
ACM
16 years 10 days ago
Do not crawl in the DUST: different URLs with similar text
We consider the problem of dust: Different URLs with Similar Text. Such duplicate URLs are prevalent in web sites, as web server software often uses aliases and redirections, and...
Uri Schonfeld, Ziv Bar-Yossef, Idit Keidar
DBISP2P
2005
Springer
183views Database» more  DBISP2P 2005»
15 years 12 months ago
Database Selection and Result Merging in P2P Web Search
Intelligent Web search engines are extremely popular now. Currently, only commercial centralized search engines like Google can process terabytes of Web data. Alternative search en...
Sergey Chernov, Pavel Serdyukov, Matthias Bender, ...
SAINT
2002
IEEE
15 years 11 months ago
On Updating in Very Short Time by Distributed Search Engines
Almost conventional search engines employ centralized architecture. However, such an engine is not suitable for fresh information retrieval because it spends a long time to collec...
Nobuyoshi Sato, Minoru Uehara, Yoshifumi Sakai, Hi...
WWW
2008
ACM
16 years 7 months ago
iRobot: an intelligent crawler for web forums
We study in this paper the Web forum crawling problem, which is a very fundamental step in many Web applications, such as search engine and Web data mining. As a typical user-crea...
Rui Cai, Jiang-Ming Yang, Wei Lai, Yida Wang, Lei ...