Sciweavers

1834 search results - page 142 / 367
» Web Mining in Search Engines
Sort
View
SIGIR
2004
ACM
16 years 5 hour ago
Query-related data extraction of hidden web documents
The larger amount of information on the Web is stored in document databases and is not indexed by general-purpose search engines (i.e., Google and Yahoo). Such information is dyna...
Yih-Ling Hedley, Muhammad Younas, Anne E. James, M...
JCDL
2011
ACM
225views Education» more  JCDL 2011»
14 years 9 months ago
How much of the web is archived?
The Memento Project’s archive access additions to HTTP have enabled development of new web archive access user interfaces. After experiencing this web time travel, the inevitabl...
Scott Ainsworth, Ahmed Alsum, Hany SalahEldeen, Mi...
SEMWEB
2009
Springer
16 years 1 months ago
Investigating the Semantic Gap through Query Log Analysis
Significant efforts have focused in the past years on bringing large amounts of metadata online and the success of these efforts can be seen by the impressive number of web site...
Peter Mika, Edgar Meij, Hugo Zaragoza
COLING
2010
15 years 1 months ago
A Method for Automatically Generating a Mediatory Summary to Verify Credibility of Information on the Web
In this paper, we propose a method for mediatory summarization, which is a novel technique for facilitating users' assessments of the credibility of information on the Web. A...
Hideyuki Shibuki, Takahiro Nagai, Masahiro Nakano,...
SIGMOD
2008
ACM
119views Database» more  SIGMOD 2008»
16 years 6 months ago
Webpage understanding: beyond page-level search
In this paper we introduce the webpage understanding problem which consists of three subtasks: webpage segmentation, webpage structure labeling, and webpage text segmentation and ...
Zaiqing Nie, Ji-Rong Wen, Wei-Ying Ma