Sciweavers

935 search results - page 98 / 187
» Analyzing Document Retrievability in Patent Retrieval Settin...
Sort
View
DIS
2007
Springer
16 years 14 days ago
Unsupervised Spam Detection Based on String Alienness Measures
We propose an unsupervised method for detecting spam documents from Web page data, based on equivalence relations on strings. We propose 3 measures for quantifying the alienness (...
Kazuyuki Narisawa, Hideo Bannai, Kohei Hatano, Mas...
IPM
2008
102views more  IPM 2008»
15 years 6 months ago
Fast exact maximum likelihood estimation for mixture of language model
Language modeling is an effective and theoretically attractive probabilistic framework for text information retrieval. The basic idea of this approach is to estimate a language mo...
Yi Zhang 0001, Wei Xu
RSFDGRC
2011
Springer
255views Data Mining» more  RSFDGRC 2011»
14 years 9 months ago
Construction and Analysis of Web-Based Computer Science Information Networks
WINACS (Web-based Information Network Analysis for Computer Science) is a project that incorporates many recent, exciting developments in data sciences to construct a Web-based co...
Jiawei Han
WWW
2007
ACM
16 years 7 months ago
BlogScope: spatio-temporal analysis of the blogosphere
We present BlogScope (www.blogscope.net), a system for analyzing the Blogosphere. BlogScope is an information discovery and text analysis system that offers a set of unique featur...
Nilesh Bansal, Nick Koudas
WWW
2003
ACM
16 years 7 months ago
Using Top-Ranking Sentences for Web Search Result Presentation
In this poster we propose a granular approach for presenting web search results. Sentences, taken from the top documents, are used as fine-grained representations of document cont...
Ryen W. White, Joemon M. Jose, Ian Ruthven