Sciweavers

5107 search results - page 108 / 1022
» Data Mining and Information Retrieval
Sort
View
WSDM
2010
ACM
215views Data Mining» more  WSDM 2010»
16 years 3 months ago
Boilerplate Detection using Shallow Text Features
In addition to the actual content Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...
Christian Kohlschütter, Peter Fankhauser, Wol...
WSDM
2009
ACM
125views Data Mining» more  WSDM 2009»
16 years 1 months ago
Less is more: sampling the neighborhood graph makes SALSA better and faster
In this paper, we attempt to improve the effectiveness and the efficiency of query-dependent link-based ranking algorithms such as HITS, MAX and SALSA. All these ranking algorith...
Marc Najork, Sreenivas Gollapudi, Rina Panigrahy
CIKM
2010
Springer
15 years 4 months ago
Visual cube and on-line analytical processing of images
On-Line Analytical Processing (OLAP) has shown great success in many industry applications, including sales, marketing, management, financial data analysis, etc. In this paper, w...
Xin Jin, Jiawei Han, Liangliang Cao, Jiebo Luo, Bo...
CHI
2003
ACM
16 years 6 months ago
The impact of automated assistance on the information retrieval process
Advanced information retrieval systems providing automated assistance offer the opportunity to greatly enhance the effectiveness of the information retrieval process. One issue in...
Bernard J. Jansen, George K. Kroner
KDD
2010
ACM
272views Data Mining» more  KDD 2010»
15 years 4 months ago
Scalable similarity search with optimized kernel hashing
Scalable similarity search is the core of many large scale learning or data mining applications. Recently, many research results demonstrate that one promising approach is creatin...
Junfeng He, Wei Liu, Shih-Fu Chang