Sciweavers

» Improving Generalization by Data Categorization
CIKM
2011
ACM
14 years 6 months ago
Semi-supervised multi-task learning of structured prediction models for web information extraction
Extracting information from web pages is an important problem; it has several applications such as providing improved search results and construction of databases to serve user qu...
Paramveer S. Dhillon, Sundararajan Sellamanickam, ...
KDD
2012
ACM
13 years 9 months ago
Maximum inner-product search using cone trees
The problem of efficiently finding the best match for a query in a given set with respect to the Euclidean distance or the cosine similarity has been extensively studied. However...
Parikshit Ram, Alexander G. Gray
KDD
2009
ACM
16 years 7 months ago
Information theoretic regularization for semi-supervised boosting
We present novel semi-supervised boosting algorithms that incrementally build linear combinations of weak classifiers through generic functional gradient descent using both labele...
Lei Zheng, Shaojun Wang, Yan Liu, Chi-Hoon Lee
KDD
2006
ACM
16 years 7 months ago
Out-of-core frequent pattern mining on a commodity PC
In this work we focus on the problem of frequent itemset mining on large, out-of-core data sets. After presenting a characterization of existing out-of-core frequent itemset minin...
Gregory Buehrer, Srinivasan Parthasarathy, Amol Gh...
SIGMOD
2007
ACM
16 years 6 months ago
Indexing dataspaces
Dataspaces are collections of heterogeneous and partially unstructured data. Unlike data-integration systems that also offer uniform access to heterogeneous data sources, dataspac...
Xin Dong, Alon Y. Halevy