Sciweavers

4660 search results - page 333 / 932
» Learning from imperfect data
Sort
View
KDD
2004
ACM
164views Data Mining» more  KDD 2004»
16 years 7 months ago
Cluster-based concept invention for statistical relational learning
We use clustering to derive new relations which augment database schema used in automatic generation of predictive features in statistical relational learning. Clustering improves...
Alexandrin Popescul, Lyle H. Ungar
KDD
2002
ACM
130views Data Mining» more  KDD 2002»
16 years 7 months ago
Learning domain-independent string transformation weights for high accuracy object identification
The task of object identification occurs when integrating information from multiple websites. The same data objects can exist in inconsistent text formats across sites, making it ...
Sheila Tejada, Craig A. Knoblock, Steven Minton
RECOMB
2007
Springer
16 years 7 months ago
Learning Gene Regulatory Networks via Globally Regularized Risk Minimization
Learning the structure of a gene regulatory network from time-series gene expression data is a significant challenge. Most approaches proposed in the literature to date attempt to ...
Yuhong Guo, Dale Schuurmans
WEBDB
2009
Springer
115views Database» more  WEBDB 2009»
16 years 1 months ago
A Machine Learning Approach to Foreign Key Discovery
We study the problem of automatically discovering semantic associations between schema elements, namely foreign keys. This problem is important in all applications where data sets...
Alexandra Rostin, Oliver Albrecht, Jana Bauckmann,...
DMIN
2007
186views Data Mining» more  DMIN 2007»
15 years 8 months ago
Cost-Sensitive Learning vs. Sampling: Which is Best for Handling Unbalanced Classes with Unequal Error Costs?
- The classifier built from a data set with a highly skewed class distribution generally predicts the more frequently occurring classes much more often than the infrequently occurr...
Gary M. Weiss, Kate McCarthy, Bibi Zabar