Sciweavers

1151 search results - page 144 / 231
» Mining from Large Image Sets
Sort
View
KDD
2002
ACM
93views Data Mining» more  KDD 2002»
16 years 6 months ago
Interactive deduplication using active learning
Deduplication is a key operation in integrating data from multiple sources. The main challenge in this task is designing a function that can resolve when a pair of records refer t...
Sunita Sarawagi, Anuradha Bhamidipaty
KDD
2002
ACM
193views Data Mining» more  KDD 2002»
16 years 6 months ago
Query, analysis, and visualization of hierarchically structured data using Polaris
In the last several years, large OLAP databases have become common in a variety of applications such as corporate data warehouses and scientific computing. To support interactive ...
Chris Stolte, Diane Tang, Pat Hanrahan
ICDM
2009
IEEE
112views Data Mining» more  ICDM 2009»
16 years 1 months ago
Resolving Identity Uncertainty with Learned Random Walks
A pervasive problem in large relational databases is identity uncertainty which occurs when multiple entries in a database refer to the same underlying entity in the world. Relati...
Ted Sandler, Lyle H. Ungar, Koby Crammer
PKDD
2009
Springer
103views Data Mining» more  PKDD 2009»
16 years 27 days ago
Kernels for Periodic Time Series Arising in Astronomy
Abstract. We present a method for applying machine learning algorithms to the automatic classification of astronomy star surveys using time series of star brightness. Currently su...
Gabriel Wachman, Roni Khardon, Pavlos Protopapas, ...
PKDD
2009
Springer
134views Data Mining» more  PKDD 2009»
16 years 27 days ago
Multi-task Feature Selection Using the Multiple Inclusion Criterion (MIC)
Abstract. We address the problem of joint feature selection in multiple related classification or regression tasks. When doing feature selection with multiple tasks, usually one c...
Paramveer S. Dhillon, Brian Tomasik, Dean P. Foste...