Sciweavers

KDD
2005
ACM
106views Data Mining» more  KDD 2005»
16 years 7 months ago
Simultaneous optimization of complex mining tasks with a knowledgeable cache
With an increasing use of data mining tools and techniques, we envision that a Knowledge Discovery and Data Mining System (KDDMS) will have to support and optimize for the followi...
Ruoming Jin, Kaushik Sinha, Gagan Agrawal
KDD
2005
ACM
103views Data Mining» more  KDD 2005»
16 years 7 months ago
Fast discovery of unexpected patterns in data, relative to a Bayesian network
We consider a model in which background knowledge on a given domain of interest is available in terms of a Bayesian network, in addition to a large database. The mining problem is...
Szymon Jaroszewicz, Tobias Scheffer
KDD
2005
ACM
168views Data Mining» more  KDD 2005»
16 years 7 months ago
Nomograms for visualizing support vector machines
We propose a simple yet potentially very effective way of visualizing trained support vector machines. Nomograms are an established model visualization technique that can graphica...
Aleks Jakulin, Martin Mozina, Janez Demsar, Ivan B...
KDD
2005
ACM
171views Data Mining» more  KDD 2005»
16 years 7 months ago
Application of kernels to link analysis
Abstract. The application of kernel methods to link analysis is explored. We argue that a family of kernels on graphs provides a unified perspective on the three measures proposed ...
Takahiko Ito, Masashi Shimbo, Taku Kudo, Yuji Mats...
206
Voted
KDD
2005
ACM
161views Data Mining» more  KDD 2005»
16 years 7 months ago
Combining email models for false positive reduction
Machine learning and data mining can be effectively used to model, classify and discover interesting information for a wide variety of data including email. The Email Mining Toolk...
Shlomo Hershkop, Salvatore J. Stolfo
KDD
2005
ACM
103views Data Mining» more  KDD 2005»
16 years 7 months ago
Finding similar files in large document repositories
George Forman, Kave Eshghi, Stephane Chiocchetti
KDD
2005
ACM
86views Data Mining» more  KDD 2005»
16 years 7 months ago
Unweaving a web of documents
Ramanathan V. Guha, Ravi Kumar, D. Sivakumar, Ravi...
KDD
2005
ACM
80views Data Mining» more  KDD 2005»
16 years 7 months ago
Wavelet synopsis for data streams: minimizing non-euclidean error
We consider the wavelet synopsis construction problem for data streams where given n numbers we wish to estimate the data by constructing a synopsis, whose size, say B is much sma...
Sudipto Guha, Boulos Harb
KDD
2005
ACM
118views Data Mining» more  KDD 2005»
16 years 7 months ago
The predictive power of online chatter
Daniel Gruhl, Ramanathan V. Guha, Ravi Kumar, Jasm...