Sciweavers

KDD
2003
ACM
161views Data Mining» more  KDD 2003»
16 years 7 months ago
Empirical Bayesian data mining for discovering patterns in post-marketing drug safety
Because of practical limits in characterizing the safety profiles of therapeutic products prior to marketing, manufacturers and regulatory agencies perform post-marketing surveill...
David M. Fram, June S. Almenoff, William DuMouchel
KDD
2003
ACM
127views Data Mining» more  KDD 2003»
16 years 7 months ago
Experiments with random projections for machine learning
Dimensionality reduction via Random Projections has attracted considerable attention in recent years. The approach has interesting theoretical underpinnings and offers computation...
Dmitriy Fradkin, David Madigan
KDD
2003
ACM
175views Data Mining» more  KDD 2003»
16 years 7 months ago
Correlating synchronous and asynchronous data streams
Sudipto Guha, Dimitrios Gunopulos, Nick Koudas
KDD
2003
ACM
99views Data Mining» more  KDD 2003»
16 years 7 months ago
Fragments of order
High-dimensional collections of 0-1 data occur in many applications. The attributes in such data sets are typically considered to be unordered. However, in many cases there is a n...
Aristides Gionis, Teija Kujala, Heikki Mannila
157
Voted
KDD
2003
ACM
129views Data Mining» more  KDD 2003»
16 years 7 months ago
Online novelty detection on temporal sequences
: Novelty detection, or anomaly detection, on temporal sequences has increasingly attracted attention from researchers in different areas. In this paper, we present a new framework...
Junshui Ma, Simon Perkins
103
Voted
KDD
2003
ACM
143views Data Mining» more  KDD 2003»
16 years 7 months ago
To buy or not to buy: mining airfare data to minimize ticket purchase price
Oren Etzioni, Rattapoom Tuchinda, Craig A. Knobloc...
180
Voted
KDD
2003
ACM
243views Data Mining» more  KDD 2003»
16 years 7 months ago
Accurate decision trees for mining high-speed data streams
In this paper we study the problem of constructing accurate decision tree models from data streams. Data streams are incremental tasks that require incremental, online, and any-ti...
João Gama, Pedro Medas, Ricardo Rocha
KDD
2003
ACM
233views Data Mining» more  KDD 2003»
16 years 7 months ago
SEWeP: using site semantics and a taxonomy to enhance the Web personalization process
Web personalization is the process of customizing a Web site to the needs of each specific user or set of users, taking advantage of the knowledge acquired through the analysis of...
Magdalini Eirinaki, Michalis Vazirgiannis, Iraklis...
KDD
2003
ACM
124views Data Mining» more  KDD 2003»
16 years 7 months ago
Information-theoretic co-clustering
Two-dimensional contingency or co-occurrence tables arise frequently in important applications such as text, web-log and market-basket data analysis. A basic problem in contingenc...
Inderjit S. Dhillon, Subramanyam Mallela, Dharmend...
KDD
2003
ACM
194views Data Mining» more  KDD 2003»
16 years 7 months ago
Finding recent frequent itemsets adaptively over online data streams
A data stream is a massive unbounded sequence of data elements continuously generated at a rapid rate. Consequently, the knowledge embedded in a data stream is more likely to be c...
Joong Hyuk Chang, Won Suk Lee