Sciweavers

KDD
2007
ACM
16 years 7 months ago
Finding tribes: identifying close-knit individuals from employment patterns
We present a family of algorithms to uncover tribes--groups of individuals who share unusual sequences of affiliations. While much work inferring community structure describes lar...
Lisa Friedland, David Jensen
KDD
2007
ACM
16 years 7 months ago
Relational data pre-processing techniques for improved securities fraud detection
Commercial datasets are often large, relational, and dynamic. They contain many records of people, places, things, events and their interactions over time. Such datasets are rarel...
Andrew Fast, Lisa Friedland, Marc Maier, Brian Tay...
KDD
2007
ACM
16 years 7 months ago
Semi-supervised classification with hybrid generative/discriminative methods
Gregory Druck, Chris Pal, Andrew McCallum, Xiaojin...
KDD
2007
ACM
16 years 7 months ago
Development of NeuroElectroMagnetic ontologies (NEMO): a framework for mining brainwave ontologies
Dejing Dou, Gwen A. Frishkoff, Jiawei Rong, Robert...
KDD
2007
ACM
16 years 7 months ago
Efficient incremental constrained clustering
Clustering with constraints is an emerging area of data mining research. However, most work assumes that the constraints are given as one large batch. In this paper we explore the...
Ian Davidson, S. S. Ravi, Martin Ester
KDD
2007
ACM
16 years 7 months ago
Feature selection methods for text classification
Anirban Dasgupta, Petros Drineas, Boulos Harb, Van...
KDD
2007
ACM
16 years 7 months ago
Detecting changes in large data sets of payment card data: a case study
An important problem in data mining is detecting changes in large data sets. Although there are a variety of change detection algorithms that have been developed, in practice it c...
Chris Curry, Robert L. Grossman, David Locke, Stev...
KDD
2007
ACM
16 years 7 months ago
Canonicalization of database records using adaptive similarity measures
It is becoming increasingly common to construct databases from information automatically culled from many heterogeneous sources. For example, a research publication database can b...
Aron Culotta, Michael L. Wick, Robert Hall, Matthe...
KDD
2007
ACM
16 years 7 months ago
Exploiting underrepresented query aspects for automatic query expansion
Users attempt to express their search goals through web search queries. When a search goal has multiple components or aspects, documents that represent all the aspects are likely ...
Daniel Crabtree, Peter Andreae, Xiaoying Gao