Sciweavers

KDD
2006
ACM
213views Data Mining» more  KDD 2006»
16 years 7 months ago
Pragmatic text mining: minimizing human effort to quantify many issues in call logs
George Forman, Evan Kirshenbaum, Jaap Suermondt
KDD
2006
ACM
130views Data Mining» more  KDD 2006»
16 years 7 months ago
Mining relational data through correlation-based multiple view validation
Commercial relational databases currently store vast amounts of real-world data. The data within these relational repositories are represented by multiple relations, which are int...
Hongyu Guo, Herna L. Viktor
KDD
2006
ACM
143views Data Mining» more  KDD 2006»
16 years 7 months ago
Algorithms for discovering bucket orders from data
Ordering and ranking items of different types are important tasks in various applications, such as query processing and scientific data mining. A total order for the items can be ...
Aristides Gionis, Heikki Mannila, Kai Puolamä...
KDD
2006
ACM
164views Data Mining» more  KDD 2006»
16 years 7 months ago
Assessing data mining results via swap randomization
The problem of assessing the significance of data mining results on high-dimensional 0?1 data sets has been studied extensively in the literature. For problems such as mining freq...
Aristides Gionis, Heikki Mannila, Panayiotis Tsapa...
KDD
2006
ACM
222views Data Mining» more  KDD 2006»
16 years 7 months ago
A component-based framework for knowledge discovery in bioinformatics
Motivation: In the field of bioinformatics there is an emerging need to integrate all knowledge discovery steps into a standardized modular framework. Indeed, component-based deve...
Julien Etienne, Bernd Wachmann, Lei Zhang
KDD
2006
ACM
156views Data Mining» more  KDD 2006»
16 years 7 months ago
Discovering significant OPSM subspace clusters in massive gene expression data
Order-preserving submatrixes (OPSMs) have been accepted as a biologically meaningful subspace cluster model, capturing the general tendency of gene expressions across a subset of ...
Byron J. Gao, Obi L. Griffith, Martin Ester, Steve...
KDD
2006
ACM
198views Data Mining» more  KDD 2006»
16 years 7 months ago
Estimating the global pagerank of web communities
Localized search engines are small-scale systems that index a particular community on the web. They offer several benefits over their large-scale counterparts in that they are rel...
Jason V. Davis, Inderjit S. Dhillon
KDD
2006
ACM
174views Data Mining» more  KDD 2006»
16 years 7 months ago
Onboard classifiers for science event detection on a remote sensing spacecraft
Typically, data collected by a spacecraft is downlinked to Earth and pre-processed before any analysis is performed. We have developed classifiers that can be used onboard a space...
Ashley Davies, Benjamin Cichy, Dominic Mazzoni, Ng...
KDD
2006
ACM
155views Data Mining» more  KDD 2006»
16 years 7 months ago
Single-pass online learning: performance, voting schemes and online feature selection
To learn concepts over massive data streams, it is essential to design inference and learning methods that operate in real time with limited memory. Online learning methods such a...
Vitor R. Carvalho, William W. Cohen