Sciweavers

KDD
2003
ACM
142views Data Mining» more  KDD 2003»
16 years 7 months ago
Mining phenotypes and informative genes from gene expression data
Mining microarray gene expression data is an important research topic in bioinformatics with broad applications. While most of the previous studies focus on clustering either gene...
Chun Tang, Aidong Zhang, Jian Pei
KDD
2003
ACM
122views Data Mining» more  KDD 2003»
16 years 7 months ago
Discovery of climate indices using clustering
To analyze the effect of the oceans and atmosphere on land climate, Earth Scientists have developed climate indices, which are time series that summarize the behavior of selected ...
Michael Steinbach, Pang-Ning Tan, Vipin Kumar, Ste...
KDD
2003
ACM
118views Data Mining» more  KDD 2003»
16 years 7 months ago
Generating English summaries of time series data using the Gricean maxims
We are developing technology for generating English textual summaries of time-series data, in three domains: weather forecasts, gas-turbine sensor readings, and hospital intensive...
Somayajulu Sripada, Ehud Reiter, Jim Hunter, Jin Y...
KDD
2003
ACM
142views Data Mining» more  KDD 2003»
16 years 7 months ago
Frequent-subsequence-based prediction of outer membrane proteins
A number of medically important disease-causing bacteria (collectively called Gram-negative bacteria) are noted for the extra "outer" membrane that surrounds their cell....
Rong She, Fei Chen 0002, Ke Wang, Martin Ester, Je...
KDD
2003
ACM
162views Data Mining» more  KDD 2003»
16 years 7 months ago
Improving spatial locality of programs via data mining
In most computer systems, page fault rate is currently minimized by generic page replacement algorithms which try to model the temporal locality inherent in programs. In this pape...
Karlton Sequeira, Mohammed Javeed Zaki, Boleslaw K...
KDD
2003
ACM
157views Data Mining» more  KDD 2003»
16 years 7 months ago
Cross-training: learning probabilistic mappings between topics
Classification is a well-established operation in text mining. Given a set of labels A and a set DA of training documents tagged with these labels, a classifier learns to assign l...
Sunita Sarawagi, Soumen Chakrabarti, Shantanu Godb...
KDD
2003
ACM
144views Data Mining» more  KDD 2003»
16 years 7 months ago
Clinical and financial outcomes analysis with existing hospital patient records
Existing patient records are a valuable resource for automated outcomes analysis and knowledge discovery. However, key clinical data in these records is typically recorded in unst...
R. Bharat Rao, Sathyakama Sandilya, Radu Stefan Ni...
KDD
2003
ACM
111views Data Mining» more  KDD 2003»
16 years 7 months ago
Critical event prediction for proactive management in large-scale computer clusters
Adam J. Oliner, Anand Sivasubramaniam, Irina Rish,...
KDD
2003
ACM
145views Data Mining» more  KDD 2003»
16 years 7 months ago
Carpenter: finding closed patterns in long biological datasets
The growth of bioinformatics has resulted in datasets with new characteristics. These datasets typically contain a large number of columns and a small number of rows. For example,...
Feng Pan, Gao Cong, Anthony K. H. Tung, Jiong Yang...