Sciweavers

KDD
2003
ACM
214views Data Mining» more  KDD 2003»
16 years 7 months ago
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney
KDD
2003
ACM
148views Data Mining» more  KDD 2003»
16 years 7 months ago
A highly-usable projected clustering algorithm for gene expression profiles
Projected clustering has become a hot research topic due to its ability to cluster high-dimensional data. However, most existing projected clustering algorithms depend on some cri...
Kevin Y. Yip, David W. Cheung, Michael K. Ng
KDD
2003
ACM
122views Data Mining» more  KDD 2003»
16 years 7 months ago
Enhanced visualization of time series through higher fourier harmonics
Li Zhang, Aidong Zhang, Murali Ramanathan
KDD
2003
ACM
133views Data Mining» more  KDD 2003»
16 years 7 months ago
Interactive Analysis of Gene Interactions Using Graphical gaussian model
DNA microarray provides a powerful basis for analysis of gene expression. Data mining methods such as clustering have been widely applied to microarray data to link genes that sho...
Xintao Wu, Yong Ye, Kalpathi R. Subramanian
KDD
2003
ACM
142views Data Mining» more  KDD 2003»
16 years 7 months ago
Extracting information from text and images for location proteomics
There is extensive interest in automating the collection, organization and summarization of biological data. Data in the form of figures and accompanying captions in literature pr...
Zhenzhen Kou, William W. Cohen, Robert F. Murphy
KDD
2003
ACM
190views Data Mining» more  KDD 2003»
16 years 7 months ago
Distance-enhanced association rules for gene expression
We introduce a novel data mining technique for the analysis of gene expression. Gene expression is the effective production of the protein that a gene encodes. We focus on the cha...
Aleksandar Icev, Carolina Ruiz, Elizabeth F. Ryder
KDD
2004
ACM
171views Data Mining» more  KDD 2004»
16 years 7 months ago
Integrating Web Conceptual Modeling and Web Usage Mining
We present a case study about the application of the inductive database approach to the analysis of Web logs. We consider rich XML Web logs ? called conceptual logs ? that are gen...
Rosa Meo, Pier Luca Lanzi, Maristella Matera, Robe...
KDD
2004
ACM
150views Data Mining» more  KDD 2004»
16 years 7 months ago
Markov Blankets and Meta-heuristics Search: Sentiment Extraction from Unstructured Texts
Extracting sentiments from unstructured text has emerged as an important problem in many disciplines. An accurate method would enable us, for example, to mine online opinions from ...
Edoardo Airoldi, Xue Bai, Rema Padman