Sciweavers

7200 search results - page 973 / 1440
» Self-Organizing Data Mining
Sort
View
194
Voted
KDD
2007
ACM
167views Data Mining» more  KDD 2007»
16 years 7 months ago
Generalized component analysis for text with heterogeneous attributes
We present a class of richly structured, undirected hidden variable models suitable for simultaneously modeling text along with other attributes encoded in different modalities. O...
Xuerui Wang, Chris Pal, Andrew McCallum
215
Voted
KDD
2006
ACM
381views Data Mining» more  KDD 2006»
16 years 7 months ago
GPLAG: detection of software plagiarism by program dependence graph analysis
Along with the blossom of open source projects comes the convenience for software plagiarism. A company, if less self-disciplined, may be tempted to plagiarize some open source pr...
Chao Liu 0001, Chen Chen, Jiawei Han, Philip S. Yu
200
Voted
KDD
2006
ACM
191views Data Mining» more  KDD 2006»
16 years 7 months ago
Beyond classification and ranking: constrained optimization of the ROI
Classification has been commonly used in many data mining projects in the financial service industry. For instance, to predict collectability of accounts receivable, a binary clas...
Lian Yan, Patrick Baldasare
KDD
2004
ACM
124views Data Mining» more  KDD 2004»
16 years 7 months ago
Automatic multimedia cross-modal correlation discovery
Given an image (or video clip, or audio song), how do we automatically assign keywords to it? The general problem is to find correlations across the media in a collection of multi...
Jia-Yu Pan, Hyung-Jeong Yang, Christos Faloutsos, ...
KDD
2003
ACM
142views Data Mining» more  KDD 2003»
16 years 7 months ago
Frequent-subsequence-based prediction of outer membrane proteins
A number of medically important disease-causing bacteria (collectively called Gram-negative bacteria) are noted for the extra "outer" membrane that surrounds their cell....
Rong She, Fei Chen 0002, Ke Wang, Martin Ester, Je...