Sciweavers

» A Pattern Based Data Mining Approach
ICDM
2008
IEEE
Clustering Documents with Active Learning Using Wikipedia
Wikipedia has been applied as a background knowledge base to various text mining problems, but very few attempts have been made to utilize it for document clustering. In this pape...
Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...
KDD
2006
ACM
Extracting key-substring-group features for text classification
In many text classification applications, it is appealing to take every document as a string of characters rather than a bag of words. Previous research studies in this area mostl...
Dell Zhang, Wee Sun Lee
PVLDB
2010
Behavior Based Record Linkage
In this paper, we present a new record linkage approach that uses entity behavior to decide if potentially different entities are in fact the same. An entity’s behavior is extra...
Mohamed Yakout, Ahmed K. Elmagarmid, Hazem Elmelee...
WSDM
2010
ACM
GeoFolk: Latent spatial semantics in Web 2.0 social media
We describe an approach for multi-modal characterization of social media by combining text features (e.g. tags as a prominent example of short, unstructured text labels) with spat...
Sergej Sizov
PKDD
2009
Springer
Protein Identification from Tandem Mass Spectra with Probabilistic Language Modeling
This paper presents an interdisciplinary investigation of statistical information retrieval (IR) techniques for protein identification from tandem mass spectra, a challenging probl...
Yiming Yang, Abhay Harpale, Subramaniam Ganapathy