Sciweavers

» Clustering by pattern similarity in large data sets
KDD
2001
ACM
16 years 7 months ago
Induction of semantic classes from natural language text
Many applications dealing with textual information require classification of words into semantic classes (or concepts). However, manually constructing semantic classes is a tediou...
Dekang Lin, Patrick Pantel
JIIS
2006
15 years 6 months ago
Spatial ordering and encoding for geographic data mining and visualization
Geographic information (e.g., locations, networks, and nearest neighbors) are unique and different from other aspatial attributes (e.g., population, sales, or income). It is a ch...
Diansheng Guo, Mark Gahegan
SSPR
2004
Springer
15 years 12 months ago
Feature Shaving for Spectroscopic Data
High-resolution spectroscopy is a powerful industrial tool. The number of features (wavelengths) in these data sets varies from several hundreds up to a thousand. Relevant feature ...
Serguei Verzakov, Pavel Paclík, Robert P. W...
KDD
2008
ACM
16 years 7 months ago
De-duping URLs via rewrite rules
A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...
Anirban Dasgupta, Ravi Kumar, Amit Sasturkar
BMCBI
2006
15 years 6 months ago
A phylogenomic gene cluster resource: the Phylogenetically Inferred Groups (PhIGs) database
Background: We present here the PhIGs database, a phylogenomic resource for sequenced genomes. Although many methods exist for clustering gene families, very few attempt to create...
Paramvir S. Dehal, Jeffrey L. Boore