Sciweavers

5647 search results - page 366 / 1130
» A word from the editor
Sort
View
UAI
2008
15 years 8 months ago
Latent Topic Models for Hypertext
Latent topic models have been successfully applied as an unsupervised topic discovery technique in large document collections. With the proliferation of hypertext document collect...
Amit Gruber, Michal Rosen-Zvi, Yair Weiss
ACL
2006
15 years 8 months ago
Aligning Features with Sense Distinction Dimensions
In this paper we present word sense disambiguation (WSD) experiments on ten highly polysemous verbs in Chinese, where significant performance improvements are achieved using rich ...
Nianwen Xue, Jinying Chen, Martha Palmer
158
Voted
DAGSTUHL
2006
15 years 8 months ago
Automatic Meaning Discovery Using Google
We have found a method to automatically extract the meaning of words and phrases from the world-wide-web using Google page counts. The approach is novel in its unrestricted proble...
Rudi Cilibrasi, Paul M. B. Vitányi
HIS
2003
15 years 8 months ago
Evolving Better Stoplists for Document Clustering and Web Intelligence
: Text classification, document clustering and similar document analysis tasks are currently the subject of significant global research, since such areas underpin web intelligence,...
Mark P. Sinka, David Corne
SDM
2003
SIAM
134views Data Mining» more  SDM 2003»
15 years 8 months ago
Hierarchical Document Clustering using Frequent Itemsets
A major challenge in document clustering is the extremely high dimensionality. For example, the vocabulary for a document set can easily be thousands of words. On the other hand, ...
Benjamin C. M. Fung, Ke Wang, Martin Ester