Sciweavers

Search results for: Ontology-based Text Document Clustering
CIKM
2008
Springer
15 years 8 months ago
Learning to link with Wikipedia
This paper describes how to automatically cross-reference documents with Wikipedia: the largest knowledge base ever known. It explains how machine learning can be used to identify...
David N. Milne, Ian H. Witten
SAC
2010
ACM
16 years 24 days ago
Mining temporal relationships among categories
Temporal text mining deals with discovering temporal patterns in text over a period of time. A Theme Evolution Graph (TEG) is used to visualize when new themes are created and how...
Saket S. R. Mengle, Nazli Goharian
ICDAR
2009
IEEE
16 years 21 days ago
Word-Based Adaptive OCR for Historical Books
The aim of this work is to propose a new approach to the recognition of historical texts by providing an adaptive mechanism that automatically tunes itself to a specific book. Th...
Vladimir Kluzner, Asaf Tzadok, Yuval Shimony, Euge...
JACM
2010
15 years 4 months ago
The nested Chinese restaurant process and Bayesian nonparametric inference of topic hierarchies
clustering of documents according to sharing of topics at multiple levels of abstraction. Given a corpus of documents, a posterior inference algorithm finds an approximation to a ...
David M. Blei, Thomas L. Griffiths, Michael I. Jor...
SIGIR
2004
ACM
15 years 11 months ago
On scaling latent semantic indexing for large peer-to-peer systems
The exponential growth of data demands scalable infrastructures capable of indexing and searching rich content such as text, music, and images. A promising direction is to combine...
Chunqiang Tang, Sandhya Dwarkadas, Zhichen Xu