Sciweavers

2827 search results - page 227 / 566
» Marking Text Documents
Sort
View
CAIP
2009
Springer
140views Image Analysis» more  CAIP 2009»
15 years 10 months ago
Hierarchical Decomposition of Handwritten Manuscripts Layouts
Abstract. In this paper we propose a new approach to improve electronic editions of literary corpus, providing an efficient estimation of manuscripts pages structure. In any handwr...
Vincent Malleron, Véronique Eglin, Hubert E...
AI
2008
Springer
15 years 8 months ago
A Statistical Model for Topic Segmentation and Clustering
This paper presents a statistical model for discovering topical clusters of words in unstructured text. The model uses a hierarchical Bayesian structure and it is also able to iden...
M. Mahdi Shafiei, Evangelos E. Milios
NIPS
2001
15 years 8 months ago
Latent Dirichlet Allocation
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of discrete data such as text corpora. LDA is a three-level hierarchical Bayesian m...
David M. Blei, Andrew Y. Ng, Michael I. Jordan
COLING
2002
15 years 6 months ago
Extracting Important Sentences with Support Vector Machines
Extracting sentences that contain important information from a document is a form of text summarization. The technique is the key to the automatic generation of summaries similar ...
Tsutomu Hirao, Hideki Isozaki, Eisaku Maeda, Yuji ...
SAC
2008
ACM
15 years 6 months ago
Discovering relationships among categories using misclassification information
Knowledge of relationships among categories is of the interest in different domains such as text classification, content analysis, and text mining. We propose and evaluate approac...
Saket S. R. Mengle, Nazli Goharian, Alana Platt