Sciweavers

» Web Document Modeling
COLING
2008
15 years 8 months ago
An Improved Hierarchical Bayesian Model of Language for Document Classification
This paper addresses the fundamental problem of document classification, and we focus attention on classification problems where the classes are mutually exclusive. In the course ...
Ben Allison
ICDAR
2009
IEEE
15 years 4 months ago
Language Model Integration for the Recognition of Handwritten Medieval Documents
Building recognition systems for historical documents is a difficult task, especially when it comes to medieval scripts. The complexity is mainly affected by the poor quality and...
Markus Wüthrich, Marcus Liwicki, Andreas Fisc...
DOLAP
2005
ACM
15 years 8 months ago
A relevance-extended multi-dimensional model for a data warehouse contextualized with documents
Current data warehouse and OLAP technologies can be applied to analyze the structured data that companies store in their databases. The circumstances that describe the context ass...
Juan Manuel Pérez, Rafael Berlanga Llavori,...
ICMCS
2007
IEEE
16 years 25 days ago
Word Topical Mixture Models for Extractive Spoken Document Summarization
This paper considers extractive summarization of Chinese spoken documents. In contrast to conventional approaches, we attempt to deal with the extractive summarization problem und...
Berlin Chen, Yi-Ting Chen
CVPR
2010
IEEE
16 years 2 months ago
Improving State-of-the-Art OCR through High-Precision Document-Specific Modeling
Optical character recognition (OCR) remains a difficult problem for noisy documents or documents not scanned at high resolution. Many current approaches rely on stored font models...
Andrew Kae, Gary Huang, Erik Learned-Miller, Carl ...