Sciweavers

» Web Document Modeling
SETN
2010
Springer
16 years 1 month ago
Scalable Semantic Annotation of Text Using Lexical and Web Resources
Abstract. In this paper we are dealing with the task of adding domain-specific semantic tags to a document, based solely on the domain ontology and generic lexical and Web resource...
Elias Zavitsanos, George Tsatsaronis, Iraklis Varl...
ICDAR
2003
IEEE
15 years 11 months ago
Lexical Postcorrection of OCR-Results: The Web as a Dynamic Secondary Dictionary?
Postcorrection of OCR-results for text documents is usually based on electronic dictionaries. When scanning texts from a specific thematic area, conventional dictionaries often m...
Christian M. Strohmaier, Christoph Ringlstetter, K...
ISMIS
2003
Springer
15 years 11 months ago
MetaNews: An Information Agent for Gathering News Articles on the Web
This paper presents MetaNews, an information gathering agent for news articles on the Web. MetaNews reads HTML documents from online news sites and extracts article information fro...
Dae-Ki Kang, Joongmin Choi
IJCAI
2003
15 years 8 months ago
Coherent Keyphrase Extraction via Web Mining
Keyphrases are useful for a variety of purposes, including summarizing, indexing, labeling, categorizing, clustering, highlighting, browsing, and searching. The task of automatic ...
Peter D. Turney
CIKM
2011
Springer
14 years 6 months ago
Towards noise-resilient document modeling
We introduce a generative probabilistic document model based on latent Dirichlet allocation (LDA), to deal with textual errors in the document collection. Our model is inspired by...
Tao Yang, Dongwon Lee