Sciweavers

2827 search results - page 368 / 566
» Marking Text Documents
Sort
View
NAACL
2010
15 years 4 months ago
Quantifying the Limits and Success of Extractive Summarization Systems Across Domains
This paper analyzes the topic identification stage of single-document automatic text summarization across four different domains, consisting of newswire, literary, scientific and ...
Hakan Ceylan, Rada Mihalcea, Umut O'zertem, Elena ...
EMNLP
2009
15 years 4 months ago
Polylingual Topic Models
Topic models are a useful tool for analyzing large text collections, but have previously been applied in only monolingual, or at most bilingual, contexts. Meanwhile, massive colle...
David M. Mimno, Hanna M. Wallach, Jason Naradowsky...
INTERSPEECH
2010
15 years 1 months ago
Semi-automated update of automatic transcription system for the Japanese national congress
Update of acoustic and language models is vital to maintain performance of automatic speech recognition (ASR) systems. To alleviate efforts for updating models, we propose a "...
Yuya Akita, Masato Mimura, Graham Neubig, Tatsuya ...
CLEF
2011
Springer
14 years 6 months ago
Author Identification Using Semi-supervised Learning - Notebook for PAN at CLEF 2011
Author identification models fall into two major categories according to the way they handle the training texts: profile-based models produce one representation per author while in...
Ioannis Kourtis, Efstathios Stamatatos
KAIS
2006
247views more  KAIS 2006»
15 years 6 months ago
XCQ: A queriable XML compression system
XML has already become the de facto standard for specifying and exchanging data on the Web. However, XML is by nature verbose and thus XML documents are usually large in size, a fa...
Wilfred Ng, Wai Yeung Lam, Peter T. Wood, Mark Lev...