The problem of document categorization is considered. The set of domains and the keywords specific for these domains is supposed to be selected beforehand as initial data. We apply...
Mikhail Alexandrov, Alexander F. Gelbukh, George L...
In this paper we describe a rule-based formalism for the analysis and labelling of texts segments. The rules are contextual rewriting rules with a restricted form of negation. They...
: Up to now, the results of applying sophisticated NL techniques to IR have been mostly disappointing. Our research aims at investigating in detail the role of syntactic analysis i...
This paper describes a text generation system, XExplainer, which can dynamically produce a description of commodities in Korean from a relational database for homeshopping sites. ...
We study the problem of topic segmentation of manually transcribed speech in order to facilitate information extraction from dialogs. Our approach is based on a combination of mul...