Topic modeling techniques have widespread use in text data mining applications. Some applications use batch models, which perform clustering on the document collection in aggregat...
This paper presents an original approach to modelling user’s information need in text filtering environment. This approach relies on a specific novelty detection model which a...
We propose a multiclass (MC) classification approach to text categorization (TC). To fully take advantage of both positive and negative training examples, a maximal figure-of-meri...
Previous work has shown that high quality phrasal paraphrases can be extracted from bilingual parallel corpora. However, it is not clear whether bitexts are an appropriate resourc...
Juri Ganitkevitch, Chris Callison-Burch, Courtney ...
The main purpose of communication is to exchange information. Any discourse understanding model should be able to process the flow of information throughout the entire text. Accor...