Extractive text summarization aims to create a condensed version of one or more source documents by selecting the most informative sentences. Research in text summarization has th...
We describe a method for generating accurate, compact, human understandable text classifiers. Text datasets are indexed using Apache Lucene and Genetic Programs are used to constr...
In multi-label text categorization, determining the final set of classes that will label a given document is not trivial. It implies first to determine whether a class is suitable ...
This paper describes a method for definition question answering based on the use of surface text patterns. The method is specially suited to answer questions about person's po...
The Linguistic Data Consortium (LDC) is currently involved in a major effort to expand its multilingual text resources, in particular for machine translation, message understandin...