Sciweavers

2827 search results - page 243 / 566
» Marking Text Documents
Sort
View
COLING
2008
15 years 8 months ago
A Framework for Identifying Textual Redundancy
The task of identifying redundant information in documents that are generated from multiple sources provides a significant challenge for summarization and QA systems. Traditional ...
Kapil Thadani, Kathleen McKeown
AAAI
2006
15 years 8 months ago
Comparative Experiments on Sentiment Classification for Online Product Reviews
Evaluating text fragments for positive and negative subjective expressions and their strength can be important in applications such as single- or multi- document summarization, do...
Hang Cui, Vibhu O. Mittal, Mayur Datar
ICML
2005
IEEE
16 years 7 months ago
Modeling word burstiness using the Dirichlet distribution
Multinomial distributions are often used to model text documents. However, they do not capture well the phenomenon that words in a document tend to appear in bursts: if a word app...
Rasmus Elsborg Madsen, David Kauchak, Charles Elka...
ACL
2008
15 years 8 months ago
In-Browser Summarisation: Generating Elaborative Summaries Biased Towards the Reading Context
We investigate elaborative summarisation, where the aim is to identify supplementary information that expands upon a key fact. We envisage such summaries being useful when browsin...
Stephen Wan, Cécile Paris
IFIP12
2004
15 years 8 months ago
Impact on Performance of Hypertext Classification of Selective Rich HTML Capture
: Hypertext categorization is the automatic classification of web documents into predefined classes. It poses new challenges for automatic categorization because of the rich inform...
Houda Benbrahim, Max Bramer