This paper describes a text chunking system based on a generalization of the Winnow algorithm. We propose a general statistical model for text chunking which we then convert into ...
This paper presents a two-stage approach to summarizing multiple contrastive viewpoints in opinionated text. In the first stage, we use an unsupervised probabilistic approach to m...
This paper proposes, an efficient method for text independent writer identification using a codebook. The occurrence histogram of the shapes in the codebook is used to create a fea...
We consider the problem of predicting a movie's opening weekend revenue. Previous work on this problem has used metadata about a movie--e.g., its genre, MPAA rating, and cast...
Mahesh Joshi, Dipanjan Das, Kevin Gimpel, Noah A. ...
Analogy is heavily used in instructional texts. We introduce the concept of analogical dialogue acts (ADAs), which represent the roles utterances play in instructional analogies. ...