We introduce a technique for creating novel, textuallyenhanced thumbnails of Web pages. These thumbnails combine the advantages of image thumbnails and text summaries to provide c...
Allison Woodruff, Andrew Faulring, Ruth Rosenholtz...
It is widely believed that some queries submitted to search engines are by nature ambiguous (e.g., java, apple). However, few studies have investigated the questions of "how ...
Semantic relatedness between words is important to many NLP tasks, and numerous measures exist which use a variety of resources. Thus far, such work is confined to measuring simil...
In this paper we introduce a framework for automated text recognition from images. We first describe a simple but efficient text detection and recognition method based on analysis...
Tasks recognizing named entities such as products, people names, or locations from documents have recently received significant attention in the literature. Many solutions to thes...