We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...
Here we present PaperSpace a computer vision based document management system that allows users to combine paper and digital documents. Using PaperSpace users can locate paper cop...
Jeff Smith, Jeremy Long, Tanya Lung, Mohd M. Anwar...
We present a freely available benchmark dataset for audio classification and clustering. This dataset consists of 10 seconds samples of 1886 songs obtained from the Garageband si...
This paper presents results from a desktop experiment in which the participants’ route selection behavior in an unknown street network is investigated. The participants were pres...
In this paper, we present an overview of extensible Retrieval, Annotation and Caching Engine (eRACE), a modular and distributed intermediary infrastructure that collects informati...