We present a general framework for the task of extracting specific information “on demand” from a large corpus such as the Web under resource-constraints. Given a database wit...
Tables are ubiquitous in web pages and scientific documents. With the explosive development of the web, tables have become a valuable information repository. Therefore, effective...
This paper presents the Visor (VIdeo Surveillance Online Repository) project designed with the aim of establishing an open platform for collecting, annotating, retrieving, sharing...
The increasing use of multimedia in education makes text-production with computers important for students. What kind of role does the Internet play here as an external source of i...
Large web or e-commerce sites are frequently hosted on clusters. Successful open-source tools exist for clustering the front tiers of such sites (web servers and application serve...