Sciweavers

WWW
2007
ACM
16 years 7 months ago
A no-frills architecture for lightweight answer retrieval
In a new model for answer retrieval, document collections are distilled offline into large repositories of facts. Each fact constitutes a potential direct answer to questions seek...
Marius Pasca
WWW
2007
ACM
16 years 7 months ago
Organizing and searching the world wide web of facts -- step two: harnessing the wisdom of the crowds
As part of a large effort to acquire large repositories of facts from unstructured text on the Web, a seed-based framework for textual information extraction allows for weakly sup...
Marius Pasca
WWW
2007
ACM
16 years 7 months ago
Preserving XML queries during schema evolution
In XML databases, new schema versions may be released as frequently as once every two weeks. This poster describes a taxonomy of changes for XML schema evolution. It examines the ...
Mirella Moura Moro, Susan Malaika, Lipyeow Lim
WWW
2007
ACM
16 years 7 months ago
Extensible schema documentation with XSLT 2.0
XML Schema documents are defined using an XML syntax, which means that the idea of generating schema documentation through standard XML technologies is intriguing. We present X2Do...
Felix Michel, Erik Wilde
WWW
2007
ACM
16 years 7 months ago
Detectives: detecting coalition hit inflation attacks in advertising networks streams
Click fraud is jeopardizing the industry of Internet advertising. Internet advertising is crucial for the thriving of the entire Internet, since it allows producers to advertise t...
Ahmed Metwally, Divyakant Agrawal, Amr El Abbadi
WWW
2007
ACM
16 years 7 months ago
A high-performance interpretive approach to schema-directed parsing
XML delivers key advantages in interoperability due to its flexibility, expressiveness, and platform-neutrality. As XML has become a performance-critical aspect of the next genera...
Morris Matsa, Eric Perkins, Abraham Heifets, Marga...
WWW
2007
ACM
16 years 7 months ago
Towards environment generated media: object-participation-type weblog in home sensor network
The environment generated media (EGM) are defined here as being generated from a massive amount of and/or incomprehensible environmental data by compressing them into averages or ...
Takuya Maekawa, Yutaka Yanagisawa, Takeshi Okadome
WWW
2007
ACM
16 years 7 months ago
Supporting end-users in the creation of dependable web clips
Web authoring environments enable end-users to create applications that integrate information from other web sources. Users can create web sites that include built-in components t...
Sandeep Lingam, Sebastian G. Elbaum
WWW
2007
ACM
16 years 7 months ago
Classifying web sites
In this paper, we present a novel method for the classification of Web sites. This method exploits both structure and content of Web sites in order to discern their functionality....
Christoph Lindemann, Lars Littig
WWW
2007
ACM
16 years 7 months ago
Towards effective browsing of large scale social annotations
This paper is concerned with the problem of browsing social annotations. Today, a lot of services (e.g., Del.icio.us, Flickr) have been provided for helping users to manage and sh...
Rui Li, Shenghua Bao, Yong Yu, Ben Fei, Zhong Su