Sciweavers

» A Document as a Small World
WWW
2006
ACM
16 years 7 months ago
Robust web content extraction
We present an empirical evaluation and comparison of two content extraction methods in HTML: absolute XPath expressions and relative XPath expressions. We argue that the relative ...
Marek Kowalkiewicz, Maria E. Orlowska, Tomasz Kacz...
SIGMOD
2006
ACM
16 years 6 months ago
Paper-based mobile access to databases
Our demonstration is a paper-based interactive guide for visitors to the world's largest international arts festival that was developed as part of a project investigating new...
Beat Signer, Moira C. Norrie, Michael Grossniklaus...
SIGMOD
2005
ACM
16 years 6 months ago
Efficient Keyword Search for Smallest LCAs in XML Databases
Keyword search is a proven, user-friendly way to query HTML documents in the World Wide Web. We propose keyword search in XML documents, modeled as labeled trees, and describe cor...
Yu Xu, Yannis Papakonstantinou
WWW
2010
ACM
16 years 1 month ago
CETR: content extraction via tag ratios
We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...
Tim Weninger, William H. Hsu, Jiawei Han
ICDM
2008
IEEE
16 years 25 days ago
Formal Models for Expert Finding on DBLP Bibliography Data
Finding relevant experts in a specific field is often crucial for consulting, both in industry and in academia. The aim of this paper is to address the expert-finding task in a...
Hongbo Deng, Irwin King, Michael R. Lyu