Sciweavers

5575 search results - page 683 / 1115
» Information Extraction
Sort
View
194
Voted
WWW
2005
ACM
16 years 7 months ago
The volume and evolution of web page templates
Web pages contain a combination of unique content and template material, which is present across multiple pages and used primarily for formatting, navigation, and branding. We stu...
David Gibson, Kunal Punera, Andrew Tomkins
160
Voted
WWW
2005
ACM
16 years 7 months ago
Automatically learning document taxonomies for hierarchical classification
While several hierarchical classification methods have been applied to web content, such techniques invariably rely on a pre-defined taxonomy of documents. We propose a new techni...
Kunal Punera, Suju Rajan, Joydeep Ghosh
WWW
2010
ACM
16 years 1 months ago
Towards comment-based cross-media retrieval
This paper investigates whether Web comments can be exploited for cross-media retrieval. Comparing Web items such as texts, images, videos, music, products, or personal profiles ...
Martin Potthast, Benno Stein, Steffen Becker
IEEEIAS
2009
IEEE
16 years 1 months ago
A Block Feature Correlation Based Image Watermarking for Tamper Detection Using Linear Equation
: This paper proposes a new watermarking method for detecting image tampering. First, an authentication number is created using a pair of watermark pixels as the coefficients of a ...
Chin-Feng Lee, Kuo-Nan Chen, Chin-Chen Chang, Meng...
AIRWEB
2009
Springer
16 years 1 months ago
A study of link farm distribution and evolution using a time series of web snapshots
In this paper, we study the overall link-based spam structure and its evolution which would be helpful for the development of robust analysis tools and research for Web spamming a...
Young-joo Chung, Masashi Toyoda, Masaru Kitsuregaw...