Search Sciweavers | Sciweavers

24444 search results - page 4461 / 4889

» A Data Model for Data Integration

141

click to vote

WWW
2008
ACM

109views Internet Technology» more WWW 2008»

Recrawl scheduling based on information longevity

16 years 7 months ago

Download www2008.org

It is crucial for a web crawler to distinguish between ephemeral and persistent content. Ephemeral content (e.g., quote of the day) is usually not worth crawling, because by the t...

Christopher Olston, Sandeep Pandey

claim paper

Read More »

173

Voted

WWW
2008
ACM

142views Internet Technology» more WWW 2008»

Mining for personal name aliases on the web

16 years 7 months ago

Download www2008.org

We propose a novel approach to find aliases of a given name from the web. We exploit a set of known names and their aliases as training data and extract lexical patterns that conv...

Danushka Bollegala, Taiki Honma, Yutaka Matsuo, Mi...

claim paper

Read More »

160

Voted

WWW
2008
ACM

120views Internet Technology» more WWW 2008»

Folksoviz: a subsumption-based folksonomy visualization using wikipedia texts

16 years 7 months ago

Download www2008.org

In this paper, targeting del.icio.us tag data, we propose a method, FolksoViz, for deriving subsumption relationships between tags by using Wikipedia texts, and visualizing a folk...

Kangpyo Lee, Hyunwoo Kim, Chungsu Jang, Hyoung-Joo...

claim paper

Read More »

217

Voted

WWW
2007
ACM

144views Internet Technology» more WWW 2007»

Towards domain-independent information extraction from web tables

16 years 7 months ago

Download www2007.org

Traditionally, information extraction from web tables has focused on small, more or less homogeneous corpora, often based on assumptions about the use of <table> tags. A mul...

Bernhard Krüpl, Bernhard Pollak, Marcus Herzo...

claim paper

Read More »

192

click to vote

WWW
2006
ACM

206views Internet Technology» more WWW 2006»

Beyond PageRank: machine learning for static ranking

16 years 7 months ago

Download www2006.org

Since the publication of Brin and Page's paper on PageRank, many in the Web community have depended on PageRank for the static (query-independent) ordering of Web pages. We s...

Matthew Richardson, Amit Prakash, Eric Brill

claim paper

Read More »

« Prev « First page 4461 / 4889 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers