Search Sciweavers | Sciweavers

28131 search results - page 5460 / 5627

» Images, Images, Billions of Images

167

click to vote

WWW
2009
ACM

131views Internet Technology» more WWW 2009»

Purely URL-based topic classification

16 years 7 months ago

Download www2009.org

Given only the URL of a web page, can we identify its topic? This is the question that we examine in this paper. Usually, web pages are classified using their content [7], but a U...

Eda Baykan, Monika Rauch Henzinger, Ludmila Marian...

claim paper

Read More »

182

click to vote

WWW
2008
ACM

130views Internet Technology» more WWW 2008»

A differential notion of place for local search

16 years 7 months ago

Download people.kmi.open.ac.uk

For extracting the characteristics a specific geographic entity, and notably a place, we propose to use dynamic Extreme Tagging Systems in combination with the classic approach of...

Vlad Tanasescu, John Domingue

claim paper

Read More »

175

click to vote

WWW
2005
ACM

135views Internet Technology» more WWW 2005»

LSH forest: self-tuning indexes for similarity search

16 years 7 months ago

Download www2005.org

We consider the problem of indexing high-dimensional data for answering (approximate) similarity-search queries. Similarity indexes prove to be important in a wide variety of sett...

Mayank Bawa, Tyson Condie, Prasanna Ganesan

claim paper

Read More »

170

click to vote

WWW
2005
ACM

150views Internet Technology» more WWW 2005»

Extracting context to improve accuracy for HTML content extraction

16 years 7 months ago

Download www1.cs.columbia.edu

Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...

Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo

claim paper

Read More »

168

click to vote

WWW
2004
ACM

130views Internet Technology» more WWW 2004»

Managing versions of web documents in a transaction-time web server

16 years 7 months ago

Download www.iw3c2.org

This paper presents a transaction-time HTTP server, called ? Apache that supports document versioning. A document often consists of a main file formatted in HTML or XML and severa...

Curtis E. Dyreson, Hui-ling Lin, Yingxia Wang

claim paper

Read More »

« Prev « First page 5460 / 5627 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers