Search Sciweavers | Sciweavers

183

HT
2006
ACM

92views Internet Technology» more HT 2006»

Evaluation of crawling policies for a web-repository crawler

16 years 19 days ago

We have developed a web-repository crawler that is used for reconstructing websites when backups are unavailable. Our crawler retrieves web resources from the Internet Archive, Go...

Frank McCown, Michael L. Nelson

claim paper

Read More »

156

click to vote

ICWSM
2010

119views Internet Technology» more ICWSM 2010»

Social Intellisense: A Task-Embedded Interface to Folksonomies

15 years 8 months ago

Download research.microsoft.com

We present an application for accessing and creating socially constructed sets of information. Users store and retrieve information, such as bits of text, through the use of "...

Scott Counts, Kristie Fisher, Aaron Hoff

claim paper

Read More »

170

click to vote

SIGIR
2008
ACM

138views Information Technology» more SIGIR 2008»

To tag or not to tag -: harvesting adjacent metadata in large-scale tagging systems

15 years 6 months ago

Download lsirpeople.epfl.ch

We present HAMLET, a suite of principles, scoring models and algorithms to automatically propagate metadata along edges in a document neighborhood. As a showcase scenario we consi...

Adriana Budura, Sebastian Michel, Philippe Cudr&ea...

claim paper

Read More »

185

click to vote

SIGIR
2010
ACM

169views Information Technology» more SIGIR 2010»

Efficient partial-duplicate detection based on sequence matching

15 years 1 months ago

Download homepage.fudan.edu.cn

With the ever-increasing growth of the Internet, numerous copies of documents become serious problem for search engine, opinion mining and many other web applications. Since parti...

Qi Zhang, Yue Zhang, Haomin Yu, Xuanjing Huang

claim paper

Read More »

172

click to vote

WWW
2005
ACM

138views Internet Technology» more WWW 2005»

Disambiguating Web appearances of people in a social network

16 years 7 months ago

Download www.ra.ethz.ch

Say you are looking for information about a particular person. A search engine returns many pages for that person's name but which pages are about the person you care about, ...

Ron Bekkerman, Andrew McCallum

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers