Search Sciweavers | Sciweavers

6240 search results - page 372 / 1248



WWW
2006
ACM


Detecting spam web pages through content analysis

16 years 7 months ago

Download research.microsoft.com

In this paper, we continue our investigations of "web spam": the injection of artificially-created pages into the web in order to influence the results from search engin...

Alexandros Ntoulas, Marc Najork, Mark Manasse, Den...


WWW
2010
ACM


CETR: content extraction via tag ratios

16 years 1 month ago

Download www.cs.illinois.edu

We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...

Tim Weninger, William H. Hsu, Jiawei Han


WWW
2010
ACM


What is disputed on the web?

16 years 1 month ago

Download berkeley.intel-research.net

We present a method for automatically acquiring a corpus of disputed claims from the web. We consider a factual claim to be disputed if a page on the web suggests both that the...

Rob Ennals, Dan Byler, John Mark Agosta, Barbara R...


HT
2006
ACM


Evaluation of crawling policies for a web-repository crawler

16 years 24 days ago

Download www.cs.odu.edu

We have developed a web-repository crawler that is used for reconstructing websites when backups are unavailable. Our crawler retrieves web resources from the Internet Archive, Go...

Frank McCown, Michael L. Nelson


WEBI
2005
Springer


A Semi-Supervised Document Clustering Algorithm Based on EM

16 years 9 days ago

Download www.dii.unisi.it

Document clustering is a very hard task in Automatic Text Processing since it requires extracting regular patterns from a document collection without a priori knowledge on the cat...

Leonardo Rigutini, Marco Maggini
