Search Sciweavers | Sciweavers

4234 search results - page 337 / 847

» A Method for Web Information Extraction

163

click to vote

WWW
2005
ACM

124views Internet Technology» more WWW 2005»

Scaling link-based similarity search

16 years 7 months ago

Download www2005.org

To exploit the similarity information hidden in the hyperlink structure of the web, this paper introduces algorithms scalable to graphs with billions of vertices on a distributed ...

Balázs Rácz, Dániel Fogaras

claim paper

Read More »

172

click to vote

CIKM
2008
Springer

113views Information Technology» more CIKM 2008»

Information shared by many objects

15 years 8 months ago

Download learn.tsinghua.edu.cn

If Kolmogorov complexity [25] measures information in one object and Information Distance [4, 23, 24, 42] measures information shared by two objects, how do we measure information...

Chong Long, Xiaoyan Zhu, Ming Li, Bin Ma

claim paper

Read More »

159

click to vote

ACL
2006

95views Computational Linguistics» more ACL 2006»

Selection of Effective Contextual Information for Automatic Synonym Acquisition

15 years 8 months ago

Download acl.ldc.upenn.edu

Various methods have been proposed for automatic synonym acquisition, as synonyms are one of the most fundamental lexical knowledge. Whereas many methods are based on contextual c...

Masato Hagiwara, Yasuhiro Ogawa, Katsuhiko Toyama

claim paper

Read More »

169

click to vote

SIGIR
2002
ACM

152views Information Technology» more SIGIR 2002»

Unsupervised document classification using sequential information maximization

15 years 6 months ago

Download www.cs.huji.ac.il

We present a novel sequential clustering algorithm which is motivated by the Information Bottleneck (IB) method. In contrast to the agglomerative IB algorithm, the new sequential ...

Noam Slonim, Nir Friedman, Naftali Tishby

claim paper

Read More »

208

click to vote

WWW
2008
ACM

163views Internet Technology» more WWW 2008»

As we may perceive: finding the boundaries of compound documents on the web

16 years 7 months ago

Download www2008.org

This paper considers the problem of identifying on the Web compound documents (cDocs) ? groups of web pages that in aggregate constitute semantically coherent information entities...

Pavel Dmitriev

claim paper

Read More »

« Prev « First page 337 / 847 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers