Search Sciweavers | Sciweavers

184

CIKM
2011
Springer

218views Information Technology» more CIKM 2011»

Probabilistic near-duplicate detection using simhash

14 years 6 months ago

This paper oﬀers a novel look at using a dimensionalityreduction technique called simhash [8] to detect similar document pairs in large-scale collections. We show that this algo...

Sadhan Sood, Dmitri Loguinov

claim paper

Read More »

169

click to vote

CIKM
2011
Springer

192views Information Technology» more CIKM 2011»

Joint inference for cross-document information extraction

14 years 6 months ago

Download nlp.cs.qc.cuny.edu

Previous information extraction (IE) systems are typically organized as a pipeline architecture of separated stages which make independent local decisions. When the data grows bey...

Qi Li, Sam Anzaroot, Wen-Pin Lin, Xiang Li, Heng J...

claim paper

Read More »

180

click to vote

CIKM
2011
Springer

215views Information Technology» more CIKM 2011»

14 years 6 months ago

Classifying trending topics: a typology of conversation triggers on Twitter

Download nlp.uned.es

Twitter summarizes the great deal of messages posted by users in the form of trending topics that reﬂect the top conversations being discussed at a given moment. These trending ...

Arkaitz Zubiaga, Damiano Spina, Víctor Fres...

claim paper

Read More »

179

click to vote

CIKM
2011
Springer

183views Information Technology» more CIKM 2011»

Factorization-based lossless compression of inverted indices

14 years 6 months ago

Download www.cs.uwaterloo.ca

Many large-scale Web applications that require ranked top-k retrieval are implemented using inverted indices. An inverted index represents a sparse term-document matrix, where non...

George Beskales, Marcus Fontoura, Maxim Gurevich, ...

claim paper

Read More »

206

click to vote

CIKM
2011
Springer

185views Information Technology» more CIKM 2011»

Estimating selectivity for joined RDF triple patterns

14 years 6 months ago

Download www.it.swin.edu.au

A fundamental problem related to RDF query processing is selectivity estimation, which is crucial to query optimization for determining a join order of RDF triple patterns. In thi...

Hai Huang 0003, Chengfei Liu

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers