Search Sciweavers | Sciweavers

288

PRL
2011

259views Computer Networks» more PRL 2011»

A Bayes-true data generator for evaluation of supervised and unsupervised learning methods

14 years 9 months ago

Benchmarking pattern recognition, machine learning and data mining methods commonly relies on real-world data sets. However, there are some disadvantages in using real-world data....

Janick V. Frasch, Aleksander Lodwich, Faisal Shafa...

claim paper

Read More »

198

click to vote

SDM
2011
SIAM

198views Data Mining» more SDM 2011»

Exemplar-based Robust Coherent Biclustering

14 years 9 months ago

Download www.cs.iastate.edu

The biclustering, co-clustering, or subspace clustering problem involves simultaneously grouping the rows and columns of a data matrix to uncover biclusters or sub-matrices of the...

Kewei Tu, Xixiu Ouyang, Dingyi Han, Vasant Honavar

claim paper

Read More »

187

click to vote

SIGIR
2011
ACM

257views Information Technology» more SIGIR 2011»

No free lunch: brute force vs. locality-sensitive hashing for cross-lingual pairwise similarity

14 years 9 months ago

Download www.umiacs.umd.edu

This work explores the problem of cross-lingual pairwise similarity, where the task is to extract similar pairs of documents across two diﬀerent languages. Solutions to this pro...

Ferhan Ture, Tamer Elsayed, Jimmy J. Lin

claim paper

Read More »

215

click to vote

SIGMOD
2011
ACM

248views Database» more SIGMOD 2011»

Llama: leveraging columnar storage for scalable join processing in the MapReduce framework

14 years 9 months ago

Download www.comp.nus.edu.sg

To achieve high reliability and scalability, most large-scale data warehouse systems have adopted the cluster-based architecture. In this paper, we propose the design of a new clu...

Yuting Lin, Divyakant Agrawal, Chun Chen, Beng Chi...

claim paper

Read More »

169

click to vote

ICCV
2011
IEEE

160views Computer Vision» more ICCV 2011»

Source Constrained Clustering

14 years 6 months ago

Download www.ri.cmu.edu

We consider the problem of quantizing data generated from disparate sources, e.g. subjects performing actions with different styles, movies with particular genre bias, various con...

Ekaterina Taralova, Fernando DelaTorre, Martial He...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers