Sciweavers

5209 search results - page 779 / 1042
» Multiobjective Data Clustering
Sort
View
CIKM
2011
Springer
14 years 6 months ago
Probabilistic near-duplicate detection using simhash
This paper offers a novel look at using a dimensionalityreduction technique called simhash [8] to detect similar document pairs in large-scale collections. We show that this algo...
Sadhan Sood, Dmitri Loguinov
HPDC
2005
IEEE
16 years 6 days ago
Generosity and gluttony in GEMS: grid enabled molecular simulations
Biomolecular simulations produce more output data than can be managed effectively by traditional computing systems. Researchers need distributed systems that allow the pooling of...
Justin M. Wozniak, Paul Brenner, Douglas Thain, Aa...
CVPR
2001
IEEE
16 years 8 months ago
A Weighted Non-Negative Matrix Factorization for Local Representations
This paper presents an improvement of the classical Non-negative Matrix Factorization (NMF) approach, for dealing with local representations of image objects. NMF, when applied to...
David Guillamet, Jordi Vitrià, Marco Bressa...
CVPR
2008
IEEE
16 years 8 months ago
Sequential sparsification for change detection
This paper presents a general method for segmenting a vector valued sequence into an unknown number of subsequences where all data points from a subsequence can be represented wit...
Necmiye Ozay, Mario Sznaier, Octavia I. Camps
ICDE
2005
IEEE
154views Database» more  ICDE 2005»
16 years 8 months ago
Deep Store: an Archival Storage System Architecture
We present the Deep Store archival storage architecture, a large-scale storage system that stores immutable data efficiently and reliably for long periods of time. Archived data i...
Lawrence You, Kristal T. Pollack, Darrell D. E. Lo...