Sciweavers

2277 search results - page 101 / 456
» Clustering by pattern similarity in large data sets
Sort
View
KDD
2010
ACM
272views Data Mining» more  KDD 2010»
15 years 4 months ago
Scalable similarity search with optimized kernel hashing
Scalable similarity search is the core of many large scale learning or data mining applications. Recently, many research results demonstrate that one promising approach is creatin...
Junfeng He, Wei Liu, Shih-Fu Chang
GRID
2004
Springer
15 years 11 months ago
High Performance Threaded Data Streaming for Large Scale Simulations
We have developed a threaded parallel data streaming approach using Logistical Networking (LN) to transfer multi-terabyte simulation data from computers at NERSC to our local anal...
Viraj Bhat, Scott Klasky, Scott Atchley, Micah Bec...
WABI
2005
Springer
179views Bioinformatics» more  WABI 2005»
15 years 12 months ago
Spectral Clustering Gene Ontology Terms to Group Genes by Function
Abstract. With the invention of biotechnological high throughput methods like DNA microarrays, biologists are capable of producing huge amounts of data. During the analysis of such...
Nora Speer, Christian Spieth, Andreas Zell
SSD
2005
Springer
122views Database» more  SSD 2005»
15 years 12 months ago
Selectivity Estimation of High Dimensional Window Queries via Clustering
Abstract. Query optimization is an important functionality of modern database systems and often based on estimating the selectivity of queries before actually executing them. Well-...
Christian Böhm, Hans-Peter Kriegel, Peer Kr&o...
ICDM
2002
IEEE
163views Data Mining» more  ICDM 2002»
15 years 11 months ago
High Performance Data Mining Using the Nearest Neighbor Join
The similarity join has become an important database primitive to support similarity search and data mining. A similarity join combines two sets of complex objects such that the r...
Christian Böhm, Florian Krebs