Sciweavers

4137 search results - page 395 / 828
» Managing Expressions as Data in Relational Database Systems
Sort
View
SIGMOD
2008
ACM
158views Database» more  SIGMOD 2008»
16 years 7 months ago
Sampling cube: a framework for statistical olap over sampling data
Sampling is a popular method of data collection when it is impossible or too costly to reach the entire population. For example, television show ratings in the United States are g...
Xiaolei Li, Jiawei Han, Zhijun Yin, Jae-Gil Lee, Y...
KDD
2004
ACM
624views Data Mining» more  KDD 2004»
16 years 6 days ago
Programming the K-means clustering algorithm in SQL
Using SQL has not been considered an efficient and feasible way to implement data mining algorithms. Although this is true for many data mining, machine learning and statistical a...
Carlos Ordonez
NGITS
1997
Springer
15 years 11 months ago
Faster Joins, Self Joins and Multi-Way Joins Using Join Indices
We propose a new algorithm, called Stripe-join, for performing a join given a join index. Stripe-join is inspired by an algorithm called \Jive-join" developed by Li and Ross....
Hui Lei, Kenneth A. Ross
GPC
2007
Springer
16 years 1 months ago
Design of PeerSum: A Summary Service for P2P Applications
Sharing huge databases in distributed systems is inherently difficult. As the amount of stored data increases, data localization techniques become no longer sufficient. A more ef...
Rabab Hayek, Guillaume Raschia, Patrick Valduriez,...
GCC
2006
Springer
15 years 10 months ago
Grid Enabled Data Integration Framework for Bioinformatics Research
A framework is proposed to manage the distributed and heterogeneous databases in grid environment for understanding protein-protein interaction. Furthermore, the framework is used...
Jia Liu, Yongwei Wu, Weimin Zheng