Sciweavers

2554 search results - page 336 / 511
» Keyword query cleaning
Sort
View
SIGMOD
2011
ACM
248views Database» more  SIGMOD 2011»
14 years 9 months ago
Llama: leveraging columnar storage for scalable join processing in the MapReduce framework
To achieve high reliability and scalability, most large-scale data warehouse systems have adopted the cluster-based architecture. In this paper, we propose the design of a new clu...
Yuting Lin, Divyakant Agrawal, Chun Chen, Beng Chi...
SIGMOD
2011
ACM
150views Database» more  SIGMOD 2011»
14 years 9 months ago
WHAM: a high-throughput sequence alignment method
Over the last decade the cost of producing genomic sequences has dropped dramatically due to the current so called “next-gen” sequencing methods. However, these next-gen seque...
Yinan Li, Allison Terrell, Jignesh M. Patel
CIKM
2011
Springer
14 years 6 months ago
Hybrid models for future event prediction
We present a hybrid method to turn off-the-shelf information retrieval (IR) systems into future event predictors. Given a query, a time series model is trained on the publication...
Giuseppe Amodeo, Roi Blanco, Ulf Brefeld
SIGMOD
2012
ACM
276views Database» more  SIGMOD 2012»
13 years 9 months ago
SCARAB: scaling reachability computation on large graphs
Most of the existing reachability indices perform well on small- to medium- size graphs, but reach a scalability bottleneck around one million vertices/edges. As graphs become inc...
Ruoming Jin, Ning Ruan, Saikat Dey, Jeffrey Xu Yu
SIGMOD
2012
ACM
234views Database» more  SIGMOD 2012»
13 years 9 months ago
BloomUnit: declarative testing for distributed programs
We present BloomUnit, a testing framework for distributed programs written in the Bloom language. BloomUnit allows developers to write declarative test specifications that descri...
Peter Alvaro, Andrew Hutchinson, Neil Conway, Will...