Sciweavers

4670 search results - page 263 / 934
» Testing that distributions are close
EUROPAR
1999
Springer
15 years 11 months ago
Parallel k/h-Means Clustering for Large Data Sets
This paper describes the realization of a parallel version of the k/h-means clustering algorithm. This is one of the basic algorithms used in a wide range of data mining tasks. We ...
Kilian Stoffel, Abdelkader Belkoniene
CORR
2008
Springer
15 years 6 months ago
Inference of Flow Statistics via Packet Sampling in the Internet
We show in this note that by deterministic packet sampling, the tail of the distribution of the original flow size can be obtained by rescaling that of the sampled flow size. To re...
Yousra Chabchoub, Christine Fricker, Fabrice Guill...
MOBIQUITOUS
2008
IEEE
16 years 1 months ago
ScreenSpot: multidimensional resource discovery for distributed applications in smart spaces
The big challenge related to the contemporary research on ubiquitous and pervasive computing is that of seamless integration. For the next generation of ubiquitous and distributed...
Marko Jurmu, Sebastian Boring, Jukka Riekki
HPDC
2006
IEEE
16 years 22 days ago
Adaptive I/O Scheduling for Distributed Multi-applications Environments
The aIOLi project aims at optimizing the I/O accesses within the cluster by providing a simple POSIX API, thus avoiding the constraints to use a dedicated parallel I/O library. Th...
Adrien Lebre, Yves Denneulin, Guillaume Huard, Prz...
TREC
2008
15 years 8 months ago
Distributed EDLSI, BM25, and Power Norm at TREC 2008
This paper describes our participation in the TREC Legal competition in 2008. Our first set of experiments involved the use of Latent Semantic Indexing (LSI) with a small number of...
April Kontostathis, Andrew Lilly, Raymond J. Spite...