Sciweavers

1458 search results - page 111 / 292
» Practical Preference Relations for Large Data Sets
BMCBI
2005
246 views
15 years 6 months ago
ParPEST: a pipeline for EST data analysis based on parallel computing
Background: Expressed Sequence Tags (ESTs) are short and error-prone DNA sequences generated from the 5' and 3' ends of randomly selected cDNA clones. They provide an im...
Nunzio D'Agostino, Mario Aversano, Maria Luisa Chi...
ICDE
2005
IEEE
135 views, Database
16 years 7 months ago
Finding (Recently) Frequent Items in Distributed Data Streams
We consider the problem of maintaining frequency counts for items occurring frequently in the union of multiple distributed data streams. Naïve methods of combining approximate f...
Amit Manjhi, Vladislav Shkapenyuk, Kedar Dhamdhere...
EDBT
2004
ACM
192 views, Database
16 years 6 months ago
LIMBO: Scalable Clustering of Categorical Data
Abstract. Clustering is a problem of great practical importance in numerous applications. The problem of clustering becomes more challenging when the data is categorical, that is, ...
Periklis Andritsos, Panayiotis Tsaparas, René...
HICSS
2007
IEEE
130 views, Biometrics
16 years 14 days ago
Analysis of Activity in the Open Source Software Development Community
Open Source Software is computer software for which the source code is publicly open for inspection, modification, and redistribution. While research of a few, large, successf...
Scott Christley, Gregory R. Madey
BMCBI
2005
101 views
15 years 6 months ago
TmaDB: a repository for tissue microarray data
Background: Tissue microarray (TMA) technology has been developed to facilitate large, genome-scale molecular pathology studies. This technique provides a high-throughput method f...
Archana Sharma-Oates, Philip Quirke, David R. West...