Sciweavers

630 search results - page 72 / 126
» Optimized union of non-disjoint distributed data sets
Sort
View
BMCBI
2006
110views more  BMCBI 2006»
15 years 6 months ago
Bias in error estimation when using cross-validation for model selection
Background: Cross-validation (CV) is an effective method for estimating the prediction error of a classifier. Some recent articles have proposed methods for optimizing classifiers...
Sudhir Varma, Richard Simon
IEEEPACT
2002
IEEE
15 years 11 months ago
Workload Design: Selecting Representative Program-Input Pairs
Having a representative workload of the target domain of a microprocessor is extremely important throughout its design. The composition of a workload involves two issues: (i) whic...
Lieven Eeckhout, Hans Vandierendonck, Koenraad De ...
EDBT
2008
ACM
169views Database» more  EDBT 2008»
16 years 6 months ago
BioScout: a life-science query monitoring system
Scientific data are available through an increasing number of heterogeneous, independently evolving, sources. Although the sources themselves are independently evolving, the data ...
Anastasios Kementsietsidis, Frank Neven, Dieter Va...
KDD
2008
ACM
134views Data Mining» more  KDD 2008»
16 years 6 months ago
Privacy-preserving cox regression for survival analysis
Privacy-preserving data mining (PPDM) is an emergent research area that addresses the incorporation of privacy preserving concerns to data mining techniques. In this paper we prop...
Shipeng Yu, Glenn Fung, Rómer Rosales, Srir...
SSDBM
2005
IEEE
111views Database» more  SSDBM 2005»
15 years 11 months ago
Querying Streaming Geospatial Image Data: The GeoStreams Project
Data products generated from remotely-sensed, geospatial imagery (RSI) used in emerging areas, such as global climatology, environmental monitoring, land use, and disaster managem...
Quinn Hart, Michael Gertz