Sciweavers

» Indexing uncertain data
BMCBI
2010
15 years 6 months ago
Testing the additional predictive value of high-dimensional molecular data
Background: While high-dimensional molecular data such as microarray gene expression data have been used for disease outcome prediction or diagnosis purposes for about ten years i...
Anne-Laure Boulesteix, Torsten Hothorn
BIB
2011
14 years 10 months ago
Using cross-validation to evaluate predictive accuracy of survival risk classifiers based on high-dimensional data
Developments in whole genome biotechnology have stimulated statistical focus on prediction methods. We review here methodology for classifying patients into survival risk groups a...
Richard M. Simon, Jyothi Subramanian, Ming-Chung L...
WWW
2003
ACM
16 years 7 months ago
Text joins in an RDBMS for web data integration
The integration of data produced and collected across autonomous, heterogeneous web services is an increasingly important and challenging problem. Due to the lack of global identi...
Luis Gravano, Panagiotis G. Ipeirotis, Nick Koudas...
KDD
2005
ACM
16 years 7 months ago
Email data cleaning
Addressed in this paper is the issue of 'email data cleaning' for text mining. Many text mining applications need to take emails as input. Email data is usually noisy and thus i...
Jie Tang, Hang Li, Yunbo Cao, ZhaoHui Tang
VLDB
2007
ACM
16 years 6 months ago
CADS: Continuous Authentication on Data Streams
We study processing and authentication of long-running queries on outsourced data streams. In this scenario, a data owner (DO) constantly transmits its data to a service provider ...
Stavros Papadopoulos, Yin Yang, Dimitris Papadias