Sciweavers

17688 search results - page 346 / 3538
» Data Set Balancing
IDEAL
2005
Springer
16 years 5 days ago
Probabilistic Data Generation for Deduplication and Data Linkage
Abstract. In many data mining projects the data to be analysed contains personal information, like names and addresses. Cleaning and preprocessing of such data likely involves dedu...
Peter Christen
IPPS
2003
IEEE
15 years 12 months ago
Simulation of Dynamic Data Replication Strategies in Data Grids
Data Grids provide geographically distributed resources for large-scale data-intensive applications that generate large data sets. However, ensuring efficient access to such huge...
Houda Lamehamedi, Zujun Shentu, Boleslaw K. Szyman...
BNCOD
2008
15 years 8 months ago
Reconciling Inconsistent Data in Probabilistic XML Data Integration
Abstract. The problem of dealing with inconsistent data while integrating XML data from different sources is an important task, necessary to improve data integration quality. Typic...
Tadeusz Pankowski
TIFS
2008
15 years 6 months ago
Data Fusion and Cost Minimization for Intrusion Detection
Abstract. Statistical pattern recognition techniques have recently been shown to provide a finer balance between misdetections and false alarms than the more conventional intrusion...
Devi Parikh, Tsuhan Chen
MICCAI
2010
Springer
15 years 5 months ago
Multi-Class Sparse Bayesian Regression for Neuroimaging Data Analysis
The use of machine learning tools is gaining popularity in neuroimaging, as it provides a sensitive assessment of the information conveyed by brain images. In particular, finding ...
Vincent Michel, Evelyn Eger, Christine Keribin, Be...