Sciweavers

4651 search results - page 241 / 931
» A Data Quality Browser
KDD
2007
ACM
182 views, Data Mining
16 years 7 months ago
Cleaning disguised missing data: a heuristic approach
In some applications such as filling in a customer information form on the web, some missing values may not be explicitly represented as such, but instead appear as potentially va...
Ming Hua, Jian Pei
KDD
2008
ACM
135 views, Data Mining
16 years 7 months ago
DiMaC: a disguised missing data cleaning tool
In some applications such as filling in a customer information form on the web, some missing values may not be explicitly represented as such, but instead appear as potentially va...
Ming Hua, Jian Pei
ICSEA
2007
IEEE
16 years 29 days ago
Test Data Generation from UML State Machine Diagrams using GAs
Automatic test data generation helps testers to validate software against user requirements more easily. Test data can be generated from many sources; for example, experience of t...
Chartchai Doungsa-ard, Keshav P. Dahal, M. Alamgir...
DILS
2005
Springer
16 years 6 days ago
Semantic Correspondence in Federated Life Science Data Integration Systems
For execution of complex biological queries, data integration systems often use several intermediate data sources because the domain coverage of individual sources is limited. Qual...
Malika Mahoui, Harshad Kulkarni, Nianhua Li, Zina ...
KDD
2006
ACM
122 views, Data Mining
16 years 7 months ago
Tensor-CUR decompositions for tensor-based data
Motivated by numerous applications in which the data may be modeled by a variable subscripted by three or more indices, we develop a tensor-based extension of the matrix CUR decom...
Michael W. Mahoney, Mauro Maggioni, Petros Drineas