Sciweavers

KDD
2008
ACM
16 years 7 months ago
Febrl -: an open source data cleaning, deduplication and record linkage system with a graphical user interface
Matching records that refer to the same entity across databases is becoming an increasingly important part of many data mining projects, as often data from multiple sources needs ...
Peter Christen
SIGIR
2010
ACM
15 years 10 months ago
Evaluating whole-page relevance
Whole page relevance defines how well the surface-level representation of all elements on a search result page and the corresponding holistic attributes of the presentation respon...
Peter Bailey, Nick Craswell, Ryen W. White, Liwei ...
SYSTOR
2009
ACM
16 years 1 month ago
The effectiveness of deduplication on virtual machine disk images
Virtualization is becoming widely deployed in servers to efficiently provide many logically separate execution environments while reducing the need for physical servers. While th...
Keren Jin, Ethan L. Miller
BALT
2006
15 years 10 months ago
A Multiple Correspondence Analysis to Organize Data Cubes
Abstract. On Line Analytical Processing (OLAP) is a technology basically created to provide users with tools in order to explore and navigate into data cubes. Unfortunately, in hug...
Riadh Ben Messaoud, Omar Boussaid, Sabine Loudcher...
SMC
2007
IEEE
16 years 24 days ago
Sense based organization of descriptive data
In this paper we propose a new technique allowing to map descriptive data into relative distance space, which is based primarily on senses of the terms stored in our data. We u...
M. Shahriar Hossain, Monika Akbar, Rafal A. Angryk