Sciweavers

VLSID
2006
IEEE
Improved Data Compression for Serial Interconnected Network on Chip through Unused Significant Bit Removal
Serial links in network on chip provide advantages in terms of reduced wiring area, reduced switch complexity and power. However, serial links offer lower bandwidth in comparison ...
Simon Ogg, Bashir M. Al-Hashimi
ICDM
2007
IEEE
A Generalization of Proximity Functions for K-Means
K-means is a widely used partitional clustering method. Considerable effort has gone into finding better proximity (distance) functions for K-means. However, the common c...
Junjie Wu, Hui Xiong, Jian Chen, Wenjun Zhou
SIGMOD
2003
ACM
Robust and Efficient Fuzzy Match for Online Data Cleaning
To ensure high data quality, data warehouses must validate and cleanse incoming data tuples from external sources. In many situations, clean tuples must match acceptable tuples in...
Surajit Chaudhuri, Kris Ganjam, Venkatesh Ganti, R...
ICDCS
2002
IEEE
Improving Search in Peer-to-Peer Networks
Peer-to-peer systems have emerged as a popular way to share huge volumes of data. The usability of these systems depends on effective techniques to find and retrieve data; howeve...
Beverly Yang, Hector Garcia-Molina
ICDM
2009
IEEE
Significance of Episodes Based on Minimal Windows
Discovering episodes, frequent sets of events in a sequence, has been an active field in pattern mining. Traditionally, a level-wise approach is used to discover all frequent epis...
Nikolaj Tatti