Sciweavers

» Information and Data Quality in Spreadsheets
WWW
2007
ACM
16 years 7 months ago
Detecting near-duplicates for web crawling
Near-duplicate web documents are abundant. Two such documents differ from each other in a very small portion that displays advertisements, for example. Such differences are irrele...
Gurmeet Singh Manku, Arvind Jain, Anish Das Sarma
ACHI
2009
IEEE
16 years 1 months ago
Model-Driven Instrumentation of Graphical User Interfaces
In today’s continuously changing markets, newly developed products often do not meet the demands and expectations of customers. Research on this problem identified a large gap b...
Mathias Funk, Philip Hoyer, Stefan Link
DASFAA
2009
IEEE
16 years 1 months ago
Probabilistic Ranking in Uncertain Vector Spaces
In many application domains, e.g., sensor databases, traffic management or recognition systems, objects have to be compared based on positionally and existentially uncert...
Thomas Bernecker, Hans-Peter Kriegel, Matthias Ren...
PAKDD
2009
ACM
16 years 1 months ago
On Mining Rating Dependencies in Online Collaborative Rating Networks
The trend of social information processing sees e-commerce and social web applications increasingly relying on user-generated content, such as rating, to determine the quality of o...
Hady Wirawan Lauw, Ee-Peng Lim, Ke Wang
ICDM
2009
IEEE
16 years 1 months ago
Hierarchical Probabilistic Segmentation of Discrete Events
Segmentation, the task of splitting a long sequence of discrete symbols into chunks, can provide important information about the nature of the sequence that is understandable to...
Guy Shani, Christopher Meek, Asela Gunawardana