Sciweavers

2277 search results - page 374 / 456
» Clustering by pattern similarity in large data sets
Sort
View
BTW
2003
Springer
103views Database» more  BTW 2003»
15 years 11 months ago
XPath-Aware Chunking of XML-Documents
Dissemination systems are used to route information received from many publishers individually to multiple subscribers. The core of a dissemination system consists of an efficient...
Wolfgang Lehner, Florian Irmert
ICDM
2009
IEEE
112views Data Mining» more  ICDM 2009»
16 years 29 days ago
Resolving Identity Uncertainty with Learned Random Walks
A pervasive problem in large relational databases is identity uncertainty which occurs when multiple entries in a database refer to the same underlying entity in the world. Relati...
Ted Sandler, Lyle H. Ungar, Koby Crammer
COMSWARE
2007
IEEE
16 years 19 days ago
Scalable Multicast Platforms for a New Generation of Robust Distributed Applications
1 As distributed systems scale up and are deployed into increasingly sensitive settings, demand is rising for a new generation of communications middleware in support of applicati...
Ken Birman, Mahesh Balakrishnan, Danny Dolev, Tudo...
EUROSYS
2007
ACM
16 years 3 months ago
Dryad: distributed data-parallel programs from sequential building blocks
Dryad is a general-purpose distributed execution engine for coarse-grain data-parallel applications. A Dryad application combines computational “vertices” with communication ...
Michael Isard, Mihai Budiu, Yuan Yu, Andrew Birrel...
SEMCO
2007
IEEE
16 years 17 days ago
Cross-Genre Feature Comparisons for Spoken Sentence Segmentation
Automatic sentence segmentation of spoken language is an important precursor to downstream natural language processing. Previous studies combine lexical and prosodic features, but...
Sébastien Cuendet, Dilek Z. Hakkani-Tü...