Sciweavers

4874 search results - page 603 / 975
» Information theory for data management
KDD
2008
ACM
16 years 7 months ago
Unsupervised deduplication using cross-field dependencies
Recent work in deduplication has shown that collective deduplication of different attribute types can improve performance. But although these techniques cluster the attributes col...
Robert Hall, Charles A. Sutton, Andrew McCallum
KDD
2005
ACM
16 years 7 months ago
Evaluating similarity measures: a large-scale study in the orkut social network
Online information services have grown too large for users to navigate without the help of automated tools such as collaborative filtering, which makes recommendations to users ba...
Ellen Spertus, Mehran Sahami, Orkut Buyukkokten
ITNG
2007
IEEE
16 years 1 month ago
Practical Challenges Facing Communities of Interest in the Net-Centric Department of Defense
The United States Department of Defense (DoD) – one of the world’s largest heterogeneous and distributed enterprises – is transforming its information management and sharing...
C. L. Connors, M. A. Malloy
ITICSE
2004
ACM
16 years 7 days ago
Use of large databases for group projects at the nexus of teaching and research
Final year, group (capstone) projects in computing disciplines are often expected to fill multiple roles: in addition to allowing students to learn important domain-specific knowl...
Richard C. Thomas, Rebecca Mancy
MKM
2004
Springer
16 years 4 days ago
A Graph-Based Approach Towards Discerning Inherent Structures in a Digital Library of Formal Mathematics
As the amount of online formal mathematical content grows, for example through active efforts such as the Mathweb [21], MOWGLI [4], Formal Digital Library, or FDL [1], and others, ...
Lori Lorigo, Jon M. Kleinberg, Richard Eaton, Robe...