Sciweavers

4560 search results - page 263 / 912
» Finding Data in the Neighborhood
Sort
View
FAST
2011
14 years 10 months ago
A Study of Practical Deduplication
We collected file system content data from 857 desktop computers at Microsoft over a span of 4 weeks. We analyzed the data to determine the relative efficacy of data deduplication...
Dutch T. Meyer, William J. Bolosky
GRAPHITE
2003
ACM
15 years 12 months ago
A framework for a dynamic interactive 3D GIS for non-expert users
Many substantial geographic information systems (GIS) have been designed for use by expert users. As a result, nonexpert users often find them difficult to use. This paper present...
Arron R. Walker, Binh Pham, Anthony J. Maeder
EMNLP
2006
15 years 8 months ago
Random Indexing using Statistical Weight Functions
Random Indexing is a vector space technique that provides an efficient and scalable approximation to distributional similarity problems. We present experiments showing Random Inde...
James Gorman, James R. Curran
NAACL
2010
15 years 4 months ago
The Effect of Ambiguity on the Automated Acquisition of WSD Examples
Several methods for automatically generating labeled examples that can be used as training data for WSD systems have been proposed, including a semisupervised approach based on re...
Mark Stevenson, Yikun Guo
AUSDM
2006
Springer
110views Data Mining» more  AUSDM 2006»
15 years 10 months ago
Discovering Debtor Patterns of Centrelink Customers
Data mining is currently becoming an increasingly hot research field, but a large gap still remains between the research of data mining and its application in real-world business....
Yanchang Zhao, Longbing Cao, Yvonne Morrow, Yuming...