Sciweavers

17688 search results - page 313 / 3538
» Data Set Balancing
Sort
View
FGR
2004
IEEE
133views Biometrics» more  FGR 2004»
15 years 10 months ago
Finding Temporal Patterns by Data Decomposition
We present a new unsupervised learning technique for the discovery of temporal clusters in large data sets. Our method performs hierarchical decomposition of the data to find stru...
David C. Minnen, Christopher Richard Wren
CORR
2010
Springer
117views Education» more  CORR 2010»
15 years 6 months ago
Reductions Between Expansion Problems
The Small-Set Expansion Hypothesis (Raghavendra, Steurer, STOC 2010) is a natural hardness assumption concerning the problem of approximating the edge expansion of small sets in g...
Prasad Raghavendra, David Steurer, Madhur Tulsiani
COMAD
2008
15 years 8 months ago
Disk-Based Sampling for Outlier Detection in High Dimensional Data
We propose an efficient sampling based outlier detection method for large high-dimensional data. Our method consists of two phases. In the first phase, we combine a "sampling...
Timothy de Vries, Sanjay Chawla, Pei Sun, Gia Vinh...
IPTPS
2003
Springer
15 years 11 months ago
SOMO: Self-Organized Metadata Overlay for Resource Management in P2P DHT
– In this paper, we first describe the concept of data overlay, which is a mechanism to implement arbitrary data structure on top of any structured P2P DHT. With this ion, we dev...
Zheng Zhang, Shuming Shi, Jing Zhu
JAIR
2008
173views more  JAIR 2008»
15 years 6 months ago
Creating Relational Data from Unstructured and Ungrammatical Data Sources
In order for agents to act on behalf of users, they will have to retrieve and integrate vast amounts of textual data on the World Wide Web. However, much of the useful data on the...
Matthew Michelson, Craig A. Knoblock