Sciweavers

8312 search results - page 428 / 1663
» Performance data collection using a hybrid approach
SIGIR
2006
ACM
16 years 24 days ago
Near-duplicate detection by instance-level constrained clustering
For the task of near-duplicate document detection, both traditional fingerprinting techniques used in the database community and bag-of-words comparison approaches used in information...
Hui Yang, James P. Callan
BMCBI
2010
15 years 7 months ago
Asymmetric microarray data produces gene lists highly predictive of research literature on multiple cancer types
Background: Much of the public access cancer microarray data is asymmetric, belonging to datasets containing no samples from normal tissue. Asymmetric data cannot be used in stand...
Noor B. Dawany, Aydin Tozeren
PR
2007
15 years 6 months ago
Newtonian clustering: An approach based on molecular dynamics and global optimization
Given a data set, a dynamical procedure is applied to the data points in order to shrink and separate possibly overlapping clusters. Namely, Newton’s equations of motion are em...
Konstantinos Blekas, Isaac E. Lagaris
SIGMOD
2011
ACM
221 views · Database
14 years 9 months ago
Scalable query rewriting: a graph-based approach
In this paper we consider the problem of answering queries using views, which is important for data integration, query optimization, and data warehouses. We consider its simplest ...
George Konstantinidis, José Luis Ambite
SSDBM
2010
IEEE
153 views · Database
15 years 12 months ago
Scalable Clustering Algorithm for N-Body Simulations in a Shared-Nothing Cluster
Scientists’ ability to generate and collect massive-scale datasets is increasing. As a result, constraints in data analysis capability rather than limitations in the av...
YongChul Kwon, Dylan Nunley, Jeffrey P. Gardner, M...