Sciweavers

4466 search results - page 274 / 894
» Large-Scale Data Analysis Using Heuristic Methods
Sort
View
SIGMOD
2005
ACM
123views Database» more  SIGMOD 2005»
16 years 6 days ago
To Do or Not To Do: The Dilemma of Disclosing Anonymized Data
Decision makers of companies often face the dilemma of whether to release data for knowledge discovery, vis a vis the risk of disclosing proprietary or sensitive information. Whil...
Laks V. S. Lakshmanan, Raymond T. Ng, Ganesh Rames...
SIGMOD
2005
ACM
119views Database» more  SIGMOD 2005»
16 years 6 months ago
DogmatiX Tracks down Duplicates in XML
Duplicate detection is the problem of detecting different entries in a data source representing the same real-world entity. While research abounds in the realm of duplicate detect...
Melanie Weis, Felix Naumann
TPDS
1998
112views more  TPDS 1998»
15 years 6 months ago
Parallel Computation in Biological Sequence Analysis
—A massive volume of biological sequence data is available in over 36 different databases worldwide, including the sequence data generated by the Human Genome project. These data...
Tieng K. Yap, Ophir Frieder, Robert L. Martino
BMCBI
2011
15 years 1 months ago
Pathway-based analysis using reduced gene subsets in genome-wide association studies
Background: Single Nucleotide Polymorphism (SNP) analysis only captures a small proportion of associated genetic variants in Genome-Wide Association Studies (GWAS) partly due to s...
Jingyuan Zhao, Simone Gupta, Mark Seielstad, Jianj...
ESANN
2006
15 years 8 months ago
Stochastic Processes for Canonical Correlation Analysis
We consider two stochastic process methods for performing canonical correlation analysis (CCA). The first uses a Gaussian Process formulation of regression in which we use the cur...
Colin Fyfe, Gayle Leen