Sciweavers

400 search results - page 47 / 80
» Sentiment Analysis and the Use of Extrinsic Datasets in Eval...
Sort
View
ESEM
2007
ACM
15 years 10 months ago
The Effects of Over and Under Sampling on Fault-prone Module Detection
The goal of this paper is to improve the prediction performance of fault-prone module prediction models (fault-proneness models) by employing over/under sampling methods, which ar...
Yasutaka Kamei, Akito Monden, Shinsuke Matsumoto, ...
I3
2007
15 years 7 months ago
Performing Object Consolidation on the Semantic Web Data Graph
An important aspect of Semantic Web technologies is the issue of identity and uniquely identifying resources, which is essential for integrating data across sources. Currently, th...
Aidan Hogan, Andreas Harth, Stefan Decker
ICDM
2009
IEEE
147views Data Mining» more  ICDM 2009»
15 years 3 months ago
Greedy Optimization for Contiguity-Constrained Hierarchical Clustering
The discovery and construction of inherent regions in large spatial datasets is an important task for many research domains such as climate zoning, eco-region analysis, public heal...
Diansheng Guo
145
Voted
IDA
2009
Springer
16 years 20 days ago
Bayesian Robust PCA for Incomplete Data
Abstract. We present a probabilistic model for robust principal component analysis (PCA) in which the observation noise is modelled by Student-t distributions that are independent ...
Jaakko Luttinen, Alexander Ilin, Juha Karhunen
BIODATAMINING
2008
96views more  BIODATAMINING 2008»
15 years 6 months ago
Fast approximate hierarchical clustering using similarity heuristics
Background: Agglomerative hierarchical clustering (AHC) is a common unsupervised data analysis technique used in several biological applications. Standard AHC methods require that...
Meelis Kull, Jaak Vilo