Results of queries by personal names often contain documents related to several people because of the namesake problem. In order to differentiate documents related to different...
Open source software is often considered to be secure because large developer communities can be leveraged to find and fix security vulnerabilities. Eric Raymond states Linus’ L...
Software clustering is an established approach to automatic architecture recovery. It groups components that are in some way similar to each other. Usually, the similarit...
The Signature Quadratic Form Distance is an adaptive similarity measure for flexible content-based feature representations of multimedia data. In this paper, we present a deep su...
We present a nonparametric Bayesian model for multi-task learning, with a focus on feature selection in binary classification. The model jointly identifies groups of similar tas...