— In this paper we address the reliability of policies derived by Reinforcement Learning on a limited amount of observations. This can be done in a principled manner by taking in...
—The ontology learning from text cycle consists of the consecutive phases of term, synonym, concept, taxonomy and relation extraction. In this paper, a proposal towards the unsup...
Witold Abramowicz, Maria Vargas-Vera, Marek Wisnie...
The method of paired comparison based on Thurstone's Case V of his Law of Comparative Judgments is often used as a psychophysical method to derive interval scales of perceptua...
Model based learning systems usually face to a problem of forgetting as a result of the incremental learning of new instances. Normally, the systems have to re-learn past instances...
Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g.,...