Sciweavers

2327 search results - page 80 / 466
» Consistency of functional learning methods based on derivati...
Sort
View
IJCNN
2008
IEEE
16 years 21 days ago
Uncertainty propagation for quality assurance in Reinforcement Learning
— In this paper we address the reliability of policies derived by Reinforcement Learning on a limited amount of observations. This can be done in a principled manner by taking in...
Daniel Schneegaß, Steffen Udluft, Thomas Mar...
DEXAW
2008
IEEE
185views Database» more  DEXAW 2008»
16 years 23 days ago
Axiom-Based Feedback Cycle for Relation Extraction in Ontology Learning from Text
—The ontology learning from text cycle consists of the consecutive phases of term, synonym, concept, taxonomy and relation extraction. In this paper, a proposal towards the unsup...
Witold Abramowicz, Maria Vargas-Vera, Marek Wisnie...
JEI
2006
110views more  JEI 2006»
15 years 6 months ago
Empirical formula for creating error bars for the method of paired comparison
The method of paired comparison based on Thurstone's Case V of his Law of Comparative Judgments is often used as a psychophysical method to derive interval scales of perceptua...
Ethan D. Montag
ICANN
2003
Springer
15 years 11 months ago
Meta-learning for Fast Incremental Learning
Model based learning systems usually face to a problem of forgetting as a result of the incremental learning of new instances. Normally, the systems have to re-learn past instances...
Takayuki Oohira, Koichiro Yamauchi, Takashi Omori
NIPS
2008
15 years 7 months ago
Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation
Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g.,...
Dotan Di Castro, Dmitry Volkinshtein, Ron Meir