Given an adequate simulation model of the task environment and a payoff function that measures the quality of partially successful plans, competition-based heuristics such as geneti...
In recent years, many educational systems have incorporated agents, such as pedagogical agents and peer agents, as a means to realize the teaching, coaching and suppo...
This paper introduces a generic theoretical framework for predictive learning, and relates it to data-driven and learning applications in earth and environmental sciences. The iss...
Vladimir Cherkassky, Vladimir M. Krasnopolsky, Dim...
Since the fuzzy cerebellar model articulation controller (FCMAC) uses linguistic variables, it is highly intuitive and easily comprehended. Despite the FCMAC's good ...
Wen Yu, Floriberto Ortiz Rodriguez, Marco A. Moren...
Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...
Frank Sehnke, Alex Graves, Christian Osendorfer, J...