Sciweavers

2566 search results - page 236 / 514
» Relating reinforcement learning performance to classificatio...
Sort
View
184
Voted
SASO
2009
IEEE
16 years 1 months ago
Distributed W-Learning: Multi-Policy Optimization in Self-Organizing Systems
—Large-scale agent-based systems are required to self-optimize towards multiple, potentially conflicting, policies of varying spatial and temporal scope. As a result, not all ag...
Ivana Dusparic, Vinny Cahill
PKDD
2009
Springer
152views Data Mining» more  PKDD 2009»
16 years 1 months ago
Feature Selection for Value Function Approximation Using Bayesian Model Selection
Abstract. Feature selection in reinforcement learning (RL), i.e. choosing basis functions such that useful approximations of the unkown value function can be obtained, is one of th...
Tobias Jung, Peter Stone
IROS
2007
IEEE
172views Robotics» more  IROS 2007»
16 years 1 months ago
Motor control optimization of compliant one-legged locomotion in rough terrain
— While underactuated robotic systems are capable of energy efficient and rapid dynamic behavior, we still do not fully understand how body dynamics can be actively used for ada...
Fumiya Iida, Russ Tedrake
NETCOOP
2007
Springer
16 years 26 days ago
Load Shared Sequential Routing in MPLS Networks: System and User Optimal Solutions
Recently Gerald Ash has shown through case studies that event dependent routing is attractive in large scale multi-service MPLS networks. In this paper, we consider the application...
Gilles Brunet, Fariba Heidari, Lorne Mason
GECCO
2005
Springer
162views Optimization» more  GECCO 2005»
16 years 7 days ago
An autonomous explore/exploit strategy
In reinforcement learning problems it has been considered that neither exploitation nor exploration can be pursued exclusively without failing at the task. The optimal balance bet...
Alex McMahon, Dan Scott, William N. L. Browne