We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...
J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...
Object detection and tracking has various application areas, including intelligent transportation systems. We introduce an object detection and tracking approach that combines the ...
This paper presents an integrated image registration algorithm to correct the motion induced by patient breathing for dynamic renal perfusion MR images. Registration of kidneys th...
Reinforcement learning algorithms can become unstable when combined with linear function approximation. Algorithms that minimize the mean-square Bellman error are guaranteed to co...
This paper presents a dynamic and distributed reconfiguration planning algorithm for chain-type self-reconfigurable robots, by which a robot can autonomously self-reconfigure fr...