Sciweavers

4035 search results - page 302 / 807
» Useless Actions Are Useful
ICML
2010
IEEE
15 years 7 months ago
Convergence of Least Squares Temporal Difference Methods Under General Conditions
We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...
Huizhen Yu
AIPS
2010
15 years 6 months ago
Coming Up With Good Excuses: What to do When no Plan Can be Found
When using a planner-based agent architecture, many things can go wrong. First and foremost, an agent might fail to execute one of the planned actions for some reason. Even more ...
Moritz Göbelbecker, Thomas Keller, Patrick Ey...
CORR
2010
Springer
15 years 6 months ago
MDPs with Unawareness
Markov decision processes (MDPs) are widely used for modeling decision-making problems in robotics, automated control, and economics. Traditional MDPs assume that the decision mak...
Joseph Y. Halpern, Nan Rong, Ashutosh Saxena
ENTCS
2008
15 years 6 months ago
Modelling Devices for Natural Interaction
We do not interact with systems without first performing some physical action on a physical device. This paper shows how formal notations and formal models can be developed to acc...
Alan J. Dix, Masitah Ghazali, Devina Ramduny-Ellis
ENGL
2006
15 years 6 months ago
An Integral Plus States Adaptive Neural Control of Aerobic Continuous Stirred Tank Reactor
A direct adaptive neural network control system, with and without an integral action term, is designed for the general class of continuous biological fermentation processes. The control...
Ieroham S. Baruch, Petia Georgieva, Josefina Barre...