Sciweavers

1445 search results - page 185 / 289
» Properties of state spaces and their applications
Sort
View
AAAI
2007
15 years 8 months ago
Thresholded Rewards: Acting Optimally in Timed, Zero-Sum Games
In timed, zero-sum games, the goal is to maximize the probability of winning, which is not necessarily the same as maximizing our expected reward. We consider cumulative intermedi...
Colin McMillen, Manuela M. Veloso
ATAL
2010
Springer
15 years 7 months ago
Augmenting appearance-based localization and navigation using belief update
Appearance-based localization compares the current image taken from a robot's camera to a set of pre-recorded images in order to estimate the current location of the robot. S...
George Chrysanthakopoulos, Guy Shani
CORR
2011
Springer
210views Education» more  CORR 2011»
15 years 1 months ago
Online Learning of Rested and Restless Bandits
In this paper we study the online learning problem involving rested and restless multiarmed bandits with multiple plays. The system consists of a single player/user and a set of K...
Cem Tekin, Mingyan Liu
PKDD
2009
Springer
152views Data Mining» more  PKDD 2009»
16 years 27 days ago
Feature Selection for Value Function Approximation Using Bayesian Model Selection
Abstract. Feature selection in reinforcement learning (RL), i.e. choosing basis functions such that useful approximations of the unkown value function can be obtained, is one of th...
Tobias Jung, Peter Stone
CDC
2008
IEEE
143views Control Systems» more  CDC 2008»
16 years 25 days ago
A nonlinear, control-oriented model for ionic polymer-metal composite actuators
Ionic polymer-metal composites (IPMCs) form an important category of electroactive polymers and have many potential applications in biomedical, robotic and micro/nano manipulation ...
Zheng Chen, Dawn R. Hedgepeth, Xiaobo Tan