Sciweavers

1974 search results - page 215 / 395
» Online learning in online auctions
Sort
View
ICML
2007
IEEE
16 years 7 months ago
Reinforcement learning by reward-weighted regression for operational space control
Many robot control problems of practical importance, including operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of ...
Jan Peters, Stefan Schaal
ICML
2009
IEEE
16 years 7 months ago
Proto-predictive representation of states with simple recurrent temporal-difference networks
We propose a new neural network architecture, called Simple Recurrent Temporal-Difference Networks (SR-TDNs), that learns to predict future observations in partially observable en...
Takaki Makino
ACE
2004
188views Education» more  ACE 2004»
15 years 8 months ago
A Computing Education Vision for the Sight Impaired
Vision is the main sensory modality employed in learning. Teaching materials in the areas of information technology and computer engineering are highly visual in nature and vision...
Iain Murray, Helen Armstrong
ML
2002
ACM
100views Machine Learning» more  ML 2002»
15 years 6 months ago
Structure in the Space of Value Functions
Solving in an efficient manner many different optimal control tasks within the same underlying environment requires decomposing the environment into its computationally elemental ...
David J. Foster, Peter Dayan
ICCV
2007
IEEE
16 years 8 months ago
Linear Predictors for Fast Simultaneous Modeling and Tracking
An approach for fast tracking of arbitrary image features with no prior model and no offline learning stage is presented. Fast tracking is achieved using banks of linear displacem...
Liam Ellis, Nicholas Dowson, Jiri Matas, Richard B...