Sciweavers

4446 search results - page 478 / 890
» Learning Observer Agents
Sort
View
AAAI
2008
15 years 9 months ago
Multi-HDP: A Non Parametric Bayesian Model for Tensor Factorization
Matrix factorization algorithms are frequently used in the machine learning community to find low dimensional representations of data. We introduce a novel generative Bayesian pro...
Ian Porteous, Evgeniy Bart, Max Welling
ATAL
2006
Springer
15 years 8 months ago
Can good learners always compensate for poor learners?
Can a good learner compensate for a poor learner when paired in a coordination game? Previous work has given an example where a special learning algorithm (FMQ) is capable of doin...
Keith Sullivan, Liviu Panait, Gabriel Catalin Bala...
ALT
2010
Springer
15 years 8 months ago
Consistency of Feature Markov Processes
We are studying long term sequence prediction (forecasting). We approach this by investigating criteria for choosing a compact useful state representation. The state is supposed t...
Peter Sunehag, Marcus Hutter
AAAI
2010
15 years 8 months ago
Relative Entropy Policy Search
Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...
Jan Peters, Katharina Mülling, Yasemin Altun
AAAI
2004
15 years 8 months ago
A Correspondence Metric for Imitation
Abstract-- Learning by imitation and learning from demonstration have received considerable attention in robotics. However, very little research has been in the direction of provid...
R. Amit, Maja J. Mataric