Sciweavers

» Useless Actions Are Useful
IJMMSC
2010
15 years 1 months ago
Universal Verma Modules and the Misra-Miwa Fock Space
The Misra-Miwa v-deformed Fock space is a representation of the quantized affine algebra $U_v(\widehat{\mathfrak{sl}}_\ell)$. It has a standard basis indexed by partitions and the non-zero matrix entries of ...
Arun Ram, Peter Tingley
AAAI
2011
14 years 6 months ago
Termination and Correctness Analysis of Cyclic Control
The utility of including cyclic flow of control in plans has long been recognized by the planning community. Loops in a plan increase both its applicability and the compactness o...
Siddharth Srivastava, Neil Immerman, Shlomo Zilber...
ATAL
2009
Springer
16 years 1 months ago
Generalized model learning for reinforcement learning in factored domains
Improving the sample efficiency of reinforcement learning algorithms to scale up to larger and more realistic domains is a current research challenge in machine learning. Model-ba...
Todd Hester, Peter Stone
ATAL
2005
Springer
16 years 11 days ago
Improving reinforcement learning function approximators via neuroevolution
Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...
Shimon Whiteson
HICSS
2003
IEEE
16 years 3 days ago
Formalizing Multi-Agent POMDP's in the context of network routing
This paper uses partially observable Markov decision processes (POMDPs) as a basic framework for multi-agent planning. We distinguish three perspectives: the first is that of a...
Bharaneedharan Rathnasabapathy, Piotr J. Gmytrasie...