Search Sciweavers | Sciweavers

4035 search results - page 388 / 807

» Useless Actions Are Useful


IJMMSC
2010

67 views

Universal Verma Modules and the Misra-Miwa Fock Space

15 years 1 month ago

Download www-math.mit.edu

The Misra-Miwa v-deformed Fock space is a representation of the quantized affine algebra $U_v(\widehat{\mathfrak{sl}}_\ell)$. It has a standard basis indexed by partitions and the non-zero matrix entries of ...

Arun Ram, Peter Tingley



AAAI
2011

128 views, Intelligent Agents

Termination and Correctness Analysis of Cyclic Control

14 years 6 months ago

Download rbr.cs.umass.edu

The utility of including cyclic flow of control in plans has been long recognized by the planning community. Loops in a plan increase both its applicability and the compactness o...

Siddharth Srivastava, Neil Immerman, Shlomo Zilber...



ATAL
2009
Springer

137 views, Intelligent Agents

Generalized model learning for reinforcement learning in factored domains

16 years 1 months ago

Download userweb.cs.utexas.edu

Improving the sample efficiency of reinforcement learning algorithms to scale up to larger and more realistic domains is a current research challenge in machine learning. Model-ba...

Todd Hester, Peter Stone



ATAL
2005
Springer

181 views, Intelligent Agents

Improving reinforcement learning function approximators via neuroevolution

16 years 11 days ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson



HICSS
2003
IEEE

207 views, Biometrics

Formalizing Multi-Agent POMDP's in the context of network routing

16 years 3 days ago

Download www.hicss.hawaii.edu

This paper uses partially observable Markov decision processes (POMDP’s) as a basic framework for MultiAgent planning. We distinguish three perspectives: first one is that of a...

Bharaneedharan Rathnasabapathy, Piotr J. Gmytrasie...


