Search Sciweavers | Sciweavers

3412 search results - page 335 / 683

» Efficient Reinforcement Learning

JMLR
2008

108 views

A Recursive Method for Structural Learning of Directed Acyclic Graphs

15 years 6 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a recursive method for structural learning of directed acyclic graphs (DAGs), in which a problem of structural learning for a large DAG is first decompos...

Xianchao Xie, Zhi Geng

AGENTS
2000
Springer

119 views | Security Privacy

Adaptivity in agent-based routing for data networks

15 years 11 months ago

Download web.engr.oregonstate.edu

Adaptivity, both of the individual agents and of the interaction structure among the agents, seems indispensable for scaling up multi-agent systems (MAS's) in noisy environme...

David Wolpert, Sergey Kirshner, Christopher J. Mer...

ATAL
2008
Springer

180 views | Intelligent Agents

On the usefulness of opponent modeling: the Kuhn Poker case study

15 years 8 months ago

Download www.ifaamas.org

The application of reinforcement learning algorithms to Partially Observable Stochastic Games (POSG) is challenging since each agent does not have access to the whole state inform...

Alessandro Lazaric, Mario Quaresimale, Marcello Re...

NIPS
1993

128 views | Information Technology

Convergence of Stochastic Iterative Dynamic Programming Algorithms

15 years 8 months ago

Download www.bitsavers.org

Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms, includ...

Tommi Jaakkola, Michael I. Jordan, Satinder P. Sin...

PEPM
2011
ACM

210 views | Software Engineering

Adaptation-based programming in Java

14 years 9 months ago

Download web.engr.oregonstate.edu

Writing deterministic programs is often difficult for problems whose optimal solutions depend on unpredictable properties of the programs' inputs. Difficulty is also encounter...

Tim Bauer, Martin Erwig, Alan Fern, Jervis Pinto
