Search Sciweavers | Sciweavers

3491 search results - page 270 / 699

» Cascaded Markov Models

NMR
2004
Springer

216 views, Automated Reasoning

Probabilistic reasoning in dynamic multiagent systems

16 years 3 days ago

Download events.pims.math.ca

Probabilistic reasoning with multiply sectioned Bayesian networks (MSBNs) has been successfully applied in static domains under the cooperative multiagent paradigm. Probabilistic ...

Xiangdong An, Yang Xiang, Nick Cercone

SIGECOM
2003
ACM

134 views, ECommerce

16 years 5 hours ago

Correlated equilibria in graphical games

Download www.cis.upenn.edu

We examine correlated equilibria in the recently introduced formalism of graphical games, a succinct representation for multiplayer games. We establish a natural and powerful rela...

Sham Kakade, Michael J. Kearns, John Langford, Lui...

COLT
1994
Springer

111 views, Machine Learning

Learning Probabilistic Automata with Variable Memory Length

15 years 11 months ago

Download www.cs.huji.ac.il

We propose and analyze a distribution learning algorithm for variable memory length Markov processes. These processes can be described by a subclass of probabilistic finite automata...

Dana Ron, Yoram Singer, Naftali Tishby

UAI
2000

133 views, Artificial Intelligence

PEGASUS: A policy search method for large MDPs and POMDPs

15 years 8 months ago

Download ai.stanford.edu

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...

Andrew Y. Ng, Michael I. Jordan

CORR
2010
Springer

105 views, Education

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 5 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...
