Search Sciweavers | Sciweavers

515 search results - page 56 / 103

» Approximating Markov Processes by Averaging

225

click to vote

NIPS
2007

207views Information Technology» more NIPS 2007»

Bayes-Adaptive POMDPs

15 years 7 months ago

Download books.nips.cc

Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elegant solution to the exploration-exploitation trade-off in reinforcement learning...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

174

click to vote

TSP
2008

151views more TSP 2008»

Convergence Analysis of Reweighted Sum-Product Algorithms

15 years 6 months ago

Download stat-www.berkeley.edu

Markov random fields are designed to represent structured dependencies among large collections of random variables, and are well-suited to capture the structure of real-world sign...

Tanya Roosta, Martin J. Wainwright, Shankar S. Sas...

claim paper

Read More »

167

click to vote

GLOBECOM
2007
IEEE

204views Communications» more GLOBECOM 2007»

ARMA Synthesis of Fading Channels- an Application to the Generation of Dynamic MIMO Channels

16 years 13 days ago

Download post.queensu.ca

— Adaptive transceivers play an important role in wireless communications and the design of MIMO systems. Therefore models that enable simulation of dynamic and time varying chan...

Hani Mehrpouyan, Steven D. Blostein

claim paper

Read More »

173

click to vote

ICML
2007
IEEE

139views Machine Learning» more ICML 2007»

Learning state-action basis functions for hierarchical MDPs

16 years 7 months ago

Download www.machinelearning.org

This paper introduces a new approach to actionvalue function approximation by learning basis functions from a spectral decomposition of the state-action manifold. This paper exten...

Sarah Osentoski, Sridhar Mahadevan

claim paper

Read More »

155

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

16 years 7 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

« Prev « First page 56 / 103 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers