Search Sciweavers | Sciweavers

829 search results - page 97 / 166

» A time aggregation approach to Markov decision processes

147

click to vote

DATE
2002
IEEE

86views Hardware» more DATE 2002»

A Layered, Codesign Virtual Machine Approach to Modeling Computer Systems

15 years 11 months ago

Download www.date-conference.com

By using a macro/micro state model we show how assumptions on the resolution of logical and physical timing of computation in computer systems has resulted in design methodologies...

JoAnn M. Paul, Donald E. Thomas

claim paper

Read More »

167

click to vote

ICRA
2007
IEEE

155views Robotics» more ICRA 2007»

Value Function Approximation on Non-Linear Manifolds for Robot Motor Control

16 years 15 days ago

Download sugiyama-www.cs.titech.ac.jp

— The least squares approach works efﬁciently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...

Masashi Sugiyama, Hirotaka Hachiya, Christopher To...

claim paper

Read More »

152

click to vote

AIPS
2008

151views Artificial Intelligence» more AIPS 2008»

Criticality Metrics for Distributed Plan and Schedule Management

15 years 8 months ago

Download www.aaai.org

We address the problem of coordinating the plans and schedules for a team of agents in an uncertain and dynamic environment. Bounded rationality, bounded communication, subjectivi...

Rajiv T. Maheswaran, Pedro A. Szekely

claim paper

Read More »

184

click to vote

IJCAI
2007

254views Artificial Intelligence» more IJCAI 2007»

Bayesian Inverse Reinforcement Learning

15 years 7 months ago

Download www.ijcai.org

Inverse Reinforcement Learning (IRL) is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an e...

Deepak Ramachandran, Eyal Amir

claim paper

Read More »

152

click to vote

IJCAI
2003

130views Artificial Intelligence» more IJCAI 2003»

Multiple-Goal Reinforcement Learning with Modular Sarsa(0)

15 years 7 months ago

Download www.cc.gatech.edu

We present a new algorithm, GM-Sarsa(0), for ﬁnding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processe...

Nathan Sprague, Dana H. Ballard

claim paper

Read More »

« Prev « First page 97 / 166 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers