Search Sciweavers | Sciweavers

2422 search results - page 311 / 485

» Security Policy Consistency

174

click to vote

CDC
2009
IEEE

132views Control Systems» more CDC 2009»

Q-learning and Pontryagin's Minimum Principle

15 years 11 months ago

Download www.stanford.edu

Abstract— Q-learning is a technique used to compute an optimal policy for a controlled Markov chain based on observations of the system controlled using a non-optimal policy. It ...

Prashant G. Mehta, Sean P. Meyn

claim paper

Read More »

192

click to vote

GECCO
2000
Springer

143views Optimization» more GECCO 2000»

A Genetic Algorithm for Automatically Designing Modular Reinforcement Learning Agents

15 years 10 months ago

Download www.cs.bham.ac.uk

Reinforcement learning (RL) is one of the machine learning techniques and has been received much attention as a new self-adaptive controller for various systems. The RL agent auto...

Isao Ono, Tetsuo Nijo, Norihiko Ono

claim paper

Read More »

161

click to vote

AAAI
2010

154views Intelligent Agents» more AAAI 2010»

Towards Multiagent Meta-level Control

15 years 8 months ago

Download coitweb.uncc.edu

Embedded systems consisting of collaborating agents capable of interacting with their environment are becoming ubiquitous. It is crucial for these systems to be able to adapt to t...

Shanjun Cheng, Anita Raja, Victor R. Lesser

claim paper

Read More »

191

click to vote

NIPS
2007

158views Information Technology» more NIPS 2007»

Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods

15 years 8 months ago

Download books.nips.cc

Learning in real-world domains often requires to deal with continuous state and action spaces. Although many solutions have been proposed to apply Reinforcement Learning algorithm...

Alessandro Lazaric, Marcello Restelli, Andrea Bona...

claim paper

Read More »

178

click to vote

ECAI
2010
Springer

219views Artificial Intelligence» more ECAI 2010»

Knowledge Compilation Using Interval Automata and Applications to Planning

15 years 6 months ago

Download www.cert.fr

Knowledge compilation [6, 5, 14, 8] consists in transforming a problem offline into a form which is tractable online. In this paper, we introduce new structures, based on the notio...

Alexandre Niveau, Hélène Fargier, C&...

claim paper

Read More »

« Prev « First page 311 / 485 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers