Search Sciweavers | Sciweavers

233 search results - page 23 / 47

» Composing and combining policies under the policy machine


ICML
2007
IEEE

159 views | Machine Learning

A novel orthogonal NMF-based belief compression for POMDPs

16 years 7 months ago

Download www.machinelearning.org

The high dimensionality of a POMDP's belief state space is one major reason the underlying optimal policy computation is intractable. Belief compression refers to the method...

Xin Li, William Kwok-Wai Cheung, Jiming Liu, Zhili...



HPDC
2010
IEEE

143 views | Distributed And Parallel Com...

Cluster-wide context switch of virtualized jobs

15 years 7 months ago

Download hal.inria.fr

Clusters are mostly used through Resource Management Systems (RMS) with a static allocation of resources for a bounded amount of time. Those approaches are known to be insufficie...

Fabien Hermenier, Adrien Lebre, Jean-Marc Menaud



GECCO
2000
Springer

143 views | Optimization

A Genetic Algorithm for Automatically Designing Modular Reinforcement Learning Agents

15 years 9 months ago

Download www.cs.bham.ac.uk

Reinforcement learning (RL) is a machine learning technique that has received much attention as a new self-adaptive controller for various systems. The RL agent auto...

Isao Ono, Tetsuo Nijo, Norihiko Ono



IPPS
2010
IEEE

153 views | Distributed And Parallel Com...

Structuring the execution of OpenMP applications for multicore architectures

15 years 4 months ago

Download hal.archives-ouvertes.fr

The now commonplace multi-core chips have introduced, by design, a deep hierarchy of memory and cache banks within parallel computers as a tradeoff between the user frien...

François Broquedis, Olivier Aumage, Brice G...



ICML
1998
IEEE

268 views | Machine Learning

The MAXQ Method for Hierarchical Reinforcement Learning

16 years 6 months ago

Download www.cs.ualberta.ca

This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...

Thomas G. Dietterich


