Search Sciweavers | Sciweavers

4894 search results - page 224 / 979

» The Guarding Problem - Complexity and Approximation

199

click to vote

AAAI
2006

161views Intelligent Agents» more AAAI 2006»

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

15 years 8 months ago

Download staff.science.uva.nl

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...

Shimon Whiteson, Peter Stone

claim paper

Read More »

189

click to vote

ATAL
2010
Springer

128views Intelligent Agents» more ATAL 2010»

Approximate dynamic programming with affine ADDs

15 years 1 months ago

Download eprints.pascal-network.org

The Affine ADD (AADD) is an extension of the Algebraic Decision Diagram (ADD) that compactly represents context-specific, additive and multiplicative structure in functions from a...

Scott Sanner, William T. B. Uther, Karina Valdivia...

claim paper

Read More »

164

click to vote

WADS
2005
Springer

132views Algorithms» more WADS 2005»

Communication-Aware Processor Allocation for Supercomputers

16 years 18 hour ago

Download www.cs.sunysb.edu

Abstract. We give processor-allocation algorithms for grid architectures, where the objective is to select processors from a set of available processors to minimize the average num...

Michael A. Bender, David P. Bunde, Erik D. Demaine...

claim paper

Read More »

184

click to vote

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

15 years 1 months ago

Download webee.technion.ac.il

Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

claim paper

Read More »

168

click to vote

SIAMNUM
2010

103views more SIAMNUM 2010»

Hybridization and Postprocessing Techniques for Mixed Eigenfunctions

15 years 1 months ago

Download www.rpi.edu

Abstract. We introduce hybridization and postprocessing techniques for the RaviartThomas approximation of second-order elliptic eigenvalue problems. Hybridization reduces the Ravia...

Bernardo Cockburn, Jayadeep Gopalakrishnan, F. Li,...

claim paper

Read More »

« Prev « First page 224 / 979 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers