Search Sciweavers | Sciweavers

2914 search results - page 336 / 583

» Optimization by Stochastic Continuation

WSC
2008

214 views, Modeling And Simulation

Approximate dynamic programming: Lessons from the field

15 years 9 months ago

Download www.informs-sim.org

Approximate dynamic programming is emerging as a powerful tool for certain classes of multistage stochastic, dynamic problems that arise in operations research. It has been applie...

Warren B. Powell

COLT
2008
Springer

140 views, Machine Learning

Regret Bounds for Sleeping Experts and Bandits

15 years 8 months ago

Download colt2008.cs.helsinki.fi

We study on-line decision problems where the set of actions that are available to the decision algorithm vary over time. With a few notable exceptions, such problems remained larg...

Robert D. Kleinberg, Alexandru Niculescu-Mizil, Yo...

FPGA
2008
ACM

191 views, FPGA

A hardware framework for the fast generation of multiple long-period random number streams

15 years 8 months ago

Download cuaa.cooper.edu

Stochastic simulations and other scientific applications that depend on random numbers are increasingly implemented in a parallelized manner in programmable logic. High-quality ps...

Ishaan L. Dalal, Deian Stefan

AAAI
2010

171 views, Intelligent Agents

Reinforcement Learning via AIXI Approximation

15 years 8 months ago

Download jveness.info

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. This approach is based on a direct approximation of AIXI, a Bayesian...

Joel Veness, Kee Siong Ng, Marcus Hutter, David Si...

DAGSTUHL
2007

107 views, Software Engineering

Learning Probabilistic Relational Dynamics for Multiple Tasks

15 years 8 months ago

Download people.csail.mit.edu

The ways in which an agent’s actions affect the world can often be modeled compactly using a set of relational probabilistic planning rules. This paper addresses the problem of ...

Ashwin Deshpande, Brian Milch, Luke S. Zettlemoyer...
