Search Sciweavers | Sciweavers

1912 search results - page 343 / 383

» Optimizing interconnection policies

184

click to vote

AAAI
2006

161views Intelligent Agents» more AAAI 2006»

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

15 years 7 months ago

Download staff.science.uva.nl

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...

Shimon Whiteson, Peter Stone

claim paper

Read More »

148

click to vote

IC
2004

106views Applied Computing» more IC 2004»

Improved Selective Acknowledgment Scheme for TCP

15 years 7 months ago

Download www.mcs.anl.gov

A selective acknowledgment (SACK) mechanism, combined with a selective repeat retransmission policy, has been proposed to overcome the limitations with the cumulative acknowledgme...

Rajkumar Kettimuthu, William E. Allcock

claim paper

Read More »

164

click to vote

NIPS
2001

153views Information Technology» more NIPS 2001»

Estimating Car Insurance Premia: a Case Study in High-Dimensional Data Inference

15 years 7 months ago

Download www.iro.umontreal.ca

Estimating insurance premia from data is a difficult regression problem for several reasons: the large number of variables, many of which are discrete, and the very peculiar shape...

Nicolas Chapados, Yoshua Bengio, Pascal Vincent, J...

claim paper

Read More »

191

click to vote

AAAI
1996

119views Intelligent Agents» more AAAI 1996»

Rewarding Behaviors

15 years 7 months ago

Download www.cs.toronto.edu

Markov decision processes (MDPs) are a very popular tool for decision theoretic planning (DTP), partly because of the welldeveloped, expressive theory that includes effective solu...

Fahiem Bacchus, Craig Boutilier, Adam J. Grove

claim paper

Read More »

142

click to vote

NIPS
1996

117views Information Technology» more NIPS 1996»

Reinforcement Learning for Mixed Open-loop and Closed-loop Control

15 years 7 months ago

Download anytime.cs.umass.edu

Closed-loop control relies on sensory feedback that is usually assumed to be free. But if sensing incurs a cost, it may be coste ective to take sequences of actions in open-loop m...

Eric A. Hansen, Andrew G. Barto, Shlomo Zilberstei...

claim paper

Read More »

« Prev « First page 343 / 383 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers