Search Sciweavers | Sciweavers

2095 search results - page 139 / 419

» Improved pebbling bounds

COLT
2008
Springer

96 views, Machine Learning

The True Sample Complexity of Active Learning

15 years 8 months ago

Download www.cs.cmu.edu

We describe and explore a new perspective on the sample complexity of active learning. In many situations where it was generally believed that active learning does not help, we sh...

Maria-Florina Balcan, Steve Hanneke, Jennifer Wort...

AAAI
2006

121 views, Intelligent Agents

Focused Real-Time Dynamic Programming for MDPs: Squeezing More Out of a Heuristic

15 years 7 months ago

Download www.cs.cmu.edu

Real-time dynamic programming (RTDP) is a heuristic search algorithm for solving MDPs. We present a modified algorithm called Focused RTDP with several improvements. While RTDP ma...

Trey Smith, Reid G. Simmons

AIPS
2004

102 views, Artificial Intelligence

Incremental Maximum Flows for Fast Envelope Computation

15 years 7 months ago

Download www.aaai.org

Resource envelopes provide the tightest exact bounds on the resource consumption and production caused by all possible executions of a temporally flexible plan. We present a new c...

Nicola Muscettola

AAAI
1996

149 views, Intelligent Agents

Forward Estimation for Game-Tree Search

15 years 7 months ago

Download www.aaai.org

It is known that bounds on the minimax values of nodes in a game tree can be used to reduce the computational complexity of minimax search for two-player games. We describe a very...

Weixiong Zhang

ICML
2010
IEEE

167 views, Machine Learning

Internal Rewards Mitigate Agent Boundedness

15 years 7 months ago

Download www-personal.umich.edu

Reinforcement learning (RL) research typically develops algorithms for helping an RL agent best achieve its goals, however they came to be defined, while ignoring the rela...

Jonathan Sorg, Satinder P. Singh, Richard Lewis
