Search Sciweavers | Sciweavers

3874 search results - page 517 / 775

» Approximation Algorithms for k-hurdle Problems

200

click to vote

AGENTS
2001
Springer

201views Security Privacy» more AGENTS 2001»

Using background knowledge to speed reinforcement learning in physical agents

15 years 11 months ago

Download www.isle.org

This paper describes Icarus, an agent architecture that embeds a hierarchical reinforcement learning algorithm within a language for specifying agent behavior. An Icarus program e...

Daniel G. Shapiro, Pat Langley, Ross D. Shachter

claim paper

Read More »

169

click to vote

AAAI
2007

126views Intelligent Agents» more AAAI 2007»

Point-Based Policy Iteration

15 years 9 months ago

Download www.cs.duke.edu

We describe a point-based policy iteration (PBPI) algorithm for inﬁnite-horizon POMDPs. PBPI replaces the exact policy improvement step of Hansen’s policy iteration with point...

Shihao Ji, Ronald Parr, Hui Li, Xuejun Liao, Lawre...

claim paper

Read More »

155

click to vote

COMPGEOM
2005
ACM

123views Discrete Geometry» more COMPGEOM 2005»

1-link shortest paths in weighted regions

15 years 8 months ago

Download www.tiger-marmalade.com

We illustrate the Link Solver software for computing 1-link shortest paths in weighted regions. The Link Solver implements a prune-and-search method that can be used to approximat...

Ovidiu Daescu, James D. Palmer

claim paper

Read More »

165

click to vote

AAAI
2010

171views Intelligent Agents» more AAAI 2010»

Multi-Agent Learning with Policy Prediction

15 years 8 months ago

Download www.cs.umass.edu

Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting th...

Chongjie Zhang, Victor R. Lesser

claim paper

Read More »

163

click to vote

DAGSTUHL
2007

152views Software Engineering» more DAGSTUHL 2007»

An Inner/Outer Stationary Iteration for Computing PageRank

15 years 8 months ago

Download drops.dagstuhl.de

We present a stationary iterative scheme for PageRank computation. The algorithm is based on a linear system formulation of the problem, uses inner/outer iterations, and amounts to...

Andrew P. Gray, Chen Greif, Tracy Lau

claim paper

Read More »

« Prev « First page 517 / 775 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers