Search Sciweavers | Sciweavers

5333 search results - page 820 / 1067

» Optimizing Computer System Configurations

185

click to vote

GECCO
2010
Springer

153views Optimization» more GECCO 2010»

Multi-task evolutionary shaping without pre-specified representations

15 years 10 months ago

Download www.science.uva.nl

Shaping functions can be used in multi-task reinforcement learning (RL) to incorporate knowledge from previously experienced tasks to speed up learning on a new task. So far, rese...

Matthijs Snel, Shimon Whiteson

claim paper

Read More »

147

click to vote

AIIDE
2008

143views Artificial Intelligence» more AIIDE 2008»

A Cover-Based Approach to Multi-Agent Moving Target Pursuit

15 years 9 months ago

Download www.aaai.org

We explore the task of designing an efficient multi-agent system that is capable of capturing a single moving target, assuming that every agent knows the location of all agents on...

Alejandro Isaza, Jieshan Lu, Vadim Bulitko, Russel...

claim paper

Read More »

205

click to vote

ATAL
2006
Springer

177views Intelligent Agents» more ATAL 2006»

Evaluating bidding strategies for simultaneous auctions

15 years 8 months ago

Download euler.mcs.utulsa.edu

Bidding for multiple items or bundles on online auctions raises challenging problems. We assume that an agent has a valuation function that returns its valuation for an arbitrary ...

Teddy Candale, Sandip Sen

claim paper

Read More »

162

click to vote

DAGSTUHL
2008

109views Software Engineering» more DAGSTUHL 2008»

Uniprocessor EDF Feasibility is an Integer Problem

15 years 8 months ago

Download pico.sssup.it

The research on real-time scheduling has mostly focused on the development of algorithms that allows to test whether the constraints imposed on the task execution (often expressed ...

Enrico Bini

claim paper

Read More »

155

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

« Prev « First page 820 / 1067 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers