Search Sciweavers | Sciweavers

2900 search results - page 223 / 580

» On the Convergence of Immune Algorithms


ATAL
2008
Springer


Stochastic search methods for Nash equilibrium approximation in simulation-based games

15 years 8 months ago

Download www.seas.upenn.edu

We define the class of games called simulation-based games, in which the payoffs are available as an output of an oracle (simulator), rather than specified analytically or using a...

Yevgeniy Vorobeychik, Michael P. Wellman


SIAMJO
2008


An Inexact SQP Method for Equality Constrained Optimization

15 years 6 months ago

Download users.eecs.northwestern.edu

We present an algorithm for large-scale equality constrained optimization. The method is based on a characterization of inexact sequential quadratic programming (SQP) steps that ca...

Richard H. Byrd, Frank E. Curtis, Jorge Nocedal


AI
2002
Springer


Multiagent learning using a variable learning rate

15 years 6 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso


Publication


Multi-Objects Interpretation

16 years 7 months ago

Download perso.lcpc.fr

We describe a general-purpose method for the accurate and robust interpretation of a data set of p-dimensional points by several deformable prototypes. This method is based on the ...

Jean-Philippe Tarel

posted by jptarel


ICML
2010
IEEE


Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda

15 years 4 months ago

Download www.icml2010.org

Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...

Carlton Downey, Scott Sanner
