Sciweavers

2900 search results - page 260 / 580
» On the Convergence of Immune Algorithms
Sort
View
NIPS
2008
15 years 8 months ago
Regularized Policy Iteration
In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...
Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...
AAAI
2006
15 years 8 months ago
Focused Real-Time Dynamic Programming for MDPs: Squeezing More Out of a Heuristic
Real-time dynamic programming (RTDP) is a heuristic search algorithm for solving MDPs. We present a modified algorithm called Focused RTDP with several improvements. While RTDP ma...
Trey Smith, Reid G. Simmons
147
Voted
JMLR
2006
115views more  JMLR 2006»
15 years 6 months ago
Structured Prediction, Dual Extragradient and Bregman Projections
We present a simple and scalable algorithm for maximum-margin estimation of structured output models, including an important class of Markov networks and combinatorial models. We ...
Benjamin Taskar, Simon Lacoste-Julien, Michael I. ...
TEC
2002
120views more  TEC 2002»
15 years 6 months ago
Optimization based on bacterial chemotaxis
We present an optimization algorithm based on a model of bacterial chemotaxis. The original biological model is used to formulate a simple optimization algorithm, which is evaluate...
Sibylle D. Müller, Jarno Marchetto, Stefano A...
ICASSP
2009
IEEE
15 years 4 months ago
A performance-weighted mixture of LMS filters
In this paper, we explore the use of a particular multistage adaptation algorithm for a variety of adaptive filtering applications where the structure of the underlying process to...
Suleyman Serdar Kozat, Andrew C. Singer