Search Sciweavers | Sciweavers

2900 search results - page 260 / 580

On the Convergence of Immune Algorithms

NIPS
2008

165 views, Information Technology

Regularized Policy Iteration

15 years 8 months ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

AAAI
2006

121 views, Intelligent Agents

Focused Real-Time Dynamic Programming for MDPs: Squeezing More Out of a Heuristic

15 years 8 months ago

Download www.cs.cmu.edu

Real-time dynamic programming (RTDP) is a heuristic search algorithm for solving MDPs. We present a modified algorithm called Focused RTDP with several improvements. While RTDP ma...

Trey Smith, Reid G. Simmons

JMLR
2006

115 views

Structured Prediction, Dual Extragradient and Bregman Projections

15 years 6 months ago

Download www.stat.berkeley.edu

We present a simple and scalable algorithm for maximum-margin estimation of structured output models, including an important class of Markov networks and combinatorial models. We ...

Benjamin Taskar, Simon Lacoste-Julien, Michael I. ...

TEC
2002

120 views

Optimization based on bacterial chemotaxis

15 years 6 months ago

Download www.cse-lab.ethz.ch

We present an optimization algorithm based on a model of bacterial chemotaxis. The original biological model is used to formulate a simple optimization algorithm, which is evaluate...

Sibylle D. Müller, Jarno Marchetto, Stefano A...

ICASSP
2009
IEEE

128 views, Signal Processing

A performance-weighted mixture of LMS filters

15 years 4 months ago

Download www.ifp.illinois.edu

In this paper, we explore the use of a particular multistage adaptation algorithm for a variety of adaptive filtering applications where the structure of the underlying process to...

Suleyman Serdar Kozat, Andrew C. Singer
