Search Sciweavers | Sciweavers

9119 search results - page 358 / 1824

» A Simultaneous Search Problem

AAAI
2007

142 views, Intelligent Agents

Temporal Difference and Policy Search Methods for Reinforcement Learning: An Empirical Comparison

15 years 9 months ago

Download staff.science.uva.nl

Reinforcement learning (RL) methods have become popular in recent years because of their ability to solve complex tasks with minimal feedback. Both genetic algorithms (GAs) and te...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

ECAI
2006
Springer

117 views, Artificial Intelligence

Calibrating Probability Density Forecasts with Multi-Objective Search

15 years 8 months ago

Download www.scss.tcd.ie

Abstract. In this paper, we show that the optimization of density forecasting models for regression in machine learning can be formulated as a multi-objective problem. We describe ...

Michael Carney, Padraig Cunningham

CP
2008
Springer

68 views, Artificial Intelligence

Guiding Search in QCSP+ with Back-Propagation

15 years 8 months ago

Download www.lirmm.fr

The Quantified Constraint Satisfaction Problem (QCSP) has been introduced to express situations in which we are not able to control the value of some of the variables (the universa...

Guillaume Verger, Christian Bessiere

ICMLA
2007

94 views, Machine Learning

Phase transition and heuristic search in relational learning

15 years 8 months ago

Download www-lipn.univ-paris13.fr

Several works have shown that the covering test in relational learning exhibits a phase transition in its covering probability. It is argued that this phase transition dooms every...

Érick Alphonse, Aomar Osmani

IJCAI
2003

169 views, Artificial Intelligence

Covariant Policy Search

15 years 8 months ago

Download www.ri.cmu.edu

We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geom...

J. Andrew Bagnell, Jeff G. Schneider
