Search Sciweavers | Sciweavers

2263 search results - page 178 / 453

» On learning in agent-centered search

211

click to vote

ATAL
2005
Springer

181views Intelligent Agents» more ATAL 2005»

Improving reinforcement learning function approximators via neuroevolution

16 years 1 days ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

claim paper

Read More »

157

click to vote

ICML
2009
IEEE

131views Machine Learning» more ICML 2009»

Monte-Carlo simulation balancing

16 years 7 months ago

Download www.cs.ualberta.ca

In this paper we introduce the first algorithms for efficiently learning a simulation policy for Monte-Carlo search. Our main idea is to optimise the balance of a simulation polic...

David Silver, Gerald Tesauro

claim paper

Read More »

180

click to vote

ATAL
2009
Springer

167views Intelligent Agents» more ATAL 2009»

Solving multiagent assignment Markov decision processes

16 years 1 months ago

Download www.aamas-conference.org

We consider the setting of multiple collaborative agents trying to complete a set of tasks as assigned by a centralized controller. We propose a scalable method called“Assignmen...

Scott Proper, Prasad Tadepalli

claim paper

Read More »

170

click to vote

ICCS
1993
Springer

99views Applied Computing» more ICCS 1993»

Towards Domain-Independent Machine Intelligence

15 years 10 months ago

Download www.soe.ucsc.edu

Adaptive predictive search (APS), is a learning system framework, which given little initial domain knowledge, increases its decision-making abilities in complex problems domains....

Robert Levinson

claim paper

Read More »

190

click to vote

ENC
2004
IEEE

160views Theoretical Computer Science» more ENC 2004»

A Method Based on Genetic Algorithms and Fuzzy Logic to Induce Bayesian Networks

15 years 10 months ago

Download www.uv.mx

A method to induce bayesian networks from data to overcome some limitations of other learning algorithms is proposed. One of the main features of this method is a metric to evalua...

Manuel Martínez-Morales, Ramiro Garza-Dom&i...

claim paper

Read More »

« Prev « First page 178 / 453 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers