Search Sciweavers | Sciweavers

1829 search results - page 175 / 366

» On Learning Soccer Strategies

ILP
2003
Springer

116 views, Automated Reasoning

Disjunctive Learning with a Soft-Clustering Method

15 years 11 months ago

Download www.univ-orleans.fr

In the case of concept learning from positive and negative examples, it is rarely possible to find a unique discriminating conjunctive rule; in most cases, a disjunctive descripti...

Guillaume Cleuziou, Lionel Martin, Christel Vrain

ATAL
2007
Springer

81 views, Intelligent Agents

Multiagent learning in adaptive dynamic systems

16 years 17 days ago

Download www.damas.ift.ulaval.ca

Classically, an approach to the multiagent policy learning supposed that the agents, via interactions and/or by using preliminary knowledge about the reward functions of all playe...

Andriy Burkov, Brahim Chaib-draa

ICML
2002
IEEE

156 views, Machine Learning

Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs

16 years 7 months ago

Download select.cs.cmu.edu

One of the central challenges in reinforcement learning is to balance the exploration/exploitation tradeoff while scaling up to large problems. Although model-based reinforcement ...

Carlos Guestrin, Relu Patrascu, Dale Schuurmans

SDM
2012
SIAM

252 views, Data Mining

Learning from Heterogeneous Sources via Gradient Boosting Consensus

13 years 8 months ago

Download david.grangier.info

Multiple data sources containing different types of features may be available for a given task. For instance, users’ profiles can be used to build recommendation systems. In a...

Xiaoxiao Shi, Jean-François Paiement, David...

AAMAS
2002
Springer

157 views, Intelligent Agents

Adapting Populations of Agents

15 years 6 months ago

Download www.macs.hw.ac.uk

We control a population of interacting software agents. The agents have a strategy, and receive a payoff for executing that strategy. Unsuccessful agents become extinct. We investi...

Philippe De Wilde, Maria Chli, Luís Correia...
