Search Sciweavers | Sciweavers

4446 search results - page 324 / 890

» Learning Observer Agents


ISDA
2009
IEEE

144 views, Operating System

Postponed Updates for Temporal-Difference Reinforcement Learning

16 years 1 month ago

Download www.science.uva.nl

This paper presents postponed updates, a new strategy for TD methods that can improve sample efficiency without incurring the computational and space requirements of model-based ...

Harm van Seijen, Shimon Whiteson



ICRA
2003
IEEE

116 views, Robotics

Learning to role-switch in multi-robot systems

15 years 12 months ago

Download www.cc.gatech.edu

We present an approach that uses Q-learning on individual robotic agents for coordinating a mission-tasked team of robots in a complex scenario. To reduce the size of the state sp...

Eric Martinson, Ronald C. Arkin



AAAI
2007

123 views, Intelligent Agents

Learning and Inference for Hierarchically Split PCFGs

15 years 9 months ago

Download www.petrovi.de

Treebank parsing can be seen as the search for an optimally refined grammar consistent with a coarse training treebank. We describe a method in which a minimal grammar is hierarc...

Slav Petrov, Dan Klein



AAAI
2006

142 views, Intelligent Agents

Learning Basis Functions in Hybrid Domains

15 years 8 months ago

Download www.aaai.org

Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea ...

Branislav Kveton, Milos Hauskrecht



AAAI
2006

153 views, Intelligent Agents

kFOIL: Learning Simple Relational Kernels

15 years 8 months ago

Download www.aaai.org

A novel and simple combination of inductive logic programming with kernel methods is presented. The kFOIL algorithm integrates the well-known inductive logic programming system FO...

Niels Landwehr, Andrea Passerini, Luc De Raedt, Pa...

