Search Sciweavers | Sciweavers

1974 search results - page 263 / 395

» Online learning in online auctions

NIPS
1993

86 views | Information Technology

Robust Reinforcement Learning in Motion Planning

15 years 7 months ago

Download www.cs.cmu.edu

While exploring to find better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

AIIDE
2009

221 views | Artificial Intelligence

Learning Character Behaviors Using Agent Modeling in Games

15 years 7 months ago

Download webdocs.cs.ualberta.ca

Our goal is to provide learning mechanisms to game agents so they are capable of adapting to new behaviors based on the actions of other agents. We introduce a new on-line reinfor...

Richard Zhao, Duane Szafron

ICML
2010
IEEE

304 views | Machine Learning

FAB-MAP: Appearance-Based Place Recognition and Mapping using a Learned Visual Vocabulary Model

15 years 6 months ago

Download www.icml2010.org

We present an overview of FAB-MAP, an algorithm for place recognition and mapping developed for infrastructure-free mobile robot navigation in large environments. The system allow...

Mark Joseph Cummins, Paul M. Newman

ML
2000
ACM

126 views | Machine Learning

Learning to Play Chess Using Temporal Differences

15 years 6 months ago

Download www.cs.princeton.edu

In this paper we present TDLEAF(λ), a variation on the TD(λ) algorithm that enables it to be used in conjunction with game-tree search. We present some experiments in which our che...

Jonathan Baxter, Andrew Tridgell, Lex Weaver

NN
1998
Springer

87 views | Neural Networks

Distributed ARTMAP: a neural network for fast distributed supervised learning

15 years 6 months ago

Download techlab.bu.edu

Distributed coding at the hidden layer of a multi-layer perceptron (MLP) endows the network with memory compression and noise tolerance capabilities. However, an MLP typically req...

Gail A. Carpenter, Boriana L. Milenova, Benjamin W...
