Search Sciweavers | Sciweavers

» Learning in Friedberg Numberings

IJCAI
2007

Artificial Intelligence

Reinforcement Learning of Local Shape in the Game of Go

Download webdocs.cs.ualberta.ca

We explore an application to the game of Go of a reinforcement learning approach based on a linear evaluation function and large numbers of binary features. This strategy has prov...

David Silver, Richard S. Sutton, Martin Mülle...

IJCAI
2007

Artificial Intelligence

Learning from Partial Observations

Download www.ijcai.org

We present a general machine learning framework for modelling the phenomenon of missing information in data. We propose a masking process model to capture the stochastic nature of...

Loizos Michael

AAAI
2006

Intelligent Agents

Active Learning with Near Misses

Download www.aaai.org

Assume that we are trying to build a visual recognizer for a particular class of objects--chairs, for example--using existing induction methods. Assume the assistance of a human t...

Nela Gurevich, Shaul Markovitch, Ehud Rivlin

AAAI
2006

Intelligent Agents

Learning Partially Observable Action Schemas

Download reason.cs.uiuc.edu

We present an algorithm that derives actions' effects and preconditions in partially observable, relational domains. Our algorithm has two unique features: an expressive rela...

Dafna Shahaf, Eyal Amir

ATAL
2010
Springer

Intelligent Agents

PAC-MDP learning with knowledge-based admissible models

Download www.aamas-conference.org

PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...

Marek Grzes, Daniel Kudenko
