Search Sciweavers | Sciweavers

» Intelligent agents in e-services

AAAI
2006

136 views, Intelligent Agents

Learning Partially Observable Action Schemas

15 years 8 months ago

Download reason.cs.uiuc.edu

We present an algorithm that derives actions' effects and preconditions in partially observable, relational domains. Our algorithm has two unique features: an expressive rela...

Dafna Shahaf, Eyal Amir

AAAI
2006

108 views, Intelligent Agents

Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains

15 years 8 months ago

Download www.eecs.umich.edu

We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...

Vishal Soni, Satinder P. Singh

AAAI
2006

105 views, Intelligent Agents

An Asymptotically Optimal Algorithm for the Max k-Armed Bandit Problem

15 years 8 months ago

Download www.aaai.org

We present an asymptotically optimal algorithm for the max variant of the k-armed bandit problem. Given a set of k slot machines, each yielding payoff from a fixed (but unknown) d...

Matthew J. Streeter, Stephen F. Smith

AAAI
2006

127 views, Intelligent Agents

Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance

15 years 8 months ago

Download robotic.media.mit.edu

As robots become a mass consumer product, they will need to learn new skills by interacting with typical human users. Past approaches have adapted reinforcement learning (RL) to a...

Andrea Lockerd Thomaz, Cynthia Breazeal

AAAI
2006

112 views, Intelligent Agents

From Pigeons to Humans: Grounding Relational Learning in Concrete Examples

15 years 8 months ago

Download www.aaai.org

We present a cognitive model that bridges work in analogy and category learning. The model, Building Relations through Instance Driven Gradient Error Shifting (BRIDGES), extends A...

Marc T. Tomlinson, Bradley C. Love
