Search Sciweavers | Sciweavers

» Intelligent agents as innovations

AAAI
2006

Contingent Planning with Goal Preferences

The importance of the problems of contingent planning with actions that have non-deterministic effects and of planning with goal preferences has been widely recognized, and severa...

Dmitry Shaparau, Marco Pistore, Paolo Traverso

AAAI
2006

Unsupervised Order-Preserving Regression Kernel for Sequence Analysis

In this work, a generalized method for learning from sequence of unlabelled data points based on unsupervised order-preserving regression is proposed. Sequence learning is a funda...

Young-In Shin

AAAI
2006

Learning Partially Observable Action Schemas

We present an algorithm that derives actions' effects and preconditions in partially observable, relational domains. Our algorithm has two unique features: an expressive rela...

Dafna Shahaf, Eyal Amir

AAAI
2006

Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains

We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...

Vishal Soni, Satinder P. Singh

AAAI
2006

An Asymptotically Optimal Algorithm for the Max k-Armed Bandit Problem

We present an asymptotically optimal algorithm for the max variant of the k-armed bandit problem. Given a set of k slot machines, each yielding payoff from a fixed (but unknown) d...

Matthew J. Streeter, Stephen F. Smith
