Sciweavers

8651 search results - page 1434 / 1731
» Intelligent agents as innovations
Sort
View
AAAI
2006
15 years 8 months ago
Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance
As robots become a mass consumer product, they will need to learn new skills by interacting with typical human users. Past approaches have adapted reinforcement learning (RL) to a...
Andrea Lockerd Thomaz, Cynthia Breazeal
AAAI
2006
15 years 8 months ago
From Pigeons to Humans: Grounding Relational Learning in Concrete Examples
We present a cognitive model that bridges work in analogy and category learning. The model, Building Relations through Instance Driven Gradient Error Shifting (BRIDGES), extends A...
Marc T. Tomlinson, Bradley C. Love
AAAI
2006
15 years 8 months ago
Compact, Convex Upper Bound Iteration for Approximate POMDP Planning
Partially observable Markov decision processes (POMDPs) are an intuitive and general way to model sequential decision making problems under uncertainty. Unfortunately, even approx...
Tao Wang, Pascal Poupart, Michael H. Bowling, Dale...
AAAI
2006
15 years 8 months ago
Evaluating Preference-based Search Tools: A Tale of Two Approaches
People frequently use the world-wide web to find their most preferred item among a large range of options. We call this task preference-based search. The most common tool for pref...
Paolo Viappiani, Boi Faltings, Pearl Pu
AAAI
2006
15 years 8 months ago
Mixtures of Predictive Linear Gaussian Models for Nonlinear, Stochastic Dynamical Systems
The Predictive Linear Gaussian model (or PLG) improves upon traditional linear dynamical system models by using a predictive representation of state, which makes consistent parame...
David Wingate, Satinder P. Singh
« Prev « First page 1434 / 1731 Last » Next »