Sciweavers

» Learning Observer Agents
AAAI
2006
15 years 7 months ago
RL-CD: Dealing with Non-Stationarity in Reinforcement Learning
Bruno Castro da Silva, Eduardo W. Basso, Ana L. C....
ICAART
2010
INSTICC
15 years 7 months ago
Dialogue-based Management of User Feedback in an Autonomous Preference Learning System
Juan Manuel Lucas-Cuesta, Javier Ferreiros, Asier ...
AIIDE
2008
15 years 8 months ago
Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games
We introduce the ALeRT (Action-dependent Learning Rates with Trends) algorithm that makes two modifications to the learning rate and one change to the exploration rate of traditio...
Maria Cutumisu, Duane Szafron, Michael H. Bowling,...