Sciweavers

4446 search results - page 476 / 890
» Learning Observer Agents
Sort
View
NIPS
2007
15 years 8 months ago
The Tradeoffs of Large Scale Learning
This contribution develops a theoretical framework that takes into account the effect of approximate optimization on learning algorithms. The analysis shows distinct tradeoffs for...
Léon Bottou, Olivier Bousquet
ALT
2006
Springer
16 years 3 months ago
General Discounting Versus Average Reward
Consider an agent interacting with an environment in cycles. In every interaction cycle the agent is rewarded for its performance. We compare the average reward U from cycle 1 to ...
Marcus Hutter
IAT
2009
IEEE
16 years 1 months ago
Automated Web Site Evaluation - An Approach Based on Ranking SVM
This paper proposes an automated web site evaluation approach using machine learning to cope with ranking problems. Evaluating web sites is a significant task for web service beca...
Peng Li, Seiji Yamada
ECAL
2007
Springer
16 years 1 months ago
Grounding Action-Selection in Event-Based Anticipation
Anticipation is one of the key aspects involved in flexible and adaptive behavior. The ability for an autonomous agent to extract a relevant model of its coupling with the environ...
Philippe Capdepuy, Daniel Polani, Chrystopher L. N...
IAT
2006
IEEE
16 years 26 days ago
Resolution-Based Policy Search for Imperfect Information Differential Games
Differential games (DGs), considered as a typical model of game with continuous states and non-linear dynamics, play an important role in control and optimization. Finding optimal...
Minh Nguyen-Duc, Brahim Chaib-draa