Sciweavers

3274 search results - page 357 / 655
» Using Learning in a Control Agent
Sort
View
ATAL
2005
Springer
16 years 6 days ago
Towards a functional ontology of reputation
This paper proposes a functional ontology of reputation for agents. The goal of this ontology is twofold. First, to put together the broad knowledge about reputation produced in s...
Sara J. Casare, Jaime Simão Sichman
WOA
2000
15 years 8 months ago
An Agent-based Paradigm for Allocating Multi-Provider Service Demands
The increasing number of competitors and the growing traffic demand are the main factors pushing for a more dynamic and flexible service demand allocation mechanism. Human interac...
Monique Calisti, Boi Faltings
ATAL
2010
Springer
15 years 7 months ago
Merging example plans into generalized plans for non-deterministic environments
We present a new approach for finding generalized contingent plans with loops and branches in situations where there is uncertainty in state properties and object quantities, but ...
Siddharth Srivastava, Neil Immerman, Shlomo Zilber...
ICML
2005
IEEE
16 years 7 months ago
Dynamic preferences in multi-criteria reinforcement learning
The current framework of reinforcement learning is based on maximizing the expected returns based on scalar rewards. But in many real world situations, tradeoffs must be made amon...
Sriraam Natarajan, Prasad Tadepalli
IDEAL
2003
Springer
15 years 12 months ago
On Hadamard-Type Output Coding in Multiclass Learning
The error-correcting output coding (ECOC) method reduces the multiclass learning problem into a series of binary classifiers. In this paper, we consider the dense ECOC methods, co...
Aijun Zhang, Zhi-Li Wu, Chun Hung Li, Kai-Tai Fang