Sciweavers

4446 search results - page 431 / 890
» Learning Observer Agents
Sort
View
192
Voted
PKDD
2010
Springer
183views Data Mining» more  PKDD 2010»
15 years 5 months ago
Fast Active Exploration for Link-Based Preference Learning Using Gaussian Processes
Abstract. In preference learning, the algorithm observes pairwise relative judgments (preference) between items as training data for learning an ordering of all items. This is an i...
Zhao Xu, Kristian Kersting, Thorsten Joachims
ATAL
2008
Springer
15 years 8 months ago
Modeling how humans reason about others with partial information
Computer agents participate in many collaborative and competitive multiagent domains in which humans make decisions. For computer agents to interact successfully with people in su...
Sevan G. Ficici, Avi Pfeffer
149
Voted
ICML
2003
IEEE
16 years 7 months ago
BL-WoLF: A Framework For Loss-Bounded Learnability In Zero-Sum Games
We present BL-WoLF, a framework for learnability in repeated zero-sum games where the cost of learning is measured by the losses the learning agent accrues (rather than the number...
Vincent Conitzer, Tuomas Sandholm
191
Voted
SAB
2010
Springer
117views Optimization» more  SAB 2010»
15 years 5 months ago
Indirectly Encoding Neural Plasticity as a Pattern of Local Rules
Biological brains can adapt and learn from past experience. In neuroevolution, i.e. evolving artificial neural networks (ANNs), one way that agents controlled by ANNs can evolve t...
Sebastian Risi, Kenneth O. Stanley
ATAL
2009
Springer
16 years 1 months ago
Generalization risk minimization in empirical game models
Experimental analysis of agent strategies in multiagent systems presents a tradeoff between granularity and statistical confidence. Collecting a large amount of data about each s...
Patrick R. Jordan, Michael P. Wellman