Sciweavers

2023 search results - page 220 / 405
» Human Agents and Intelligent Agents: An Experiment on the In...
Sort
View
IAT
2008
IEEE
15 years 6 months ago
Scaling Up Multi-agent Reinforcement Learning in Complex Domains
TD-FALCON (Temporal Difference - Fusion Architecture for Learning, COgnition, and Navigation) is a class of self-organizing neural networks that incorporates Temporal Difference (...
Dan Xiao, Ah-Hwee Tan
AAMAS
2010
Springer
15 years 6 months ago
What the 2007 TAC Market Design Game tells us about effective auction mechanisms
This paper analyzes the entrants to the 2007 TAC Market Design Game. We present a classification of the entries to the competition, and use this classification to compare these ent...
Jinzhong Niu, Kai Cai, Simon Parsons, Peter McBurn...
MP
2011
15 years 1 months ago
A first-order interior-point method for linearly constrained smooth optimization
Abstract: We propose a first-order interior-point method for linearly constrained smooth optimization that unifies and extends first-order affine-scaling method and replicator d...
Paul Tseng, Immanuel M. Bomze, Werner Schachinger
AAAI
2011
14 years 6 months ago
Value Function Approximation in Reinforcement Learning Using the Fourier Basis
We describe the Fourier Basis, a linear value function approximation scheme based on the Fourier Series. We empirically evaluate its properties, and demonstrate that it performs w...
George Konidaris, Sarah Osentoski, Philip Thomas
AAAI
2011
14 years 6 months ago
Controlling Selection Bias in Causal Inference
Selection bias, caused by preferential exclusion of samples from the data, is a major obstacle to valid causal and statistical inferences; it cannot be removed by randomized exper...
Elias Bareinboim, Judea Pearl