Sciweavers

4446 search results - page 452 / 890
» Learning Observer Agents
EMNLP
2006
15 years 8 months ago
Competitive generative models with structure learning for NLP classification tasks
In this paper we show that generative models are competitive with and sometimes superior to discriminative models, when both kinds of models are allowed to learn structures that a...
Kristina Toutanova
UAI
2001
15 years 8 months ago
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...
Lex Weaver, Nigel Tao
IJCAI
2003
15 years 8 months ago
Qualitative Map Learning Based on Co-visibility of Objects
This paper proposes a unique map learning method for mobile robots based on the co-visibility information of objects, i.e., the information on whether two objects are visible at...
Takehisa Yairi, Koichi Hori
JUCS
2008
15 years 6 months ago
Optimal Transit Price Negotiation: The Distributed Learning Perspective
We present a distributed learning algorithm for optimizing transit prices in the inter-domain routing framework. We present a combined game theoretical and distributed algorithmi...
Dominique Barth, Loubna Echabbi, Chahinez Hamlaoui
ML
2002
ACM
15 years 6 months ago
Near-Optimal Reinforcement Learning in Polynomial Time
We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...
Michael J. Kearns, Satinder P. Singh