Search Sciweavers | Sciweavers

4446 search results - page 452 / 890

» Learning Observer Agents

201

click to vote

EMNLP
2006

190views Natural Language Processing» more EMNLP 2006»

Competitive generative models with structure learning for NLP classification tasks

15 years 8 months ago

Download nlp.stanford.edu

In this paper we show that generative models are competitive with and sometimes superior to discriminative models, when both kinds of models are allowed to learn structures that a...

Kristina Toutanova

claim paper

Read More »

157

click to vote

UAI
2001

129views Artificial Intelligence» more UAI 2001»

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

15 years 8 months ago

Download cs.anu.edu.au

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...

Lex Weaver, Nigel Tao

claim paper

Read More »

174

click to vote

IJCAI
2003

103views Artificial Intelligence» more IJCAI 2003»

Qualitative Map Learning Based on Co-visibility of Objects

15 years 8 months ago

Download ijcai.org

This paper proposes a unique map learning method for mobile robots based on the co-visibility infor mation of objects i.e., the information on whether two objects are visible at...

Takehisa Yairi, Koichi Hori

claim paper

Read More »

182

click to vote

JUCS
2008

104views more JUCS 2008»

Optimal Transit Price Negotiation: The Distributed Learning Perspective

15 years 6 months ago

Download www.jucs.org

: We present a distributed learning algorithm for optimizing transit prices in the inter-domain routing framework. We present a combined game theoretical and distributed algorithmi...

Dominique Barth, Loubna Echabbi, Chahinez Hamlaoui

claim paper

Read More »

176

click to vote

ML
2002
ACM

121views Machine Learning» more ML 2002»

Near-Optimal Reinforcement Learning in Polynomial Time

15 years 6 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

« Prev « First page 452 / 890 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers