Search Sciweavers | Sciweavers

2566 search results - page 273 / 514

» Relating reinforcement learning performance to classificatio...

219

click to vote

DCOSS
2008
Springer

193views Distributed And Parallel Com...» more DCOSS 2008»

Efficient Node Discovery in Mobile Wireless Sensor Networks

15 years 8 months ago

Download www.cl.cam.ac.uk

Energy is one of the most crucial aspects in real deployments of mobile sensor networks. As a result of scarce resources, the duration of most real deployments can be limited to ju...

Vladimir Dyo, Cecilia Mascolo

claim paper

Read More »

165

click to vote

UAI
2008

187views Artificial Intelligence» more UAI 2008»

Knowledge Combination in Graphical Multiagent Models

15 years 8 months ago

Download uai2008.cs.helsinki.fi

A graphical multiagent model (GMM) represents a joint distribution over the behavior of a set of agents. One source of knowledge aboutagents'behaviormaycomefromgametheoretic ...

Quang Duong, Michael P. Wellman, Satinder P. Singh

claim paper

Read More »

188

click to vote

NN
2010
Springer

125views Neural Networks» more NN 2010»

Parameter-exploring policy gradients

15 years 5 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

claim paper

Read More »

182

click to vote

AGI
2011

286views Artificial Intelligence» more AGI 2011»

Comparing Humans and AI Agents

14 years 10 months ago

Download users.dsic.upv.es

Comparing humans and machines is one important source of information about both machine and human strengths and limitations. Most of these comparisons and competitions are performe...

Javier Insa-Cabrera, David L. Dowe, Sergio Espa&nt...

claim paper

Read More »

272

click to vote

BDA
2007

171views Knowledge Management» more BDA 2007»

Hyperplane Queries in a Feature-Space M-tree for Speeding up Active Learning

15 years 8 months ago

Download perso.lcpc.fr

In content-based retrieval, relevance feedback (RF) is a noticeable method for reducing the “semantic gap” between the low-level features describing the content and the usually...

Michel Crucianu, Daniel Estevez, Vincent Oria, Jea...

posted by jptarel

Read More »

« Prev « First page 273 / 514 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers