Search Sciweavers | Sciweavers


» Studies in Solution Sampling


IJCAI
2001


Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

15 years 8 months ago

Download www.cs.colorado.edu

Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...

Gregory Z. Grudic, Lyle H. Ungar


AI
2008
Springer


Sequential Monte Carlo in reachability heuristics for probabilistic planning

15 years 6 months ago

Download rakaposhi.eas.asu.edu

The current best conformant probabilistic planners encode the problem as a bounded length CSP or SAT problem. While these approaches can find optimal solutions for given plan leng...

Daniel Bryce, Subbarao Kambhampati, David E. Smith


CJ
2008


Three Kinds of Probabilistic Induction: Universal Distributions and Convergence Theorems

15 years 6 months ago

Download world.std.com

We will describe three kinds of probabilistic induction problems, and give general solutions for each, with associated convergence theorems that show they tend to give good proba...

Ray J. Solomonoff


MP
2006


Convergence theory for nonconvex stochastic programming with an application to mixed logit

15 years 6 months ago

Download www.fundp.ac.be

Monte Carlo methods have been used extensively in the area of stochastic programming. As with other methods that involve a level of uncertainty, theoretical properties are required...

Fabian Bastin, Cinzia Cirillo, Philippe L. Toint


PAMI
2006


A Binary Linear Programming Formulation of the Graph Edit Distance

15 years 6 months ago

Download www.eecs.umich.edu

A binary linear programming formulation of the graph edit distance for unweighted, undirected graphs with vertex attributes is derived and applied to a graph recognition problem. ...

Derek Justice, Alfred O. Hero
