Search Sciweavers | Sciweavers

JMLR
2010

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 1 months ago

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

JOLLI
2010

A Dynamic Logic of Agency I: STIT, Capabilities and Powers

15 years 1 months ago

Download www.springerlink.com

The aim of this paper is to provide a logical framework for reasoning about actions, agency, and powers of agents and coalitions in game-like multi-agent systems. First we define ...

Andreas Herzig, Emiliano Lorini

JSAC
2010

Utility-based asynchronous flow control algorithm for wireless sensor networks

15 years 1 months ago

Download bbcr.uwaterloo.ca

In this paper, we formulate a flow control optimization problem for wireless sensor networks with lifetime constraint and link interference in an asynchronous setting. Ou...

Jiming Chen, WeiQiang Xu, Shibo He, Youxian Sun, P...

JSCIC
2010

Interior Penalty Continuous and Discontinuous Finite Element Approximations of Hyperbolic Equations

15 years 1 months ago

Download www.maths.sussex.ac.uk

In this paper, we present the continuous and discontinuous Galerkin methods in a unified setting for the numerical approximation of the transport-dominated advection-reaction equati...

Erik Burman, Alfio Quarteroni, Benjamin Stamm

JSCIC
2010

A Proof of the Stability of the Spectral Difference Method for All Orders of Accuracy

15 years 1 months ago

Download aero-comlab.stanford.edu

While second order methods for computational simulations of fluid flow provide the basis of widely used commercial software, there is a need for higher order methods for more accur...

Antony Jameson
