Sciweavers

12194 search results - page 260 / 2439
» cans 2010
JMLR
2010
15 years 1 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
JOLLI
2010
15 years 1 months ago
A Dynamic Logic of Agency I: STIT, Capabilities and Powers
The aim of this paper is to provide a logical framework for reasoning about actions, agency, and powers of agents and coalitions in game-like multi-agent systems. First we define ...
Andreas Herzig, Emiliano Lorini
JSAC
2010
15 years 1 months ago
Utility-based asynchronous flow control algorithm for wireless sensor networks
In this paper, we formulate a flow control optimization problem for wireless sensor networks with lifetime constraint and link interference in an asynchronous setting. Ou...
Jiming Chen, WeiQiang Xu, Shibo He, Youxian Sun, P...
JSCIC
2010
15 years 1 months ago
Interior Penalty Continuous and Discontinuous Finite Element Approximations of Hyperbolic Equations
In this paper, we present the continuous and discontinuous Galerkin methods in a unified setting for the numerical approximation of the transport-dominated advection-reaction equati...
Erik Burman, Alfio Quarteroni, Benjamin Stamm
JSCIC
2010
15 years 1 months ago
A Proof of the Stability of the Spectral Difference Method for All Orders of Accuracy
While second order methods for computational simulations of fluid flow provide the basis of widely used commercial software, there is a need for higher order methods for more accur...
Antony Jameson