Search Sciweavers | Sciweavers

233 search results - page 15 / 47

» Composing and combining policies under the policy machine

ICML 2000, IEEE

165 views, Machine Learning

A Bayesian Framework for Reinforcement Learning

15 years 10 months ago

Download www.ece.uvic.ca

The reinforcement learning problem can be decomposed into two parallel types of inference: (i) estimating the parameters of a model for the underlying process; (ii) determining be...

Malcolm J. A. Strens

SACMAT 2009, ACM

122 views, Control Systems

Dynamic mandatory access control for multiple stakeholders

16 years 18 days ago

Download www.cse.psu.edu

In this paper, we present a mandatory access control system that uses input from multiple stakeholders to compose policies based on runtime information. In the emerging ubiquitous...

Vikhyath Rao, Trent Jaeger

ECAI 2010, Springer

227 views, Artificial Intelligence

On Finding Compromise Solutions in Multiobjective Markov Decision Processes

15 years 7 months ago

Download www-desir.lip6.fr

A Markov Decision Process (MDP) is a general model for solving planning problems under uncertainty. It has been extended to multiobjective MDP to address multicriteria or multiagen...

Patrice Perny, Paul Weng

CORR 2008, Springer

77 views, Education

Energy-efficient Scheduling of Delay Constrained Traffic over Fading Channels

15 years 6 months ago

Download www.ece.umn.edu

A delay-constrained scheduling problem for point-to-point communication is considered: a packet of B bits must be transmitted by a hard deadline of T slots over a time-vary...

Juyul Lee, Nihar Jindal

ICML 2000, IEEE

126 views, Machine Learning

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 6 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled POMDPs. We introduce GPOMDP, a...

Jonathan Baxter, Peter L. Bartlett
