Search Sciweavers | Sciweavers

7121 search results - page 423 / 1425

» Functions as Session-Typed Processes

212

click to vote

ICML
1996
IEEE

196views Machine Learning» more ICML 1996»

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

15 years 11 months ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

claim paper

Read More »

178

click to vote

RTSS
1993
IEEE

128views Control Systems» more RTSS 1993»

Object-Based Semantic Real-Time Concurrency Control

15 years 11 months ago

Download homepage.cs.uri.edu

This paper presents a technique that is capable of supporting two major requirements for concurrency control in real-time databases: data temporal consistency, and data logical co...

Lisa Cingiser DiPippo, Victor Fay Wolfe

claim paper

Read More »

305

click to vote

MICCAI
2000
Springer

239views Medical Imaging» more MICCAI 2000»

Robust 3D Segmentation of Anatomical Structures with Level Sets

15 years 10 months ago

Download www.irisa.fr

This paper is focused on the use of the level set formalism to segment anatomical structures in 3D images (ultrasound ou magnetic resonance images). A closed 3D surface propagates...

C. Baillard, Christian Barillot

claim paper

Read More »

193

click to vote

AAAI
2007

117views Intelligent Agents» more AAAI 2007»

Authorial Idioms for Target Distributions in TTD-MDPs

15 years 9 months ago

Download www.cc.gatech.edu

In designing Markov Decision Processes (MDP), one must deﬁne the world, its dynamics, a set of actions, and a reward function. MDPs are often applied in situations where there i...

David L. Roberts, Sooraj Bhat, Kenneth St. Clair, ...

claim paper

Read More »

179

click to vote

AAAI
2010

136views Intelligent Agents» more AAAI 2010»

Robust Policy Computation in Reward-Uncertain MDPs Using Nondominated Policies

15 years 8 months ago

Download www.cs.toronto.edu

The precise specification of reward functions for Markov decision processes (MDPs) is often extremely difficult, motivating research into both reward elicitation and the robust so...

Kevin Regan, Craig Boutilier

claim paper

Read More »

« Prev « First page 423 / 1425 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers