Search Sciweavers | Sciweavers

1912 search results - page 294 / 383

» Optimizing interconnection policies

146

click to vote

ICML
2006
IEEE

101views Machine Learning» more ICML 2006»

Qualitative reinforcement learning

16 years 7 months ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

claim paper

Read More »

153

click to vote

ICML
2004
IEEE

158views Machine Learning» more ICML 2004»

Adaptive cognitive orthotics: combining reinforcement learning and constraint-based temporal reasoning

16 years 7 months ago

Download www.eecs.umich.edu

Reminder systems support people with impaired prospective memory and/or executive function, by providing them with reminders of their functional daily activities. We integrate tem...

Matthew R. Rudary, Satinder P. Singh, Martha E. Po...

claim paper

Read More »

159

click to vote

ICML
2003
IEEE

124views Machine Learning» more ICML 2003»

Exploration in Metric State Spaces

16 years 7 months ago

Download www.cis.upenn.edu

We present metric?? , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

claim paper

Read More »

164

click to vote

ICML
1998
IEEE

268views Machine Learning» more ICML 1998»

The MAXQ Method for Hierarchical Reinforcement Learning

16 years 7 months ago

Download www.cs.ualberta.ca

This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...

Thomas G. Dietterich

claim paper

Read More »

181

click to vote

EUROSYS
2010
ACM

189views Software Engineering» more EUROSYS 2010»

Dr. Multicast: Rx for Data Center Communication Scalability

16 years 3 months ago

Download www.cs.cornell.edu

Data centers avoid IP Multicast because of a series of problems with the technology. We propose Dr. Multicast (MCMD), a system that maps IPMC operations to a combination of point-...

Ymir Vigfusson, Hussam Abu-Libdeh, Mahesh Balakris...

claim paper

Read More »

« Prev « First page 294 / 383 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers