Search Sciweavers | Sciweavers

12194 search results - page 240 / 2439

» Numberings Optimal for Learning

192

click to vote

UAI
2003

104views Artificial Intelligence» more UAI 2003»

Optimal Limited Contingency Planning

15 years 8 months ago

Download ti.arc.nasa.gov

For a given problem, the optimal Markov policy over a ﬁnite horizon is a conditional plan containing a potentially large number of branches. However, there are applications wher...

Nicolas Meuleau, David E. Smith

claim paper

Read More »

150

click to vote

VLDB
1995
ACM

96views Database» more VLDB 1995»

The Fittest Survives: An Adaptive Approach to Query Optimization

15 years 10 months ago

Download www.vldb.org

Traditionally, optimizers are “programmed” to optimize queries following a set of buildin procedures. However, optimizers should be robust to its changing environment to gener...

Hongjun Lu, Kian-Lee Tan, Son Dao

claim paper

Read More »

178

click to vote

ICML
2004
IEEE

214views Machine Learning» more ICML 2004»

Apprenticeship learning via inverse reinforcement learning

16 years 7 months ago

Download ai.stanford.edu

We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we wa...

Pieter Abbeel, Andrew Y. Ng

claim paper

Read More »

171

click to vote

IPPS
2005
IEEE

141views Distributed And Parallel Com...» more IPPS 2005»

Optimal Channel Assignments for Lattices with Conditions at Distance Two

16 years 4 days ago

Download www.math.sc.edu

The problem of radio channel assignments with multiple levels of interference can be modeled using graph theory. Given a graph G, possibly inﬁnite, and real numbers k1, k2, . . ...

Jerrold R. Griggs, Xiaohua Teresa Jin

claim paper

Read More »

176

click to vote

ICML
2005
IEEE

127views Machine Learning» more ICML 2005»

Exploration and apprenticeship learning in reinforcement learning

16 years 7 months ago

Download ai.stanford.edu

We consider reinforcement learning in systems with unknown dynamics. Algorithms such as E3 (Kearns and Singh, 2002) learn near-optimal policies by using "exploration policies...

Pieter Abbeel, Andrew Y. Ng

claim paper

Read More »

« Prev « First page 240 / 2439 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers