Sciweavers

Search results for: On Unbiased Linear Approximations
ATAL
2008
Springer
15 years 8 months ago
Sigma point policy iteration
In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...
Michael H. Bowling, Alborz Geramifard, David Winga...
AAAI
2006
15 years 8 months ago
Functional Value Iteration for Decision-Theoretic Planning with General Utility Functions
We study how to find plans that maximize the expected total utility for a given MDP, a planning objective that is important for decision making in high-stakes domains. The optimal...
Yaxin Liu, Sven Koenig
UAI
2004
15 years 8 months ago
Solving Factored MDPs with Continuous and Discrete Variables
Although many real-world stochastic planning problems are more naturally formulated by hybrid models with both discrete and continuous variables, current state-of-the-art methods ...
Carlos Guestrin, Milos Hauskrecht, Branislav Kveto...
FSS
2008
15 years 5 months ago
Linguistic summarization of time series using a fuzzy quantifier driven aggregation
We propose new types of linguistic summaries of time-series data that extend those proposed in our previous papers. The proposed summaries of time series refer to the summaries of...
Janusz Kacprzyk, Anna Wilbik, Slawomir Zadrozny
IPL
2010
15 years 5 months ago
Alphabetic coding with exponential costs
An alphabetic binary tree formulation applies to problems in which an outcome needs to be determined via alphabetically ordered search prior to the termination of some window of o...
Michael B. Baer