Sciweavers

3934 search results - page 103 / 787
» Approximate Schedulability Analysis
Sort
View
ML
2002
ACM
154views Machine Learning» more  ML 2002»
15 years 6 months ago
Technical Update: Least-Squares Temporal Difference Learning
TD() is a popular family of algorithms for approximate policy evaluation in large MDPs. TD() works by incrementally updating the value function after each observed transition. It h...
Justin A. Boyan