Sciweavers
46 search results - page 10 / 10
Breaking All Value Symmetries in Surjection Problems
ICML 2001 | IEEE | Machine Learning | 185 views
Off-Policy Temporal Difference Learning with Function Approximation
16 years 6 months ago
Download: www.cs.ualberta.ca
We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta