Sciweavers

4544 search results - page 565 / 909
» Reinforcement Learning with Time
Sort
View
AAAI
2007
15 years 9 months ago
Combining Multiple Heuristics Online
We present black-box techniques for learning how to interleave the execution of multiple heuristics in order to improve average-case performance. In our model, a user is given a s...
Matthew J. Streeter, Daniel Golovin, Stephen F. Sm...
EWRL
2008
15 years 8 months ago
Optimistic Planning of Deterministic Systems
If one possesses a model of a controlled deterministic system, then from any state, one may consider the set of all possible reachable states starting from that state and using any...
Jean-François Hren, Rémi Munos
IJSEKE
2007
120views more  IJSEKE 2007»
15 years 6 months ago
User Profiling in the Chronobot/Virtual Classroom System
The Chronobot/Virtual Classroom (CVC) system is a novel time knowledge exchange platform where any pair of users can exchange their time and knowledge. User profile that contains ...
Xin Li, Shi-Kuo Chang
ICMLA
2009
15 years 4 months ago
Exact Graph Structure Estimation with Degree Priors
We describe a generative model for graph edges under specific degree distributions which admits an exact and efficient inference method for recovering the most likely structure. T...
Bert Huang, Tony Jebara
ICALT
2005
IEEE
16 years 11 days ago
ActiveTutor
In this paper we present an architecture dedicated to an intelligently assisted educational tool which integrates within a unified framework software rational agents both at the m...
Jean Pierre Fournier