Sciweavers

4544 search results - page 606 / 909
» Reinforcement Learning with Time
Sort
View
CORR
2010
Springer
127views Education» more  CORR 2010»
15 years 6 months ago
Online Algorithms for the Multi-Armed Bandit Problem with Markovian Rewards
We consider the classical multi-armed bandit problem with Markovian rewards. When played an arm changes its state in a Markovian fashion while it remains frozen when not played. Th...
Cem Tekin, Mingyan Liu
COGSCI
2006
75views more  COGSCI 2006»
15 years 6 months ago
A Hierarchical Bayesian Model of Human Decision-Making on an Optimal Stopping Problem
We consider human performance on an optimal stopping problem where people are presented with a list of numbers independently chosen from a uniform distribution. People are told ho...
Michael D. Lee
PPL
2008
75views more  PPL 2008»
15 years 6 months ago
Modeling the Performance of Communication Schemes on Network Topologies
This paper investigates the influence of the interconnection network topology of a parallel system on the delivery time of an ensemble of messages, called the communication scheme...
Jan Lemeire, Erik F. Dirkx, Walter Colitti
IDA
2002
Springer
15 years 6 months ago
Online classification of nonstationary data streams
Most classification methods are based on the assumption that the data conforms to a stationary distribution. However, the real-world data is usually collected over certain periods...
Mark Last
HCI
2009
15 years 4 months ago
Studying Reactive, Risky, Complex, Long-Spanning, and Collaborative Work: The Case of IT Service Delivery
Abstract. IT service delivery is challenging to study. It is characterized by interacting systems of technology, people, and organizations. The work is sometimes reactive, sometime...
Eser Kandogan, Eben M. Haber, John H. Bailey, Paul...