Sciweavers

» Useless Actions Are Useful
PAAPP
2007
15 years 6 months ago
A study on video viewing behavior: application to movie trailer miner
In this paper, we present a study on video viewing behavior. Based on a well-suited Markovian model, we have developed a clustering algorithm called K-Models, inspired by the K...
Sylvain Mongy
ALDT
2009
Springer
15 years 4 months ago
Axioms for a Class of Algorithms of Sequential Decision Making
We axiomatically characterise a class of algorithms for making sequential decisions in situations of complete ignorance. These algorithms assume that a decision maker (DM...
Murali Agastya, Arkadii M. Slinko
ICMLA
2009
15 years 4 months ago
Multiagent Transfer Learning via Assignment-Based Decomposition
We describe a system that successfully transfers value function knowledge across multiple subdomains of real-time strategy games in the context of multiagent reinforcement learning....
Scott Proper, Prasad Tadepalli
CDC
2010
IEEE
15 years 1 months ago
Adaptive bases for Q-learning
We consider reinforcement learning, and in particular the Q-learning algorithm, in large state and action spaces. To cope with the size of the spaces, a functio...
Dotan Di Castro, Shie Mannor
TCAD
2010
15 years 1 months ago
Supervised Learning Based Power Management for Multicore Processors
This paper presents a supervised learning based power management framework for a multi-processor system, where a power manager (PM) learns to predict the system performance state...
Hwisung Jung, Massoud Pedram