Search Sciweavers | Sciweavers

4035 search results - page 304 / 807

» Useless Actions Are Useful

189

click to vote

PAAPP
2007

112views more PAAPP 2007»

A study on video viewing behavior: application to movie trailer miner

15 years 6 months ago

Download perso.numericable.fr

In this paper, we present a study on video viewing behavior. Based on a well-suited Markovian model, we have developed a clustering algorithm called K-Models and inspired by the K...

Sylvain Mongy

claim paper

Read More »

178

click to vote

ALDT
2009
Springer

158views Algorithms» more ALDT 2009»

Axioms for a Class of Algorithms of Sequential Decision Making

15 years 4 months ago

Download www.math.auckland.ac.nz

Abstract. We axiomatically characterise a class of algorithms for making sequential decisions in situations of complete ignorance. These algorithms assume that a decision maker (DM...

Murali Agastya, Arkadii M. Slinko

claim paper

Read More »

199

click to vote

ICMLA
2009

171views Machine Learning» more ICMLA 2009»

Multiagent Transfer Learning via Assignment-Based Decomposition

15 years 4 months ago

Download web.engr.oregonstate.edu

We describe a system that successfully transfers value function knowledge across multiple subdomains of realtime strategy games in the context of multiagent reinforcement learning....

Scott Proper, Prasad Tadepalli

claim paper

Read More »

190

click to vote

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

15 years 1 months ago

Download webee.technion.ac.il

Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

claim paper

Read More »

186

click to vote

TCAD
2010

103views more TCAD 2010»

Supervised Learning Based Power Management for Multicore Processors

15 years 1 months ago

Download atrak.usc.edu

- This paper presents a supervised learning based power management framework for a multi-processor system, where a power manager (PM) learns to predict the system performance state...

Hwisung Jung, Massoud Pedram

claim paper

Read More »

« Prev « First page 304 / 807 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers