Search Sciweavers | Sciweavers

2467 search results - page 268 / 494

» Finite State Machines

182

click to vote

ML
2002
ACM

143views Machine Learning» more ML 2002»

A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes

15 years 6 months ago

Download www.cis.upenn.edu

An issue that is critical for the application of Markov decision processes MDPs to realistic problems is how the complexity of planning scales with the size of the MDP. In stochas...

Michael J. Kearns, Yishay Mansour, Andrew Y. Ng

claim paper

Read More »

159

click to vote

ALGORITHMICA
2002

84views more ALGORITHMICA 2002»

On-Line Multi-Threaded Paging

15 years 6 months ago

Download www-2.dc.uba.ar

In this paper we introduce a generalization of Paging to the case where there are many threads of requests. This models situations in which the requests come from more than one ind...

Esteban Feuerstein, Alejandro Strejilevich de Loma

claim paper

Read More »

182

click to vote

ICML
2008
IEEE

123views Machine Learning» more ICML 2008»

An object-oriented representation for efficient reinforcement learning

16 years 7 months ago

Download paul.rutgers.edu

Rich representations in reinforcement learning have been studied for the purpose of enabling generalization and making learning feasible in large state spaces. We introduce Object...

Carlos Diuk, Andre Cohen, Michael L. Littman

claim paper

Read More »

149

click to vote

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

16 years 7 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

198

click to vote

ICML
2006
IEEE

187views Machine Learning» more ICML 2006»

Dynamic topic models

16 years 7 months ago

Download www.cs.cmu.edu

A family of probabilistic time series models is developed to analyze the time evolution of topics in large document collections. The approach is to use state space models on the n...

David M. Blei, John D. Lafferty

claim paper

Read More »

« Prev « First page 268 / 494 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers