Search Sciweavers | Sciweavers

2914 search results - page 227 / 583

» Optimization by Stochastic Continuation

197

click to vote

AAAI
2007

102views Intelligent Agents» more AAAI 2007»

Thresholded Rewards: Acting Optimally in Timed, Zero-Sum Games

15 years 9 months ago

Download www.cs.cmu.edu

In timed, zero-sum games, the goal is to maximize the probability of winning, which is not necessarily the same as maximizing our expected reward. We consider cumulative intermedi...

Colin McMillen, Manuela M. Veloso

claim paper

Read More »

174

click to vote

AAAI
1996

133views Intelligent Agents» more AAAI 1996»

An Optimal Contracting Strategy in a Digital Library

15 years 7 months ago

Download www.aaai.org

Agents can benefit from contracting some of their tasks that cannot be performedby themselves or that can be performed moreefficiently by other agents. Developing an agent's ...

Sunju Park, Edmund H. Durfee

claim paper

Read More »

177

click to vote

ML
2002
ACM

143views Machine Learning» more ML 2002»

A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes

15 years 6 months ago

Download www.cis.upenn.edu

An issue that is critical for the application of Markov decision processes MDPs to realistic problems is how the complexity of planning scales with the size of the MDP. In stochas...

Michael J. Kearns, Yishay Mansour, Andrew Y. Ng

claim paper

Read More »

183

click to vote

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 4 months ago

Download www.colt2010.org

Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

claim paper

Read More »

179

click to vote

INFOCOM
2010
IEEE

163views Communications» more INFOCOM 2010»

Surfing the Blogosphere: Optimal Personalized Strategies for Searching the Web

15 years 4 months ago

Download www.cs.toronto.edu

We propose a distributed mechanism for finding websurfing strategies that is inspired by the StumbleUpon recommendation engine. Each day, a websurfer visits a sequence of websites ...

Stratis Ioannidis, Laurent Massoulié

claim paper

Read More »

« Prev « First page 227 / 583 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers