Search Sciweavers | Sciweavers

» Progressive Optimization in Action

SODA
2010
ACM

371 views, Algorithms

Online Learning with Queries

16 years 3 months ago

Download siam.org

The online learning problem requires a player to iteratively choose an action in an unknown and changing environment. In the standard setting of this problem, the player has to ch...

Chao-Kai Chiang, Chi-Jen Lu

ATVA
2004
Springer

146 views, Hardware

A Global Timed Bisimulation Preserving Abstraction for Parametric Time-Interval Automata

15 years 11 months ago

Download www-higashi.ist.osaka-u.ac.jp

Timed Bisimulation Preserving Abstraction for Parametric Time-Interval Automata Akio Nakata, Tadaaki Tanimoto, Suguru Sasaki, Teruo Higashino Department of Information Networking, ...

Tadaaki Tanimoto, Suguru Sasaki, Akio Nakata, Teru...

ATAL
2006
Springer

127 views, Intelligent Agents

Learning to commit in repeated games

15 years 10 months ago

Download staff.science.uva.nl

Learning to converge to an efficient, i.e., Pareto-optimal Nash equilibrium of the repeated game is an open problem in multiagent learning. Our goal is to facilitate the learning ...

Stéphane Airiau, Sandip Sen

AAAI
2008

214 views, Intelligent Agents

Computational Influence for Training and Entertainment

15 years 8 months ago

Download www.aaai.org

2) a set of abstract drama manager; 3) a model of player response to drama manager actions; and 4) an author-specified evaluation function. The drama manager's task is to sele...

David L. Roberts

JACM
2006

93 views

Combining expert advice in reactive environments

15 years 6 months ago

Download web.mit.edu

"Experts algorithms" constitute a methodology for choosing actions repeatedly, when the rewards depend both on the choice of action and on the unknown current state of t...

Daniela Pucci de Farias, Nimrod Megiddo
