Search Sciweavers | Sciweavers

2900 search results - page 130 / 580

» On the Convergence of Immune Algorithms

169

click to vote

CRYPTO
2012
Springer

219views Cryptology» more CRYPTO 2012»

Tamper and Leakage Resilience in the Split-State Model

13 years 8 months ago

Download eprint.iacr.org

It is notoriously diﬃcult to create hardware that is immune from side channel and tampering attacks. A lot of recent literature, therefore, has instead considered algorithmic de...

Feng-Hao Liu, Anna Lysyanskaya

claim paper

Read More »

159

click to vote

COLT
2004
Springer

99views Machine Learning» more COLT 2004»

Reinforcement Learning for Average Reward Zero-Sum Games

15 years 11 months ago

Download www.ece.mcgill.ca

Abstract. We consider Reinforcement Learning for average reward zerosum stochastic games. We present and analyze two algorithms. The ﬁrst is based on relative Q-learning and the ...

Shie Mannor

claim paper

Read More »

138

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 10 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

149

click to vote

NIPS
2000

150views Information Technology» more NIPS 2000»

Programmable Reinforcement Learning Agents

15 years 7 months ago

Download reference.kfupm.edu.sa

We present an expressive agent design language for reinforcement learning that allows the user to constrain the policies considered by the learning process.The language includes s...

David Andre, Stuart J. Russell

claim paper

Read More »

177

click to vote

NIPS
2000

161views Information Technology» more NIPS 2000»

From Margin to Sparsity

15 years 7 months ago

Download users.cecs.anu.edu.au

We present an improvement of Noviko 's perceptron convergence theorem. Reinterpreting this mistakebound as a margindependent sparsity guarantee allows us to give a PAC{style ...

Thore Graepel, Ralf Herbrich, Robert C. Williamson

claim paper

Read More »

« Prev « First page 130 / 580 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers