Sciweavers

1670 search results - page 183 / 334
» A Measure of Decision Flexibility
Sort
View
NN
2010
Springer
125views Neural Networks» more  NN 2010»
15 years 4 months ago
Parameter-exploring policy gradients
We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...
Frank Sehnke, Christian Osendorfer, Thomas Rü...
TASLP
2010
98views more  TASLP 2010»
15 years 4 months ago
Trellis-Based Approaches to Rate-Distortion Optimized Audio Encoding
—Many important audio coding applications, such as streaming and playback of stored audio, involve offline compression. In such scenarios, encoding delays no longer represent a ...
Vinay Melkote, Kenneth Rose
MAGS
2010
153views more  MAGS 2010»
15 years 1 months ago
Designing bidding strategies in sequential auctions for risk averse agents
Designing efficient bidding strategies for sequential auctions remains an important, open problem area in agent-mediated electronic markets. In existing literature, a variety of bi...
Valentin Robu, Han La Poutré
SIGSOFT
2010
ACM
15 years 1 months ago
Evolution of a bluetooth test application product line: a case study
In this paper, we study the decision making process involved in the five year lifecycle of a Bluetooth software product produced by a large, multi-national test and measurement fi...
Narayan Ramasubbu, Rajesh Krishna Balan
AAAI
2011
14 years 6 months ago
An Online Spectral Learning Algorithm for Partially Observable Nonlinear Dynamical Systems
Recently, a number of researchers have proposed spectral algorithms for learning models of dynamical systems—for example, Hidden Markov Models (HMMs), Partially Observable Marko...
Byron Boots, Geoffrey J. Gordon