Search Sciweavers | Sciweavers

1670 search results - page 183 / 334

» A Measure of Decision Flexibility

176

click to vote

NN
2010
Springer

125views Neural Networks» more NN 2010»

Parameter-exploring policy gradients

15 years 4 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

claim paper

Read More »

159

click to vote

TASLP
2010

98views more TASLP 2010»

Trellis-Based Approaches to Rate-Distortion Optimized Audio Encoding

15 years 4 months ago

Download www.uweb.ucsb.edu

—Many important audio coding applications, such as streaming and playback of stored audio, involve ofﬂine compression. In such scenarios, encoding delays no longer represent a ...

Vinay Melkote, Kenneth Rose

claim paper

Read More »

195

click to vote

MAGS
2010

153views more MAGS 2010»

Designing bidding strategies in sequential auctions for risk averse agents

15 years 1 months ago

Download users.ecs.soton.ac.uk

Designing efficient bidding strategies for sequential auctions remains an important, open problem area in agent-mediated electronic markets. In existing literature, a variety of bi...

Valentin Robu, Han La Poutré

claim paper

Read More »

233

click to vote

SIGSOFT
2010
ACM

148views Software Engineering» more SIGSOFT 2010»

Evolution of a bluetooth test application product line: a case study

15 years 1 months ago

Download apollo.smu.edu.sg

In this paper, we study the decision making process involved in the five year lifecycle of a Bluetooth software product produced by a large, multi-national test and measurement fi...

Narayan Ramasubbu, Rajesh Krishna Balan

claim paper

Read More »

237

click to vote

AAAI
2011

246views Intelligent Agents» more AAAI 2011»

An Online Spectral Learning Algorithm for Partially Observable Nonlinear Dynamical Systems

14 years 6 months ago

Download www.cs.cmu.edu

Recently, a number of researchers have proposed spectral algorithms for learning models of dynamical systems—for example, Hidden Markov Models (HMMs), Partially Observable Marko...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

« Prev « First page 183 / 334 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers