Search Sciweavers | Sciweavers

TOMACS
2010

79 views

A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

15 years 1 month ago

In this paper, we develop a stochastic approximation method to solve a monotone estimation problem and use this method to enhance the empirical performance of the Q-learning algor...

Sumit Kunnumkal, Huseyin Topaloglu

CORR
2011
Springer

161 views, Education

Doubly Robust Policy Evaluation and Learning

14 years 10 months ago

Download www.icml-2011.org

We study decision making in environments where the reward is only partially observed, but can be modeled as a function of an action and an observed context. This setting, known as...

Miroslav Dudík, John Langford, Lihong Li

ICASSP
2011
IEEE

239 views, Signal Processing

Relevance language modeling for speech recognition

14 years 10 months ago

Download mirlab.org

Language models for speech recognition tend to be brittle across domains, since their performance is vulnerable to changes in the genre or topic of the text on which they are trai...

Kuan-Yu Chen, Berlin Chen

ICASSP
2011
IEEE

85 views, Signal Processing

Variability regularization in large-margin classification

14 years 10 months ago

Download mirlab.org

This paper introduces a novel regularization strategy to address the generalization issues for large-margin classifiers from the Empirical Risk Minimization (ERM) perspective. Fi...

Dwi Sianto Mansjur, Ted S. Wada, Biing-Hwang Juang

CHI
2011
ACM

196 views, Human Computer Interaction

The effects of task dimensionality, endpoint deviation, throughput calculation, and experiment design on pointing measures and m

14 years 10 months ago

Download faculty.washington.edu

Fitts’ law (1954) characterizes pointing speed-accuracy performance as throughput, whose invariance to target distances (A) and sizes (W) is known. However, it is unknown whethe...

Jacob O. Wobbrock, Kristen Shinohara, Alex Jansen
