Sciweavers

2889 search results - page 489 / 578
» An empirical study of
Sort
View
TOMACS
2010
79views more  TOMACS 2010»
15 years 1 months ago
A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm
In this paper, we develop a stochastic approximation method to solve a monotone estimation problem and use this method to enhance the empirical performance of the Q-learning algor...
Sumit Kunnumkal, Huseyin Topaloglu
CORR
2011
Springer
161views Education» more  CORR 2011»
14 years 10 months ago
Doubly Robust Policy Evaluation and Learning
We study decision making in environments where the reward is only partially observed, but can be modeled as a function of an action and an observed context. This setting, known as...
Miroslav Dudík, John Langford, Lihong Li
ICASSP
2011
IEEE
14 years 10 months ago
Relevance language modeling for speech recognition
Language models for speech recognition tend to be brittle across domains, since their performance is vulnerable to changes in the genre or topic of the text on which they are trai...
Kuan-Yu Chen, Berlin Chen
ICASSP
2011
IEEE
14 years 10 months ago
Variability regularization in large-margin classification
This paper introduces a novel regularization strategy to address the generalization issues for large-margin classifiers from the Empirical Risk Minimization (ERM) perspective. Fi...
Dwi Sianto Mansjur, Ted S. Wada, Biing-Hwang Juang
CHI
2011
ACM
14 years 10 months ago
The effects of task dimensionality, endpoint deviation, throughput calculation, and experiment design on pointing measures and m
Fitts’ law (1954) characterizes pointing speed-accuracy performance as throughput, whose invariance to target distances (A) and sizes (W) is known. However, it is unknown whethe...
Jacob O. Wobbrock, Kristen Shinohara, Alex Jansen