Sciweavers

7928 search results - page 388 / 1586
» Human-Like Learning Methods for a
NN (Neural Networks)
2010
Springer
15 years 5 months ago
Parameter-exploring policy gradients
We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...
Frank Sehnke, Christian Osendorfer, Thomas Rü...
TON
2010
15 years 1 month ago
A Machine Learning Approach to TCP Throughput Prediction
TCP throughput prediction is an important capability in wide area overlay and multi-homed networks where multiple paths may exist between data sources and receivers. In this paper...
Mariyam Mirza, Joel Sommers, Paul Barford, Xiaojin...
ECCV
2008
Springer
16 years 8 months ago
Output Regularized Metric Learning with Side Information
Distance metric learning has been widely investigated in machine learning and information retrieval. In this paper, we study a particular content-based image retrieval application ...
Wei Liu, Steven C. H. Hoi, Jianzhuang Liu
ICML
2007
IEEE
16 years 7 months ago
Multi-task learning for sequential data via iHMMs and the nested Dirichlet process
A new hierarchical nonparametric Bayesian model is proposed for the problem of multitask learning (MTL) with sequential data. Sequential data are typically modeled with a hidden M...
Kai Ni, Lawrence Carin, David B. Dunson
ICML
2006
IEEE
16 years 7 months ago
Active learning via transductive experimental design
This paper considers the problem of selecting the most informative experiments x to get measurements y for learning a regression model y = f(x). We propose a novel and simple conc...
Kai Yu, Jinbo Bi, Volker Tresp