Sciweavers

On-line Algorithms in Machine Learning
ECML
2007
Springer
16 years 22 days ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber
ECML
2006
Springer
15 years 10 months ago
Combinatorial Markov Random Fields
A combinatorial random variable is a discrete random variable defined over a combinatorial set (e.g., a power set of a given set). In this paper we introduce combinatoria...
Ron Bekkerman, Mehran Sahami, Erik G. Learned-Mill...
ALT
2010
Springer
15 years 8 months ago
Approximation Stability and Boosting
Stability has been explored to study the performance of learning algorithms in recent years and it has been shown that stability is sufficient for generalization and is sufficient ...
Wei Gao, Zhi-Hua Zhou
TNN
2008
15 years 6 months ago
IMORL: Incremental Multiple-Object Recognition and Localization
This paper proposes an incremental multiple-object recognition and localization (IMORL) method. The objective of IMORL is to adaptively learn multiple interesting objects in an ima...
Haibo He, Sheng Chen
ICDM
2007
IEEE
16 years 26 days ago
Experimental Comparison of Feature Subset Selection Methods
In the field of machine learning and pattern recognition, feature subset selection is an important area, where many approaches have been proposed. In this paper, we choose some fe...
Chulmin Yun, Jihoon Yang