Search Sciweavers | Sciweavers

3718 search results - page 293 / 744

» On learning with dissimilarity functions

168

click to vote

PAKDD
2004
ACM

96views Data Mining» more PAKDD 2004»

Spectral Energy Minimization for Semi-supervised Learning

16 years 1 days ago

Download www.comp.hkbu.edu.hk

The use of unlabeled data to aid classification is important as labeled data is often available in limited quantity. Instead of utilizing training samples directly into semi-super...

Chun Hung Li, Zhi-Li Wu

claim paper

Read More »

220

click to vote

ECML
2007
Springer

167views Machine Learning» more ECML 2007»

Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs

15 years 10 months ago

Download www.igi.tugraz.at

Abstract. We present a new reinforcement learning approach for deterministic continuous control problems in environments with unknown, arbitrary reward functions. The difficulty of...

Gerhard Neumann, Michael Pfeiffer, Wolfgang Maass

claim paper

Read More »

197

click to vote

CVPR
2006
IEEE

168views Computer Vision» more CVPR 2006»

Region-based Image Annotation using Asymmetrical Support Vector Machine-based Multiple-Instance Learning

15 years 10 months ago

Download www.cs.wayne.edu

In region-based image annotation, keywords are usually associated with images instead of individual regions in the training data set. This poses a major challenge for any learning...

Changbo Yang, Ming Dong, Jing Hua

claim paper

Read More »

189

click to vote

ICML
1994
IEEE

152views Machine Learning» more ICML 1994»

Markov Games as a Framework for Multi-Agent Reinforcement Learning

15 years 10 months ago

Download www.cs.rutgers.edu

In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function....

Michael L. Littman

claim paper

Read More »

208

click to vote

NIPS
2008

130views Information Technology» more NIPS 2008»

Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation

15 years 8 months ago

Download eprints.pascal-network.org

Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g.,...

Dotan Di Castro, Dmitry Volkinshtein, Ron Meir

claim paper

Read More »

« Prev « First page 293 / 744 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers