Sciweavers

» A Utile Function Optimizer
WCE
2007
15 years 8 months ago
An Adaptive Cross-Entropy Tuning of the PID Control for Robot Manipulators
This paper proposes a population based adaptive tuning for dynamic position control of robot manipulators. The dynamic behavior of a robot manipulator is highly nonlinear, and ...
Mehmet Bodur
CIIA
2009
15 years 8 months ago
Dynamic Scheduling in Petroleum Process using Reinforcement Learning
Petroleum industry production systems are highly automatized. In this industry, all functions (e.g., planning, scheduling and maintenance) are automated and in order to remain comp...
Nassima Aissani, Bouziane Beldjilali
ICML
2010
IEEE
15 years 8 months ago
Internal Rewards Mitigate Agent Boundedness
Reinforcement learning (RL) research typically develops algorithms for helping an RL agent best achieve its goals, however they came to be defined, while ignoring the rela...
Jonathan Sorg, Satinder P. Singh, Richard Lewis
ICML
2010
IEEE
15 years 8 months ago
Multi-Task Learning of Gaussian Graphical Models
We present multi-task structure learning for Gaussian graphical models. We discuss uniqueness and boundedness of the optimal solution of the maximization problem. A block coordina...
Jean Honorio, Dimitris Samaras
ICML
2010
IEEE
15 years 8 months ago
Feature Selection as a One-Player Game
This paper formalizes Feature Selection as a Reinforcement Learning problem, leading to a provably optimal though intractable selection policy. As a second contribution, this pape...
Romaric Gaudel, Michèle Sebag