Sciweavers

Observation Can Be as Effective as Action in Problem Solving
ICML
2006
IEEE
16 years 7 months ago
Relational temporal difference learning
We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...
Nima Asgharbeygi, David J. Stracuzzi, Pat Langley
AAAI
2006
15 years 7 months ago
Hard Constrained Semi-Markov Decision Processes
In multiple criteria Markov Decision Processes (MDP) where multiple costs are incurred at every decision point, current methods solve them by minimising the expected primary cost ...
Wai-Leong Yeow, Chen-Khong Tham, Wai-Choong Wong
IEAAIE
1998
Springer
15 years 10 months ago
Generating Heuristics to Control Configuration Processes
Abstract. Configuration is the process of composing a system from a set of components such that the system fulfills a set of desired demands. The configuration process relies on a ...
Benno Stein
KDD
2010
ACM
15 years 4 months ago
Finding effectors in social networks
Assume a network (V, E) where a subset of the nodes in V are active. We consider the problem of selecting a set of k active nodes that best explain the observed activation state, ...
Theodoros Lappas, Evimaria Terzi, Dimitrios Gunopu...
ICRA
1995
IEEE
15 years 10 months ago
Learning Impedance Control for Robotic Manipulators
Learning control is a concept for controlling dynamic systems in an iterative manner. It arises from the recognition that robotic manipulators are usually used to perform repeti...
Chien-Chern Cheah, Danwei Wang