Sciweavers

4544 search results - page 575 / 909
» Reinforcement Learning with Time
Sort
View
COLT
2008
Springer
15 years 8 months ago
Combining Expert Advice Efficiently
We show how models for prediction with expert advice can be defined concisely and clearly using hidden Markov models (HMMs); standard HMM algorithms can then be used to efficientl...
Wouter M. Koolen, Steven de Rooij
159
Voted
COLT
2008
Springer
15 years 8 months ago
Regret Bounds for Sleeping Experts and Bandits
We study on-line decision problems where the set of actions that are available to the decision algorithm vary over time. With a few notable exceptions, such problems remained larg...
Robert D. Kleinberg, Alexandru Niculescu-Mizil, Yo...
ISTA
2007
15 years 8 months ago
Game Theory-based Data Mining Technique for Strategy Making of a Soccer Simulation Coach Agent
: Soccer simulation is an effort to motivate researchers to perform artificial and robotic intelligence investigations in a multi-agent system framework. In this paper, we propose ...
Amin Milani Fard, Vahid Salmani, Mahmoud Naghibzad...
NIPS
2007
15 years 8 months ago
A Risk Minimization Principle for a Class of Parzen Estimators
This paper1 explores the use of a Maximal Average Margin (MAM) optimality principle for the design of learning algorithms. It is shown that the application of this risk minimizati...
Kristiaan Pelckmans, Johan A. K. Suykens, Bart De ...
CSREAEEE
2006
99views Business» more  CSREAEEE 2006»
15 years 8 months ago
Teaching Web Applications Development in a Fully Online Environment: Challenges, Approaches and Implementation
: This paper examines the re-development of university level web programming unit for delivery in a fully online mode. The unit, which teaches advanced xhtml, javascript, php and m...
Justin Brown