Sciweavers

3381 search results - page 346 / 677
» LEO - DB2's LEarning Optimizer
Sort
View
PKDD
2010
Springer
179views Data Mining» more  PKDD 2010»
15 years 4 months ago
Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration
Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...
Tobias Jung, Peter Stone
AIIDE
2009
15 years 4 months ago
IMPLANT: An Integrated MDP and POMDP Learning AgeNT for Adaptive Games
This paper proposes an Integrated MDP and POMDP Learning AgeNT (IMPLANT) architecture for adaptation in modern games. The modern game world basically involves a human player actin...
Chek Tien Tan, Ho-Lun Cheng
CC
2010
Springer
120views System Software» more  CC 2010»
15 years 4 months ago
Lower Bounds for Agnostic Learning via Approximate Rank
We prove that the concept class of disjunctions cannot be pointwise approximated by linear combinations of any small set of arbitrary real-valued functions. That is, suppose that t...
Adam R. Klivans, Alexander A. Sherstov
CORR
2010
Springer
110views Education» more  CORR 2010»
15 years 4 months ago
Learning Multi-modal Similarity
In many applications involving multi-media data, the definition of similarity between items is integral to several key tasks, including nearest-neighbor retrieval, classification,...
Brian McFee, Gert R. G. Lanckriet
ISF
2010
164views more  ISF 2010»
15 years 4 months ago
An SVM-based machine learning method for accurate internet traffic classification
Accurate and timely traffic classification is critical in network security monitoring and traffic engineering. Traditional methods based on port numbers and protocols have proven t...
Ruixi Yuan, Zhu Li, Xiaohong Guan, Li Xu