Sciweavers

82 search results - page 7 / 17
» Balancing Exploration and Exploitation in Learning to Rank O...
Sort
View
CEC
2009
IEEE
16 years 24 days ago
On-line neuroevolution applied to The Open Racing Car Simulator
— The application of on-line learning techniques to modern computer games is a promising research direction. In fact, they can be used to improve the game experience and to achie...
Luigi Cardamone, Daniele Loiacono, Pier Luca Lanzi
GECCO
2008
Springer
137views Optimization» more  GECCO 2008»
15 years 7 months ago
Rank based variation operators for genetic algorithms
We show how and why using genetic operators that are applied with probabilities that depend on the fitness rank of a genotype or phenotype offers a robust alternative to the Sim...
Jorge Cervantes, Christopher R. Stephens
SMC
2007
IEEE
102views Control Systems» more  SMC 2007»
16 years 9 days ago
An improved immune Q-learning algorithm
—Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance betw...
Zhengqiao Ji, Q. M. Jonathan Wu, Maher A. Sid-Ahme...
WSDM
2012
ACM
267views Data Mining» more  WSDM 2012»
14 years 1 months ago
Learning to rank with multi-aspect relevance for vertical search
Many vertical search tasks such as local search focus on specific domains. The meaning of relevance in these verticals is domain-specific and usually consists of multiple well-d...
Changsung Kang, Xuanhui Wang, Yi Chang, Belle L. T...
KDD
2009
ACM
227views Data Mining» more  KDD 2009»
16 years 6 months ago
Efficiently learning the accuracy of labeling sources for selective sampling
Many scalable data mining tasks rely on active learning to provide the most useful accurately labeled instances. However, what if there are multiple labeling sources (`oracles...
Pinar Donmez, Jaime G. Carbonell, Jeff Schneider