Search Sciweavers | Sciweavers

3643 search results - page 204 / 729

» Learning Submodular Functions

157

click to vote

APPROX
2008
Springer

101views Algorithms» more APPROX 2008»

Learning Random Monotone DNF

15 years 8 months ago

Download www1.cs.columbia.edu

We give an algorithm that with high probability properly learns random monotone DNF with t(n) terms of length log t(n) under the uniform distribution on the Boolean cube {0, 1}n ....

Jeffrey C. Jackson, Homin K. Lee, Rocco A. Servedi...

claim paper

Read More »

214

click to vote

IJCAI
2007

179views Artificial Intelligence» more IJCAI 2007»

Heuristic Selection of Actions in Multiagent Reinforcement Learning

15 years 8 months ago

Download www.ijcai.org

This work presents a new algorithm, called Heuristically Accelerated Minimax-Q (HAMMQ), that allows the use of heuristics to speed up the wellknown Multiagent Reinforcement Learni...

Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...

claim paper

Read More »

175

click to vote

NIPS
2008

126views Information Technology» more NIPS 2008»

Multi-task Gaussian Process Learning of Robot Inverse Dynamics

15 years 8 months ago

Download www.sensopac.org

The inverse dynamics problem for a robotic manipulator is to compute the torques needed at the joints to drive it along a given trajectory; it is beneficial to be able to learn th...

Kian Ming Adam Chai, Christopher K. I. Williams, S...

claim paper

Read More »

186

click to vote

NIPS
2003

118views Information Technology» more NIPS 2003»

Online Learning via Global Feedback for Phrase Recognition

15 years 7 months ago

Download books.nips.cc

We present a system to recognize phrases based on perceptrons, and a global online learning algorithm to train them together. The recognition strategy applies learning in two laye...

Xavier Carreras, Lluís Màrquez

claim paper

Read More »

173

click to vote

NIPS
2003

105views Information Technology» more NIPS 2003»

Gaussian Processes in Reinforcement Learning

15 years 7 months ago

Download books.nips.cc

We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learning in continuous state spaces and discrete time. We demonstrate how the GP mod...

Carl Edward Rasmussen, Malte Kuss

claim paper

Read More »

« Prev « First page 204 / 729 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers