Sciweavers

7930 search results - page 1195 / 1586
» Greedy in Approximation Algorithms
Sort
View
JMLR
2010
125views more  JMLR 2010»
15 years 1 months ago
Variational methods for Reinforcement Learning
We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...
Thomas Furmston, David Barber
JMLR
2010
101views more  JMLR 2010»
15 years 1 months ago
Exploiting Feature Covariance in High-Dimensional Online Learning
Some online algorithms for linear classification model the uncertainty in their weights over the course of learning. Modeling the full covariance structure of the weights can prov...
Justin Ma, Alex Kulesza, Mark Dredze, Koby Crammer...
TASE
2010
IEEE
15 years 1 months ago
Sensor Placement for Triangulation-Based Localization
Robots operating in a workspace can localize themselves by querying nodes of a sensor-network deployed in the same workspace. This paper addresses the problem of computing the min...
Onur Tekdas, Volkan Isler
TOC
2010
127views Management» more  TOC 2010»
15 years 1 months ago
The Submodular Welfare Problem with Demand Queries
: We consider the Submodular Welfare Problem where we have m items and n players with given utility functions wi : 2[m] R+. The utility functions are assumed to be monotone and su...
Uriel Feige, Jan Vondrák
TSP
2010
15 years 1 months ago
Adaptive design of OFDM radar signal with improved wideband ambiguity function
We propose an adaptive technique to design the spectrum of an orthogonal frequency division multiplexing (OFDM) waveform to improve the radar's wideband ambiguity function (WA...
Satyabrata Sen, Arye Nehorai
« Prev « First page 1195 / 1586 Last » Next »