Sciweavers

ICONIP
2007
15 years 8 months ago
Blind Deconvolution of MIMO-IIR Systems: A Two-Stage EVA
This paper deals with a blind deconvolution (BD) problem for multiple-input multiple-output infinite impulse response (MIMO-IIR) systems. To solve this problem, we propose an eige...
Mitsuru Kawamoto, Yujiro Inouye, Kiyotaka Kohno
NIPS
2007
15 years 8 months ago
Random Sampling of States in Dynamic Programming
We combine three threads of research on approximate dynamic programming: sparse random sampling of states, value function and policy approximation using local models, and using lo...
Christopher G. Atkeson, Benjamin Stephens
ESANN
2006
15 years 8 months ago
Reducing policy degradation in neuro-dynamic programming
We focus on neuro-dynamic programming methods to learn state-action value functions and outline some of the inherent problems to be faced when performing reinforcement learning in...
Thomas Gabel, Martin Riedmiller
NIPS
2004
15 years 8 months ago
VDCBPI: an Approximate Scalable Algorithm for Large POMDPs
Existing algorithms for discrete partially observable Markov decision processes can at best solve problems of a few thousand states due to two important sources of intractability:...
Pascal Poupart, Craig Boutilier
AAAI
2000
15 years 8 months ago
Conceptual Indexing: Practical Large-Scale AI for Efficient Information Access
Finding information is a problem shared by people and intelligent systems. This paper describes an experiment combining both human and machine aspects in a knowledge-based system t...
William A. Woods