In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
Improving on recent work on joint source-filter analysis of speech waveforms, we explore improvements to an autoregressive model with exogenous inputs represented by flexible ba...
Compressive sensing and processing of radar waveforms enables high-resolution tracking while using low sampling rates and inexpensive processing. Compressive processing, however, ...
Nonparametric belief propagation (NBP) is a well-known particlebased method for distributed inference in wireless networks. NBP has a large number of applications, including coope...
Vladimir Savic, Henk Wymeersch, Federico Penna, Sa...
Combination of prior knowledge and implicit knowledge hidden in the data of system can enhance the quality of information services in Knowledge Grid. Fuzzy Cognitive Maps (FCMs) ar...