Sciweavers

744 search results - page 86 / 149
» Observations on the Decidability of Transitions
Sort
View
ICMAS
2000
15 years 7 months ago
Evolutionary On-line Learning of Cooperative Behavior with Situation-Action-Pairs
We present a concept to use off-line learning approaches to achieve on-line learning of cooperative behavior of agents and instantiate this concept for evolutionary learning with ...
Jörg Denzinger, Michael Kordt
ICML
2010
IEEE
15 years 7 months ago
Generalizing Apprenticeship Learning across Hypothesis Classes
This paper develops a generalized apprenticeship learning protocol for reinforcementlearning agents with access to a teacher who provides policy traces (transition and reward obse...
Thomas J. Walsh, Kaushik Subramanian, Michael L. L...
JAIR
2010
108views more  JAIR 2010»
15 years 4 months ago
Kalman Temporal Differences
This paper deals with value (and Q-) function approximation in deterministic Markovian decision processes (MDPs). A general statistical framework based on the Kalman filtering pa...
Matthieu Geist, Olivier Pietquin
INTERSPEECH
2010
15 years 29 days ago
Efficient HMM-based estimation of missing features, with applications to packet loss concealment
In this paper, we present efficient HMM-based techniques for estimating missing features. By assuming speech features to be observations of hidden Markov processes, we derive a mi...
Bengt J. Borgström, Per Henrik Borgström...
IPL
2007
105views more  IPL 2007»
15 years 6 months ago
A new algorithm for testing if a regular language is locally threshold testable
A new algorithm is presented for testing if a regular language is locally threshold testable. The new algorithm is slower than existing algorithms, but its correctness proof is sh...
Mikolaj Bojanczyk