Sciweavers

» A time aggregation approach to Markov decision processes
NIPS
2007
15 years 7 months ago
Online Linear Regression and Its Application to Model-Based Reinforcement Learning
We provide a provably efficient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Specifically, we take a mo...
Alexander L. Strehl, Michael L. Littman
ATAL
2006
Springer
15 years 10 months ago
Winning back the CUP for distributed POMDPs: planning over continuous belief spaces
Distributed Partially Observable Markov Decision Problems (Distributed POMDPs) are evolving as a popular approach for modeling multiagent systems, and many different algorithms ha...
Pradeep Varakantham, Ranjit Nair, Milind Tambe, Ma...
SAC
2004
ACM
15 years 11 months ago
A decision-theoretic approach for designing proactive communication in multi-agent teamwork
Techniques that support effective communication during teamwork processes are of particular importance. Psychological study shows that an effective team often can anticipate infor...
Yu Zhang, Richard A. Volz, Thomas R. Ioerger, John...
ICA
2010
Springer
15 years 4 months ago
Hybrid Channel Estimation Strategy for MIMO Systems with Decision Feedback Equalizer
We propose combining supervised and unsupervised algorithms in order to improve the performance of multiple-input multiple-output digital communication systems which make use of decision-...
Héctor J. Pérez-Iglesias, Adriana Da...
PAKDD
2009
ACM
16 years 29 days ago
Tree-Based Method for Classifying Websites Using Extended Hidden Markov Models
One important problem proposed recently in the field of web mining is the website classification problem. The complexity together with the necessity to have accurate and fast algorit...
Majid Yazdani, Milad Eftekhar, Hassan Abolhassani