Search Sciweavers | Sciweavers

829 search results - page 49 / 166

» A time aggregation approach to Markov decision processes

194

click to vote

SEDE
2007

109views Software Engineering» more SEDE 2007»

A framework for constraint checking involving aggregates for multiple XML databases using schematron

15 years 7 months ago

Download www.mscs.mu.edu

Many internet and enterprise applications now not only use XML (eXtensible Markup Language) as a medium for communication but also for storing their data either temporarily for an...

Albin Laga, Praveen Madiraju

claim paper

Read More »

164

click to vote

ICML
2001
IEEE

172views Machine Learning» more ICML 2001»

Continuous-Time Hierarchical Reinforcement Learning

16 years 7 months ago

Download www.cs.ualberta.ca

Hierarchical reinforcement learning (RL) is a general framework which studies how to exploit the structure of actions and tasks to accelerate policy learning in large domains. Pri...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

190

click to vote

GLOBECOM
2010
IEEE

156views Communications» more GLOBECOM 2010»

Admission Control and Channel Allocation for Supporting Real-Time Applications in Cognitive Radio Networks

15 years 4 months ago

Download ncel.ie.cuhk.edu.hk

Abstract--Proper admission control in cognitive radio networks is critical in providing QoS guarantees to secondary unlicensed users. In this paper, we study the admission control ...

Feng Wang, Junhua Zhu, Jianwei Huang, Yuping Zhao

claim paper

Read More »

154

click to vote

ECAI
2006
Springer

141views Artificial Intelligence» more ECAI 2006»

Decision with Uncertainties, Feasibilities, and Utilities: Towards a Unified Algebraic Framework

15 years 9 months ago

Download www.inra.fr

Several formalisms exist to express and solve decision problems. Each is designed to capture different kinds of knowledge: utilities expressing preferences, uncertainties on the en...

Cédric Pralet, Gérard Verfaillie, Th...

claim paper

Read More »

142

click to vote

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

16 years 7 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

« Prev « First page 49 / 166 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers