Sciweavers

829 search results - page 63 / 166
» A time aggregation approach to Markov decision processes
Sort
View
ECML
2007
Springer
15 years 8 months ago
Sequence Labeling with Reinforcement Learning and Ranking Algorithms
Many problems in areas such as Natural Language Processing, Information Retrieval, or Bioinformatic involve the generic task of sequence labeling. In many cases, the aim is to assi...
Francis Maes, Ludovic Denoyer, Patrick Gallinari
CCS
2010
ACM
15 years 6 months ago
Dialog-based payload aggregation for intrusion detection
Network-based Intrusion Detection Systems (IDSs) such as Snort or Bro that have to analyze the packet payload for all the received data show severe performance problems if used in...
Tobias Limmer, Falko Dressler
ICML
1995
IEEE
16 years 7 months ago
Learning Policies for Partially Observable Environments: Scaling Up
Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...
Michael L. Littman, Anthony R. Cassandra, Leslie P...
WSC
2007
15 years 8 months ago
A toolbox for simulation-based optimization of supply chains
In this paper we present a general framework for simulating and optimizing the operational decisions in a supply chain network. We developed a supply chain network library for the...
Christian Almeder, Margaretha Preusser
AAMAS
2011
Springer
15 years 1 months ago
Optimizing coalition formation for tasks with dynamically evolving rewards and nondeterministic action effects
We consider a problem domain where coalitions of agents are formed in order to execute tasks. Each task is assigned at most one coalition of agents, and the coalition can be reorg...
Majid Ali Khan, Damla Turgut, Ladislau Böl&ou...