Sciweavers

4457 search results - page 270 / 892
» Tests of Machine Intelligence
Sort
View
EWRL
2008
15 years 8 months ago
Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case
We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...
Kirill Dyagilev, Shie Mannor, Nahum Shimkin
ALENEX
2008
133views Algorithms» more  ALENEX 2008»
15 years 8 months ago
Comparing Online Learning Algorithms to Stochastic Approaches for the Multi-Period Newsvendor Problem
The multi-period newsvendor problem describes the dilemma of a newspaper salesman--how many paper should he purchase each day to resell, when he doesn't know the demand? We d...
Shawn O'Neil, Amitabh Chaudhary
ICML
2010
IEEE
15 years 7 months ago
A Conditional Random Field for Multiple-Instance Learning
We present MI-CRF, a conditional random field (CRF) model for multiple instance learning (MIL). MI-CRF models bags as nodes in a CRF with instances as their states. It combines di...
Thomas Deselaers, Vittorio Ferrari
ATAL
2006
Springer
15 years 10 months ago
ACTIVE: agile coordinator testbed integrated virtual environment
This paper describes the specification, design and development of ACTIVE, a testbed for the testing and simulation of large-scale agent-based systems. ACTIVE is being developed as...
G. Emami, J. Cheng, D. Cornwell, M. Feldhousen, C....
WRAC
2005
Springer
16 years 6 days ago
Autonomy Without Independence: Animal Training as a Model for Robot Design
A classic autonomous robot is an autonomous agent for open, unpredictable environments. Such an agent is inherently autonomous but not independent. Independence implies unpredictab...
David C. Wyland