Sciweavers

3238 search results - page 549 / 648
» On the Computational Interpretation of Negation
NIPS
2001
15 years 7 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
NIPS
2001
15 years 7 months ago
MIME: Mutual Information Minimization and Entropy Maximization for Bayesian Belief Propagation
Bayesian belief propagation in graphical models has been recently shown to have very close ties to inference methods based in statistical physics. After Yedidia et al. demonstrate...
Anand Rangarajan, Alan L. Yuille
INTERACT
2003
15 years 7 months ago
Improving Usability of E-Commerce Sites by Tracking Eye Movements
Usability evaluation techniques such as user-observations, cognitive walkthroughs, or heuristic evaluations can be applied to evaluate the usability of multimedia interfaces of c...
Ekaterini Tzanidou
WOA
2001
15 years 7 months ago
Multi-Agent Systems as Composition of Observable Systems
Observation is becoming a crucial issue in the engineering of today's systems: the common practice for dealing with their complexity is to encapsulate their subcomponents abs...
Mirko Viroli, Andrea Omicini
AAAI
2000
15 years 7 months ago
STA: Spatio-Temporal Aggregation with Applications to Analysis of Diffusion-Reaction Phenomena
Spatio-temporal data sets arise when time-varying physical fields are discretized for simulation or analysis. Examples of time-varying fields are isothermal regions in the sea or ...
Iván Ordóñez, Feng Zhao