Sciweavers

2263 search results - page 156 / 453
» On learning in agent-centered search
IJCAI
2001
15 years 7 months ago
Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning
Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...
Gregory Z. Grudic, Lyle H. Ungar
NIPS
1993
15 years 7 months ago
Temporal Difference Learning of Position Evaluation in the Game of Go
The game of Go has a high branching factor that defeats the tree search approach used in computer chess, and long-range spatiotemporal interactions that make position evaluation e...
Nicol N. Schraudolph, Peter Dayan, Terrence J. Sej...
ICASSP
2010
IEEE
15 years 6 months ago
Speech modeling based on committee-based active learning
We propose a committee-based active learning method for large vocabulary continuous speech recognition. In this approach, multiple recognizers are prepared beforehand, and the rec...
Yuzu Hamanaka, Koichi Shinoda, Sadaoki Furui, Tada...
CVPR
2007
IEEE
16 years 8 months ago
Multiple Instance Learning of Pulmonary Embolism Detection with Geodesic Distance along Vascular Structure
We propose a novel classification approach for automatically detecting pulmonary embolism (PE) from computed tomography angiography images. Unlike most existing approaches that req...
Jinbo Bi, Jianming Liang
KDD
2005
ACM
16 years 6 months ago
Using and Learning Semantics in Frequent Subgraph Mining
The search for frequent subgraphs is becoming increasingly important in many application areas including Web mining and bioinformatics. Any use of graph structures in mining, howev...
Bettina Berendt