Sciweavers

2566 search results - page 335 / 514
» Relating reinforcement learning performance to classificatio...
Sort
View
JMLR
2006
124views more  JMLR 2006»
15 years 6 months ago
Policy Gradient in Continuous Time
Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...
Rémi Munos
ICCD
2006
IEEE
111views Hardware» more  ICCD 2006»
16 years 3 months ago
Implicit Search-Space Aware Cofactor Expansion: A Novel Preimage Computation Technique
Abstract— In this paper, we introduce a novel preimage computation technique that directly computes the circuit cofactors without an explicit search for any satisfiable solution...
Kameshwar Chandrasekar, Michael S. Hsiao
ECML
2006
Springer
15 years 10 months ago
A Discriminative Approach for the Retrieval of Images from Text Queries
This work proposes a new approach to the retrieval of images from text queries. Contrasting with previous work, this method relies on a discriminative model: the parameters are sel...
David Grangier, Florent Monay, Samy Bengio
ICML
2004
IEEE
16 years 7 months ago
Learning first-order rules from data with multiple parts: applications on mining chemical compound data
Inductive learning of first-order theory based on examples has serious bottleneck in the enormous hypothesis search space needed, making existing learning approaches perform poorl...
Cholwich Nattee, Sukree Sinthupinyo, Masayuki Numa...
SAC
2006
ACM
16 years 16 days ago
Approaches to text mining for clinical medical records
Clinical medical records contain a wealth of information, largely in free-text form. Means to extract structured information from free-text records is an important research endeav...
Xiaohua Zhou, Hyoil Han, Isaac Chankai, Ann Prestr...