Search Sciweavers | Sciweavers

» Numberings Optimal for Learning

NIPS
2001

144 views | Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

COLING
1996

113 views | Computational Linguistics

Learning to Recognize Names Across Languages

15 years 8 months ago

Download acl.ldc.upenn.edu

The development of natural language processing (NLP) systems that perform machine translation (MT) and information retrieval (IR) has highlighted the need for the automatic recogn...

Anthony F. Gallippi

TEC
2008

165 views

Population-Based Incremental Learning With Associative Memory for Dynamic Environments

15 years 6 months ago

Download www.cs.bham.ac.uk

In recent years, interest in studying evolutionary algorithms (EAs) for dynamic optimization problems (DOPs) has grown due to its importance in real-world applications. Several app...

Shengxiang Yang, Xin Yao

ICTAI
2010
IEEE

211 views | Artificial Intelligence

Combining Mixed Integer Programming and Supervised Learning for Fast Re-planning

15 years 4 months ago

Download www.montefiore.ulg.ac.be

We introduce a new plan repair method for problems cast as Mixed Integer Programs. In order to tackle the inherent complexity of these NP-hard problems, our approach relies on the ...

Emmanuel Rachelson, Ala Ben Abbes, Sebastien Dieme...

IS
2010

109 views | Artificial Intelligence

Multicriteria reinforcement learning based on a Russian doll method for network routing

15 years 4 months ago

Download hal.archives-ouvertes.fr

The routing in communication networks is typically a multicriteria decision making (MCDM) problem. However, setting the parameters of most used MCDM methods to fit the preferences ...

Alain Pétrowski, Farouk Aissanou, Ilham Ben...
