Search Sciweavers | Sciweavers

12673 search results - page 439 / 2535

» Learning while designing

185

click to vote

AAAI
2008

199views Intelligent Agents» more AAAI 2008»

Maximum Entropy Inverse Reinforcement Learning

15 years 9 months ago

Download www.andrew.cmu.edu

Recent research has shown the benefit of framing problems of imitation learning as solutions to Markov Decision Problems. This approach reduces learning to the problem of recoveri...

Brian Ziebart, Andrew L. Maas, J. Andrew Bagnell, ...

claim paper

Read More »

168

click to vote

AAAI
2010

154views Intelligent Agents» more AAAI 2010»

Assisting Users with Clustering Tasks by Combining Metric Learning and Classification

15 years 8 months ago

Download research.microsoft.com

Interactive clustering refers to situations in which a human labeler is willing to assist a learning algorithm in automatically clustering items. We present a related but somewhat...

Sumit Basu, Danyel Fisher, Steven M. Drucker, Hao ...

claim paper

Read More »

177

click to vote

AAAI
2010

174views Intelligent Agents» more AAAI 2010»

To Max or Not to Max: Online Learning for Speeding Up Optimal Planning

15 years 8 months ago

Download www.technion.ac.il

It is well known that there cannot be a single "best" heuristic for optimal planning in general. One way of overcoming this is by combining admissible heuristics (e.g. b...

Carmel Domshlak, Erez Karpas, Shaul Markovitch

claim paper

Read More »

176

click to vote

ECIS
2001

113views Information Technology» more ECIS 2001»

Knowledge, learning and IT support in a small software company

15 years 8 months ago

Download is2.lse.ac.uk

The literature in the field of knowledge management shows a certain preoccupation with information technology (IT) and technical solutions while it reflects a limited view of orga...

Karlheinz Kautz, Kim Thaysen

claim paper

Read More »

193

click to vote

AAAI
2000

139views Intelligent Agents» more AAAI 2000»

Localizing Search in Reinforcement Learning

15 years 8 months ago

Download www.cs.colorado.edu

Reinforcement learning (RL) can be impractical for many high dimensional problems because of the computational cost of doing stochastic search in large state spaces. We propose a ...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

« Prev « First page 439 / 2535 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers