登录
首页 » matlab » WindyGridWorldQLearning

WindyGridWorldQLearning

于 2013-04-19 发布 文件大小:2KB
0 73
下载积分: 1 下载次数: 31

代码说明:

  Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states. This paper presents and proves in detail a convergence theorem for Q,-learning based on that outlined in Watkins (1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed each iteration, rather than just one.

文件列表:

下载说明:请别用迅雷下载,失败请重下,重下不扣分!

发表评论

0 个回复

  • AnExampleOfKalmanFiltering
    Summary: An example of KALMAN FILTER MATLAB Release: R13SP1 Required Products: Communications Toolbox,Signal Processing Toolbox Description: THIS PROGRAM DEMONSTRATES AN EXAMPLE OF KALMAN FILTER.
    2007-11-27 00:47:06下载
    积分:1
  • SEP
    LEACH协议的改进SEP的Maltab仿真代码( SEP,a protocol to improve the LEACH by Maltab)
    2010-02-14 21:04:57下载
    积分:1
  • pll
    本文讲述锁相环的工作原理,锁相环路实际上是一个相差自动调节系统。(This article describes the working principle of PLL, PLL is actually a difference between the automatic adjustment system.)
    2011-01-09 20:25:55下载
    积分:1
  • DirectedSpanningTree
    find out minimum spanning tree
    2011-02-13 00:59:14下载
    积分:1
  • 53607901TOA_TR_RE
    说明:  hslogic算法仿真,基于MATLAB的TOA算法估计,可以获得较高的定位精度(Hslogic algorithm simulation, toa algorithm estimation based on MATLAB, can obtain high positioning accuracy)
    2019-12-08 00:33:51下载
    积分:1
  • radarSimulation
    radar matlab simulation
    2011-10-01 14:06:59下载
    积分:1
  • DVB-S2
    有关DVB—S2的技术特点及应用,及其相关的重要技术,如信道编码,LDPC码和BCH码的编译码算法(Technical characteristics and application of DVB-S2, and the important technology, such as channel coding, LDPC code and BCH code encryption algorithm)
    2020-07-02 06:40:02下载
    积分:1
  • ELDGA
    solves economic dispatch using ga
    2014-12-02 18:46:21下载
    积分:1
  • 07027162
    IMPARARE IL C++ in 2 ore e mezzo? Modulo-000
    2015-03-10 23:22:16下载
    积分:1
  • matlabsourcecodeforFingerprint
    matlab sourcecodeforFingerprint(sourcecodeforFingerprint)
    2009-05-10 13:13:48下载
    积分:1
  • 696521资源总数
  • 104066会员总数
  • 49今日下载