登录
首页 » matlab » WindyGridWorldQLearning

WindyGridWorldQLearning

于 2013-04-19 发布 文件大小:2KB
0 170
下载积分: 1 下载次数: 31

代码说明:

  Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states. This paper presents and proves in detail a convergence theorem for Q,-learning based on that outlined in Watkins (1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed each iteration, rather than just one.

文件列表:

下载说明:请别用迅雷下载,失败请重下,重下不扣分!

发表评论

0 个回复

  • Design720_varghese_terryn
    continuous control system
    2010-05-15 11:55:06下载
    积分:1
  • ssom
    利用som算法进行数据的聚类,实现无监督聚类(clustering the data with the som algorithm, that can relizing unsupervised cluster)
    2019-04-29 17:00:47下载
    积分:1
  • Independent_Component_Analysis
    Independent Component Analysis ICA
    2010-02-09 21:04:36下载
    积分:1
  • RWT_BIO
    Non-decimated wavelet transform, redundant wavelet transform, stationary % wavelet transform(Non-decimated wavelet transform, redundant wavelet transform, stationary wavelet transform)
    2007-08-30 10:04:26下载
    积分:1
  • RSC_BCJR_Mfile_Tail
    本程序设计的是咬尾编码的turbo码,用matlab编写。咬尾turbo码也即不需要拖尾比特,使得前后两个状态相同,译码采用修正的MAP译码算法(This program is designed to encode the turbo tail biting code, written with matlab. Tail biting turbo code that is not trailing bits to make the same before and after the two states, decoding, MAP decoding algorithm using modified)
    2010-10-26 20:40:00下载
    积分:1
  • image-Increase
    matlab 数字图像处理 图像增强 含代码和结果显示(Digital image processing matlab image enhancement Containing the code and the results show)
    2015-01-25 13:45:01下载
    积分:1
  • HMM-model
    隐马尔科夫模型是语音识别中的重要算法思想,这里在matlab下实现了一个原理性的算法(Hidden markov models is an important algorithm in speech recognition, here under the matlab implements a rational algorithm)
    2014-01-05 17:33:23下载
    积分:1
  • matlab
    有限元平面刚架结构的分析 matlab程序(Finite element analysis of plane frame structures matlab program)
    2013-12-28 11:06:50下载
    积分:1
  • huffman
    对给定的txt文件进行Huffman编码和解码,并加以分析,采用无记忆信源编码方式(For a given txt files Huffman encoding and decoding, and analyzed, using non-memory source coding method)
    2009-12-29 17:49:57下载
    积分:1
  • quk
    说明:  quk的博士论文和源代码,关于格子boltzmann方法的综合应用(quk doctoral thesis and the source code, on the comprehensive application of lattice boltzmann method)
    2011-03-02 16:38:50下载
    积分:1
  • 696518资源总数
  • 106182会员总数
  • 24今日下载