登录
首页 » matlab » WindyGridWorldQLearning

WindyGridWorldQLearning

于 2013-04-19 发布 文件大小:2KB
0 95
下载积分: 1 下载次数: 31

代码说明:

  Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian domains. It amounts to an incremental method for dynamic programming which imposes limited computational demands. It works by successively improving its evaluations of the quality of particular actions at particular states. This paper presents and proves in detail a convergence theorem for Q,-learning based on that outlined in Watkins (1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed each iteration, rather than just one.

文件列表:

下载说明:请别用迅雷下载,失败请重下,重下不扣分!

发表评论

0 个回复

  • (ASKOOKBPSKQPSK8PSK)
    实现数字基带系统的调制,主要包含2ask、4ask以及2fsk、2psk等,以及其图形化的界面显示(Digital base-band system modulation, mainly contains 2ask, 4ask and 2fsk, 2psk, as well as the graphical interface display)
    2010-12-28 16:54:19下载
    积分:1
  • 模块化光伏离网储能系统设计_赵治国。zip
    文章研究了一种模块化的小型光伏离网储能发电系统,该模块化光伏离网储能系统建成后用于偏远山区(This article studies a modular small-scale photovoltaic off-grid energy storage power generation system. This modular photovoltaic off-grid energy storage system is used in remote mountainous regions after it is completed.)
    2018-03-18 17:13:47下载
    积分:1
  • matlab
    matlab遗传算法的一些源程序,简单实用(matlab genetic algorithm source code of some simple and practical)
    2008-03-27 22:20:16下载
    积分:1
  • caponbeamforming
    鲁棒波束形成和加权系数波束形成(10元线性阵列)(Robust beamforming and beamforming weighting coefficients (10 linear array))
    2010-12-23 10:35:07下载
    积分:1
  • 111
    直流信号和阶跃信号的傅里叶分析和频谱图.(Fourier analysis and spectrogram DC signal and step signal.)
    2015-04-13 19:02:47下载
    积分:1
  • power-flow
    matlab_的电力系统潮流仿真计算,里面是一个文档,有详细步骤,还有MATLAB的M程序。(matlab_ the power flow simulation, which is a document with detailed steps, as well as MATLAB M program.)
    2012-06-06 15:38:43下载
    积分:1
  • bpsk
    用matlab实现2psk比较详细,从二进制码到成型滤波到最终2psk信号都能成功实现(Using matlab to achieve 2psk in more detail, the binary code to the final 2psk signal shaping filter can be successfully achieved)
    2014-11-06 09:23:46下载
    积分:1
  • Codes
    First a estimate of wireless sensor nodes is made and a network mesh will be created based on some topology techniques. A connection has to be established between Node I and all the vicinity nodes.
    2013-04-23 22:33:13下载
    积分:1
  • 1203.6571
    bat algorithm for optimization
    2013-09-16 14:56:51下载
    积分:1
  • msfuntmpl
    this is file for traffic networks benchmarks...
    2013-03-11 15:37:01下载
    积分:1
  • 696518资源总数
  • 104269会员总数
  • 42今日下载