-
WindyGridWorldQLearning
Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian
domains. It amounts to an incremental method for dynamic programming which imposes limited computational
demands. It works by successively improving its evaluations of the quality of particular actions at particular states.
This paper presents and proves in detail a convergence theorem for Q,-learning based on that outlined in Watkins
(1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions
are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions
to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed
each iteration, rather than just one.
- 2013-04-19 14:23:35下载
- 积分:1
-
xunhuanpu
一个用循环谱原理给出的计算程序,可以画出信号的图形(a cyclic spectrum )
- 2009-11-29 23:36:45下载
- 积分:1
-
circle_sector_contains_point_2d
for calculating circle sector
- 2009-12-08 16:17:48下载
- 积分:1
-
Huffman
霍夫曼信源编码,针对一串数据 进行霍夫曼压缩 得到霍夫曼树(Huffman coding)
- 2012-05-11 00:35:31下载
- 积分:1
-
HAWTSim
toolbox for simulation of wind turbine (Demo)
- 2011-07-06 19:38:49下载
- 积分:1
-
buckboost
升降压DC——DC变换器的simulink模型,前相为buck变换器,后相为boost变换器,可以进行适当的升降电压调节。(The buck-boost DC- DC converter simulink model, phase buck converter, the latter phase of the boost converter can be appropriate lifting voltage regulator.)
- 2012-07-01 16:28:12下载
- 积分:1
-
61549826Hierarchical
说明: 基于电力系统节点无功电压控制灵敏度,电气距离的聚类分析(Cluster Analysis of Reactive Power and Voltage Control Sensitivity and Electrical Distance of Power System Nodes)
- 2020-10-14 20:57:30下载
- 积分:1
-
Beamforming
This is a MATLAB based program that computes the weights and
beamforming pattern of a
- 2013-02-09 04:01:06下载
- 积分:1
-
Precoder-ande-Decoder
瑞利信道的仿真,以及瑞利信道下的编码解码过程的matlab实现(Rayleigh channel simulation, and the Rayleigh channel encoding and decoding process under the matlab implementation)
- 2011-09-21 18:23:16下载
- 积分:1
-
Improving-Dictionary-Learning
PDF+论文算法实现源代码"Improving Dictionary Learning: Multiple Dictionary Updates and Coefficient Reuse"由莱斯利N.史密斯和迈克尔·埃拉德,IEEE信号处理快报(2013年)(PDF+ thesis algorithm source code " Improving Dictionary Learning: Multiple Dictionary Updates and Coefficient Reuse" by the Er Ai Leslie N. Smith and Mike Ladd, IEEE Signal Processing Letters (2013))
- 2014-01-12 23:27:57下载
- 积分:1