MDP-model-of-MPNP
代码说明:
在matlab平台上,针对多周期报童问题,采用值迭代算法、策略迭代算法和强化学习算法求解MDP模型的实例(This is an example presentting how to apply value-iteration algorithm,policy-iteration algorithm and reinforcement learning algorithm to MDP model, which aims to solve the multi-period newsboy problem.)
文件列表:
多周期报童问题的MDP建模及求解
.............................\draw.asv,1879,2011-11-15
.............................\draw.m,1880,2011-11-15
.............................\drawFigure.asv,613,2011-07-28
.............................\drawFigure.m,1038,2011-10-09
.............................\initial.asv,854,2011-11-14
.............................\initial.m,855,2011-11-15
.............................\initial2.m,109,2011-10-15
.............................\main.asv,1727,2011-10-15
.............................\main.m,1271,2012-04-04
.............................\policyIteration.asv,2419,2011-10-13
.............................\policyIteration.m,2419,2011-10-13
.............................\QLearning.asv,2341,2011-10-12
.............................\QLearning.m,2712,2011-10-13
.............................\revenueMDP.asv,955,2011-07-30
.............................\revenueMDP.m,937,2011-10-15
.............................\revenuesS.m,919,2011-07-30
.............................\reward.asv,1348,2011-10-15
.............................\reward.m,1346,2011-10-15
.............................\transitionMatrix.asv,2738,2011-10-10
.............................\transitionMatrix.m,3186,2011-10-10
.............................\valueIteration.asv,1453,2011-10-10
.............................\valueIteration.m,1406,2011-10-12
下载说明:请别用迅雷下载,失败请重下,重下不扣分!