Reinforcement Learning Tutorial Code

Published 2022-03-15. File size: 247.06 kB

Code description:

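A basic reinforcement learning tutorial covering A3C, DDPG, and other algorithms, with control of simple games such as a cart and a robot arm. In reinforcement learning, an agent learns by trial and error: it interacts with an environment, the rewards it receives guide its behavior, and its goal is to maximize the reward it obtains. Reinforcement learning differs from supervised learning, as used in connectionist learning, mainly in the reinforcement signal. The signal provided by the environment is an evaluation (usually a scalar) of how good the chosen action was, rather than an instruction telling the reinforcement learning system (RLS) how to produce the correct action. Because the external environment provides so little information, the RLS must learn from its own experience. In this way the RLS acquires knowledge in an action-evaluation environment and improves its action policy to adapt to that environment.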
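
As an illustration of the trial-and-error loop described above (this is not code from the download package), here is a minimal sketch of tabular Q-learning in Python. The corridor environment, the function names train_q_table and greedy_policy, and all hyperparameter values are assumptions chosen for illustration. Q-learning is used here as a simple stand-in for the A3C and DDPG algorithms the package covers, which are considerably more involved.

```python
import random


def train_q_table(n_states=5, episodes=200, alpha=0.5, gamma=0.9,
                  epsilon=0.2, seed=0):
    """Tabular Q-learning on a hypothetical 1-D corridor.

    States are 0..n_states-1. The agent starts in state 0 and the episode
    ends when it reaches the rightmost state, which pays a reward of 1.
    Every other step pays 0. Actions: 0 = move left, 1 = move right.
    """
    rng = random.Random(seed)
    goal = n_states - 1
    # q[state][action] is the current estimate of the long-term reward.
    q = [[0.0, 0.0] for _ in range(n_states)]

    for _ in range(episodes):
        state = 0
        while state != goal:
            # Trial and error: explore with probability epsilon (or when the
            # two estimates are tied), otherwise take the better-looking action.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1

            # Environment step: moving left from state 0 keeps the agent there.
            next_state = max(0, state - 1) if action == 0 else state + 1
            # The environment only scores the action with a scalar reward;
            # it never says which action would have been correct.
            reward = 1.0 if next_state == goal else 0.0

            # Q-learning update; the terminal state has value 0.
            best_next = 0.0 if next_state == goal else max(q[next_state])
            target = reward + gamma * best_next
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Return the preferred action for every non-terminal state."""
    return [0 if left > right else 1 for left, right in q[:-1]]


if __name__ == "__main__":
    q_table = train_q_table()
    print(greedy_policy(q_table))
```

With the default settings the learned policy moves right in every state. The agent is never told that "right" is correct; it infers this only from the scalar rewards it collects while exploring.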

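  • Stellar light flux observed by the Kepler telescope
    Light-flux measurements of stars observed by the Kepler telescope, with 5,000 training records and just over 400 test records. The data can be used to train and test machine learning algorithms and check their feasibility.
    2022-09-04 05:30:15 Download
    Points: 1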
  • QtTest
    Description: Image recognition and classification using deep learning, with an interface built in Qt.
    2020-01-13 10:14:40 Download
    Points: 1
  • Django test
    Django test.
    2022-02-25 03:55:40 Download
    Points: 1
  • gan-master
    Sample generation with a GAN; the deep learning framework is TensorFlow.
    2019-02-25 10:07:14 Download
    Points: 1
  • change
    Combines PyQt and OpenCV to display images captured with OpenCV in a Qt window. Python code written by a developer abroad.
    2013-07-25 13:42:45 Download
    Points: 1
  • algorithm
    Python implementations of the bisection (binary search) idea, several sorting algorithms, and binary tree deletion.
    2020-06-18 10:40:02 Download
    Points: 1
  • _book
    A reference teaching document on setting up the installation environment for Python.
    2020-06-25 00:40:02 Download
    Points: 1
  • vucharlength
    The relation between mechanical stress, deformation, and the rate of change of deformation can be quite complicated, although a linear approximation may be adequate in practice if the quantities are small enough. Stress that exceeds certain strength limits of the material will result in permanent deformation (such as plastic flow, fracture, or cavitation) or may even change its crystal structure and chemical composition. In some branches of engineering, the term stress is occasionally used in a looser sense as a synonym of "internal force". For example, in the analysis of trusses, it may refer to the total traction or compression force acting on a beam, rather than the force divided by the area of its cross-section.
    2020-06-23 09:40:02 Download
    Points: 1
  • Python编程:从入门到实践 (Python Programming: From Introduction to Practice)
    A good introductory Python programming book, recommended for every beginner.
    2018-08-22 22:19:31 Download
    Points: 1
  • ACO
    A path planning algorithm based on the ACO algorithm, implemented in Python 3.7; it can be run directly.
    2020-06-25 08:40:02 Download
    Points: 1