首页
学习
活动
专区
工具
TVP
发布
社区首页 >专栏 >近年来强化学习分类综述大全,不看后悔,收藏为先!

近年来强化学习分类综述大全,不看后悔,收藏为先!

作者头像
演化计算与人工智能
发布2021-06-09 16:28:56
1.1K0
发布2021-06-09 16:28:56
举报

Evacloud

大纲

强化学习宏观视角

多智能体强化学习

offline rl

off-policy rl

策略估计

逆强化学习

强化学习、模仿学习

分层强化学习

多任务强化学习

adversarial rl

Probabilistic rl

分布式强化学习

Sim-to-Real

奖励塑形

安全强化学习

model-free、model-based强化学习

强化学习状态表示

贝叶斯强化学习

强化学习组合优化

迁移学习强化学习

课程学习强化学习

可解释的强化学习

动态环境强化学习

Human advice强化学习

NLP&rl

强化学习实验

强化学习的应用

1.机器人 2.搜索、推荐、社交 3.经济金融 4.交通运输、能源 5.游戏 6.云计算、物联网 7.自动驾驶、路径规划 8.安全 9.医疗 10.深度学习框架 11.自动控制

其他

正文

强化学习宏观视角

A Survey of Reinforcement Learning Techniques: Strategies, Recent Development, and Future Directions

arxiv.org/abs/2001.0692

A survey of benchmarking frameworks for reinforcement learning

arxiv.org/pdf/2011.1357

Universal Reinforcement Learning Algorithms: Survey and Experiments

arxiv.org/pdf/1705.1055

A Brief Survey of Deep Reinforcement Learning

arxiv.org/pdf/1708.0586

Policy Search in Continuous Action Domains: an Overview

arxiv.org/pdf/1803.0470

A Tour of Reinforcement Learning: The View from Continuous Control

arxiv.org/pdf/1806.0946

多智能体强化学习

An Overview of Multi-agent Reinforcement Learning from Game Theoretical Perspective

arxiv.org/pdf/2011.0058

Algorithms in Multi-Agent Systems: A Holistic Perspective from Reinforcement Learning and Game Theory

arxiv.org/pdf/2001.0648

Deep Reinforcement Learning for Multi-Agent Systems: A Review of Challenges, Solutions and Applications

arxiv.org/pdf/1812.1179

A Survey of Learning in Multiagent Environments: Dealing with Non-Stationarity

arxiv.org/pdf/1707.0918

A Survey and Critique of Multiagent Deep Reinforcement Learning

arxiv.org/pdf/1810.0558

offline rl

Offline (Batch) Reinforcement Learning: A Review of Literature and Applications

Offline (Batch) Reinforcement Learning: A Review of Literature and Applications

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems

arxiv.org/pdf/2005.0164

off-policy rl

Off-policy Learning With Eligibility Traces: A Survey

jmlr.org/papers/volume1

策略估计

逆强化学习

A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress

arxiv.org/pdf/1806.0687

强化学习、模仿学习

A Survey of Deep RL and IL for Autonomous Driving Policy Learning

arxiv.org/pdf/2101.0199

A Survey of Deep Network Solutions for Learning Control in Robotics: From Reinforcement to Imitation

arxiv.org/pdf/1612.0713

分层强化学习

Hierarchical principles of embodied reinforcement learning: A review

arxiv.org/pdf/2012.1014

多任务强化学习

A Survey of Multi-Task Deep Reinforcement Learning

mdpi.com/2079-9292/9/9/

adversarial rl

Challenges and Countermeasures for Adversarial Attacks on Deep Reinforcement Learning

arxiv.org/pdf/2001.0968

Probabilistic rl

A Short Survey on Probabilistic Reinforcement Learning

arxiv.org/pdf/1901.0701

Tutorial and Survey on Probabilistic Graphical Model and Variational Inference in Deep Reinforcement Learning

arxiv.org/pdf/1908.0938

分布式强化学习

Distributed Deep Reinforcement Learning: An Overview

arxiv.org/pdf/2011.1101

Sim-to-Real

Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey

arxiv.org/pdf/2009.1330

奖励塑形

Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey

arxiv.org/pdf/2012.0983

A survey on intrinsic motivation in reinforcement learning

arxiv.org/pdf/1908.0697

安全强化学习

Safe Learning and Optimization Techniques: Towards a Survey of the State of the Art

arxiv.org/pdf/2101.0950

model-free、model-based强化学习

Average-reward model-free reinforcement learning: a systematic review and literature mapping

arxiv.org/pdf/2010.0892

Model-Free Reinforcement Learning for Financial Portfolios: A Brief Survey

arxiv.org/ftp/arxiv/pap

Model-based Reinforcement Learning: A Survey.

arxiv.org/pdf/2006.1671

Model-Based Deep Reinforcement Learning for High-Dimensional Problems, a Survey

arxiv.org/pdf/2008.0559

A Survey on Applications of Model-Free Strategy Learning in Cognitive Wireless Networks

arxiv.org/pdf/1504.0397

强化学习状态表示

An Overview of Natural Language State Representation for Reinforcement Learning

arxiv.org/pdf/2007.0977

Feature-Based Aggregation and Deep Reinforcement Learning: A Survey and Some New Implementations

arxiv.org/pdf/1804.0457

贝叶斯强化学习

Bayesian Reinforcement Learning: A Survey

arxiv.org/pdf/1609.0443

强化学习组合优化

Reinforcement Learning for Combinatorial Optimization: A Survey

arxiv.org/pdf/2003.0360

A Survey on Reinforcement Learning for Combinatorial Optimization

arxiv.org/pdf/2008.1224

迁移学习强化学习

Transfer Learning in Deep Reinforcement Learning: A Survey

arxiv.org/pdf/2009.0788

课程学习强化学习

Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey

arxiv.org/pdf/2003.0496

Automatic Curriculum Learning For Deep RL: A Short Survey

arxiv.org/pdf/2003.0466

可解释的强化学习

Explainable Reinforcement Learning: A Survey

arxiv.org/pdf/2005.0624

动态环境强化学习

A Survey of Reinforcement Learning Algorithms for Dynamically Varying Environments

arxiv.org/pdf/2005.1061

A Survey of Learning in Multiagent Environments: Dealing with Non-Stationarity

arxiv.org/pdf/1707.0918

Human advice强化学习

REINFORCEMENT LEARNING WITH HUMAN ADVICE: A SURVEY.

arxiv.org/pdf/2005.1101

NLP&rl

A Survey of Reinforcement Learning Informed by Natural Language

arxiv.org/pdf/1906.0392

强化学习实验

A Survey on Reproducibility by Evaluating Deep Reinforcement Learning Algorithms on Real-World Robots

arxiv.org/pdf/1909.0377

强化学习的应用

1.机器人

Reinforcement Learning Approaches in Social Robotics

arxiv.org/pdf/2009.0968

Emotion in Reinforcement Learning Agents and Robots: A Survey

arxiv.org/pdf/1705.0517

2.搜索、推荐、社交

Reinforcement Learning based Recommender Systems: A Survey

arxiv.org/pdf/2101.0628

Deep Reinforcement Learning for Search, Recommendation, and Online Advertising: A Survey

arxiv.org/pdf/1812.0712

Applications of Deep Reinforcement Learning in Communications and Networking: A Survey

arxiv.org/pdf/1810.0786

Reinforcement Learning Approaches in Social Robotics

arxiv.org/pdf/2009.0968

3.经济金融

Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics

arxiv.org/ftp/arxiv/pap

Model-Free Reinforcement Learning for Financial Portfolios: A Brief Survey

arxiv.org/ftp/arxiv/pap

4.交通运输、能源

Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey

arxiv.org/pdf/2005.0093

Deep Reinforcement Learning for Smart Building Energy Management: A Survey

https://arxiv.org/pdf/2008.05074.pdfarxiv.org

weixin.qq.com/g/AwYAAO5<br> (二维码自动识别)

满200的话拉您进2群,加我微信 jjnuxjp5x

5.游戏

A Survey of Deep Reinforcement Learning in Video Games

arxiv.org/pdf/1912.1094

6.云计算、物联网

Reinforcement Learning-based Application Autoscaling in the Cloud: A Survey

arxiv.org/pdf/2001.0995

Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges

arxiv.org/pdf/1907.0905

7.自动驾驶、路径规划

Deep Reinforcement Learning for Autonomous Driving: A Survey

arxiv.org/pdf/2002.0044

Survey of Deep Reinforcement Learning for Motion Planning of Autonomous Vehicles

arxiv.org/pdf/2001.1123

8.安全

Deep Reinforcement Learning for Cyber Security

arxiv.org/pdf/1906.0579

9.医疗

Deep Reinforcement Learning for Clinical Decision Support: A Brief Survey

arxiv.org/pdf/1907.0947

Reinforcement Learning in Healthcare: A Survey

arxiv.org/pdf/1908.0879

10.深度学习框架

Deep Reinforcement Learning for Sequence-to-Sequence Models

arxiv.org/pdf/1805.0946

11.自动控制

Optimal and Autonomous Control Using Reinforcement Learning: A Survey

par.nsf.gov/servlets/pu

其他

Derivative-Free Reinforcement Learning: A Review

arxiv.org/pdf/2102.0571

A SHORT SURVEY ON MEMORY BASED REINFORCEMENT LEARNING

arxiv.org/pdf/1904.0673

本文参与 腾讯云自媒体分享计划,分享自微信公众号。
原始发表:2021-05-27,如有侵权请联系 cloudcommunity@tencent.com 删除

本文分享自 DrawSky 微信公众号,前往查看

如有侵权,请联系 cloudcommunity@tencent.com 删除。

本文参与 腾讯云自媒体分享计划  ,欢迎热爱写作的你一起参与!

评论
登录后参与评论
0 条评论
热度
最新
推荐阅读
目录
  • A Survey of Reinforcement Learning Techniques: Strategies, Recent Development, and Future Directions
    • A Survey of Multi-Task Deep Reinforcement Learning
    相关产品与服务
    批量计算
    批量计算(BatchCompute,Batch)是为有大数据计算业务的企业、科研单位等提供高性价比且易用的计算服务。批量计算 Batch 可以根据用户提供的批处理规模,智能地管理作业和调动其所需的最佳资源。有了 Batch 的帮助,您可以将精力集中在如何分析和处理数据结果上。
    领券
    问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档