
PPO-trained CARLA demo: showcase and method

用户1908973
Published 2019-04-28 14:23:59
Included in the column: CreateAMind
Video content

Our dream is to create a safe driving system that works well under all circumstances. That requires a more intelligent agent: we want to create a mind, not just a larger or deeper network. Our team is now applying reinforcement learning (RL) to self-driving, letting the agent learn to drive entirely by itself. Here is a demo in the CARLA simulator.

The input is two images stacked together. A three-layer convolutional network processes the images, and a separate MLP handles the measurements. Most importantly, we never give the car a detailed command such as "go straight 5 m, then turn right"; instead we use a high-level command such as "turn right". When we drive and the navigation system tells us to turn left, we still have to decide when and how to turn. The same is true of our agent.
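To make the input layout concrete, here is a minimal sketch of how the two-frame stack, the measurements, and the high-level command could be assembled into one observation. It is not our training code: the names `Command`, `FrameStacker`, `one_hot`, and `build_observation` are hypothetical, the set of four commands is an assumption, and appending the one-hot command to the measurement vector is only one possible way to feed the command to the network.

```python
from collections import deque
from enum import IntEnum


class Command(IntEnum):
    """Hypothetical set of high-level commands (an assumption, not from the article)."""
    FOLLOW_LANE = 0
    TURN_LEFT = 1
    TURN_RIGHT = 2
    GO_STRAIGHT = 3


class FrameStacker:
    """Keeps the k most recent frames; k=2 matches the two-image stack."""

    def __init__(self, k=2):
        self.k = k
        self.frames = deque(maxlen=k)

    def reset(self, frame):
        # At episode start, repeat the first frame so the stack is always full.
        self.frames.clear()
        for _ in range(self.k):
            self.frames.append(frame)
        return list(self.frames)

    def push(self, frame):
        # The deque drops the oldest frame automatically once it holds k frames.
        self.frames.append(frame)
        return list(self.frames)


def one_hot(command):
    """Encode a Command as a list of floats with a single 1.0."""
    return [1.0 if i == command else 0.0 for i in range(len(Command))]


def build_observation(stacker, frame, measurements, command):
    """Return the two network inputs: stacked images (for the conv net)
    and a flat vector (for the MLP). Assumed layout, for illustration only."""
    return {
        "images": stacker.push(frame),  # oldest frame first
        "vector": list(measurements) + one_hot(command),
    }
```

In this sketch a frame can be any object (for example an image array); the stacker only manages ordering, so the convolutional network always sees the previous and the current frame together.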

We use Proximal Policy Optimization (PPO) plus some tricks, running with 12 workers. The agent in this demo was trained for about 30 hours.
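For readers new to PPO, the core of the algorithm is its clipped surrogate objective, which limits how far one update can move the policy away from the policy that collected the data. The sketch below computes that objective in plain Python for a batch of samples. The function name `ppo_clip_objective` is hypothetical, `clip_eps=0.2` is a commonly used default rather than a value taken from our setup, and the extra tricks and the 12-worker data collection are not shown.

```python
import math


def ppo_clip_objective(new_logps, old_logps, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective (to be maximized).

    new_logps:  log-probabilities of the taken actions under the current policy
    old_logps:  log-probabilities under the policy that collected the data
    advantages: advantage estimates for the same actions
    clip_eps:   clipping range (0.2 is an assumed default)
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_logps, old_logps, advantages):
        # Probability ratio between the current and the data-collecting policy.
        ratio = math.exp(new_lp - old_lp)
        # Restrict the ratio to [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)
```

With a positive advantage, the objective stops growing once the ratio exceeds `1 + clip_eps`, so the policy gains nothing from pushing that action's probability further in a single update. In practice the negated value is minimized with a gradient-based optimizer in a deep learning framework.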

Originally published 2019-03-03 on the CreateAMind WeChat official account and shared through the Tencent Cloud Self-Media Sharing Program.
