Data Efficient Reinforcement learning for Autonomous Robots with Simulated and Off-policy Data

Learning from interaction with the environment -- trying untested actions, observing successes and failures, and tying effects back to causes -- is one of the first capabilities thought of when considering intelligent agents. Reinforcement learning is the area of artificial intelligence research that has the goal of allowing autonomous agents to learn in this way. Despite many recent empirical successes, most modern reinforcement learning algorithms are still limited by the large amounts of experience required before useful skills are learned. Making reinforcement learning more data efficient would allow computers to autonomously solve complex tasks in dynamic environments such as those found in robotics, traffic management, or healthcare.

My research focuses on giving agents the ability to predict how their actions influence their ability to solve a given task. In this talk, I will describe my research in this area and how efficient prediction connects to efficient reinforcement learning. In the first part of the talk, I will introduce an algorithm that allows an agent to find informative exploratory behaviors for learning how it’s actions influence task performance. In the second part of the talk, I will introduce an algorithm that allows robot skills learned in simulated environments to transfer to the real world. Finally, I will describe directions for future work that will lead to an increased applicability of reinforcement learning to real-world problems.

See more at

Advertisement

Data Efficient Reinforcement learning for Autonomous Robots with Simulated and Off-policy Data

microsoft research,

Post a Comment

0 Comments

Popular Videos

Model Sophie dress presentation agency Brima.d

Won't work under pressure when it comes to national security: PM Modi

The Most Efficient Way To Farm Sugar Cane Collection (Hypixel Skyblock)

Muscles supplied by facial nerve.ENT PEARLS daily

Body Image- Mental Health Awareness Week 2019

Слух: From Software готовит "Dark Souls" про викингов | Игровые новости

30 THINGS YOU DO WRONG

Archive

Recent

Categories

HOT

Menu Footer Widget

Advertisement

Data Efficient Reinforcement learning for Autonomous Robots with Simulated and Off-policy Data

microsoft research,

You may like these posts

Post a Comment

0 Comments

Popular Videos

Model Sophie dress presentation agency Brima.d

Won't work under pressure when it comes to national security: PM Modi

The Most Efficient Way To Farm Sugar Cane Collection (Hypixel Skyblock)

Muscles supplied by facial nerve.ENT PEARLS daily

Body Image- Mental Health Awareness Week 2019

Слух: From Software готовит "Dark Souls" про викингов | Игровые новости

30 THINGS YOU DO WRONG

Archive

Recent

Categories

HOT

Menu Footer Widget