GitHub - ZhenghaoFei/hybrid_net: Investigating planning-reaction hybrid network for deep reinforcement learning robot control

Planing - Reaction Hybrid Network

We are investigating a planning-reaction hybrid network for deep reinforcement learning robot control. The basic idea is design a planning network and combine it to a reaction network.

Reaction Network

We call the standard plain network as reaction network, which means they are good at reaction but not good at planning computaion.

A standard reaction network is like this:

Planning Network

The idea of planning network was got from :

Tamar, Aviv, et al. "Value iteration networks." Advances in Neural Information Processing Systems. 2016.

We simplfied the planning network and make it more general do not rely on explicit value interation.

We design a planning network use rnn like this:

Planing - Reaction Hybrid Network

Our ultimate purpose is design a network that has Planing ability while also good at Reaction, the combining of two kinds of computation can improve the total performance of deep reinforcement learning especially in control.

The preliminary design is like this:

We are using a very small network "state evaluation net" in the middle and every time it will output a scala alpha and combine the ouput of planning and reaction module by the ratio of alpha, 1 - alpha. We hope the "state evaluation net" can learn to know when the situation is needing planning more.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
A3C		A3C
Atari		Atari
DQN		DQN
PG/Networks_5		PG/Networks_5
resources		resources
roms		roms
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Planing - Reaction Hybrid Network

Reaction Network

Planning Network

Planing - Reaction Hybrid Network

About

Releases

Packages

Languages

ZhenghaoFei/hybrid_net

Folders and files

Latest commit

History

Repository files navigation

Planing - Reaction Hybrid Network

Reaction Network

Planning Network

Planing - Reaction Hybrid Network

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages