File tree 1 file changed +9
-5
lines changed
1 file changed +9
-5
lines changed Original file line number Diff line number Diff line change 92
92
93
93
* Trust-Region Policy Optimization (TRPO)
94
94
[[ slides] ( https://github.com/wangshusen/DRL/blob/master/Slides/5_Policy_1.pdf )]
95
+ [[ Video (in Chinese)] ( https://youtu.be/fcSYiyvPjm4 )] .
95
96
96
- * Policy Network + RNNs.
97
+ * Partial Observation and RNNs.
97
98
98
99
99
100
100
101
6 . ** Dealing with Continuous Action Space.**
101
102
102
103
103
- * Discrete versus Continuous Control.
104
- [[ slides] ( https://github.com/wangshusen/DRL/blob/master/Slides/6_Continuous_1.pdf )]
104
+ * Discrete versus Continuous Control
105
+ [[ slides] ( https://github.com/wangshusen/DRL/blob/master/Slides/6_Continuous_1.pdf )]
106
+ [[ Video (in Chinese)] ( https://youtu.be/rRIjgdxSvg8 )] .
105
107
106
- * Deterministic Policy Gradient (DPG) for Continuous Control.
108
+ * Deterministic Policy Gradient (DPG) for Continuous Control
107
109
[[ slides] ( https://github.com/wangshusen/DRL/blob/master/Slides/6_Continuous_2.pdf )]
110
+ [[ Video (in Chinese)] ( https://youtu.be/cmWejKRWLA8 )] .
108
111
109
- * Stochastic Policy Gradient for Continuous Control.
112
+ * Stochastic Policy Gradient for Continuous Control
110
113
[[ slides] ( https://github.com/wangshusen/DRL/blob/master/Slides/6_Continuous_3.pdf )]
114
+ [[ Video (in Chinese)] ( https://youtu.be/McqFyl_W5Wc )] .
111
115
112
116
113
117
You can’t perform that action at this time.
0 commit comments