Skip to content

Commit 4a2eb85

Browse files
committed
added TRPO
1 parent 1d97e63 commit 4a2eb85

File tree

1 file changed

+9
-5
lines changed

1 file changed

+9
-5
lines changed

README.md

+9-5
Original file line numberDiff line numberDiff line change
@@ -92,22 +92,26 @@
9292

9393
* Trust-Region Policy Optimization (TRPO)
9494
[[slides](https://github.com/wangshusen/DRL/blob/master/Slides/5_Policy_1.pdf)]
95+
[[Video (in Chinese)](https://youtu.be/fcSYiyvPjm4)].
9596

96-
* Policy Network + RNNs.
97+
* Partial Observation and RNNs.
9798

9899

99100

100101
6. **Dealing with Continuous Action Space.**
101102

102103

103-
* Discrete versus Continuous Control.
104-
[[slides](https://github.com/wangshusen/DRL/blob/master/Slides/6_Continuous_1.pdf)]
104+
* Discrete versus Continuous Control
105+
[[slides](https://github.com/wangshusen/DRL/blob/master/Slides/6_Continuous_1.pdf)]
106+
[[Video (in Chinese)](https://youtu.be/rRIjgdxSvg8)].
105107

106-
* Deterministic Policy Gradient (DPG) for Continuous Control.
108+
* Deterministic Policy Gradient (DPG) for Continuous Control
107109
[[slides](https://github.com/wangshusen/DRL/blob/master/Slides/6_Continuous_2.pdf)]
110+
[[Video (in Chinese)](https://youtu.be/cmWejKRWLA8)].
108111

109-
* Stochastic Policy Gradient for Continuous Control.
112+
* Stochastic Policy Gradient for Continuous Control
110113
[[slides](https://github.com/wangshusen/DRL/blob/master/Slides/6_Continuous_3.pdf)]
114+
[[Video (in Chinese)](https://youtu.be/McqFyl_W5Wc)].
111115

112116

113117

0 commit comments

Comments
 (0)