PPO Implementation
Activity
- Daniel Lukats marked this issue as related to #2 (closed)
- Daniel Lukats added Feature label
- Daniel Lukats assigned to @dl337788
- Daniel Lukats marked this issue as related to #17 (closed)
- Daniel Lukats marked this issue as related to #18 (closed)
- Author Owner
"If using a neural network architecture that shares parameters between the policy and value function, we must use a loss function that combines the policy surrogate and a value function error term." (PPO, Schulman et al., 2017, p. 5)
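A minimal sketch of what such a combined loss could look like, written in plain Python so it runs without dependencies. It follows the combined objective from the PPO paper (clipped policy surrogate, squared value error, entropy bonus), negated so that it can be minimized. The function name `ppo_combined_loss`, its argument names, and the default coefficients are assumptions chosen for illustration; they are not taken from this project's code.

```python
import math


def ppo_combined_loss(new_log_probs, old_log_probs, advantages,
                      values, returns, entropies,
                      clip_eps=0.2, vf_coef=0.5, ent_coef=0.01):
    """Return a scalar loss to minimize (hypothetical helper).

    All arguments are equal-length sequences of floats, one entry per
    sampled time step. The default coefficients are illustrative only.
    """
    n = len(advantages)
    policy_sum = 0.0
    value_sum = 0.0
    entropy_sum = 0.0
    for lp_new, lp_old, adv, v, ret, ent in zip(
            new_log_probs, old_log_probs, advantages,
            values, returns, entropies):
        # Probability ratio between the new and the old policy.
        ratio = math.exp(lp_new - lp_old)
        # Ratio restricted to the interval [1 - eps, 1 + eps].
        clipped = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
        # Clipped surrogate: the pessimistic choice of the two terms.
        policy_sum += min(ratio * adv, clipped * adv)
        # Squared error between value prediction and return target.
        value_sum += (v - ret) ** 2
        entropy_sum += ent
    # The paper maximizes L_CLIP - c1 * L_VF + c2 * S; negate to minimize.
    return (-(policy_sum / n)
            + vf_coef * (value_sum / n)
            - ent_coef * (entropy_sum / n))
```

With shared parameters, a single scalar like this lets one backward pass update both the policy and the value head. In an autodiff framework the same arithmetic would be applied to tensors instead of Python floats.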
- Daniel Lukats marked this issue as related to #21 (closed)
- Daniel Lukats marked this issue as related to #22 (closed)
- Daniel Lukats added In Progress label
- Daniel Lukats marked this issue as related to #25 (closed)
- Daniel Lukats changed milestone to %Official dates
- Author Owner
Several performance optimizations are outlined and analyzed in "Are Deep Policy Gradient Algorithms Truly Policy Gradient Algorithms?" (Ilyas et al., 2018).
- Daniel Lukats added In Review label and removed In Progress label
- Daniel Lukats closed
- Daniel Lukats removed In Review label
- Daniel Lukats marked this issue as related to #35 (closed)
- Daniel Lukats marked this issue as related to #36 (closed)
- Daniel Lukats marked this issue as related to #33 (closed)
- Daniel Lukats marked this issue as related to #31 (closed)
- Daniel Lukats marked this issue as related to #53 (closed)