Shared Value Function + Policy Parameters
Activity
- Daniel Lukats assigned to @dl337788
- Daniel Lukats added Feature label
- Daniel Lukats added In Progress label
- Author Owner
The CNN policy at https://github.com/openai/baselines/blob/9ee399f5b20cd70ac0a871927a6cf043b478193f/baselines/ppo1/cnn_policy.py#L36 has two heads, one for action selection and one for value prediction. Sharing parameters between the policy and the value function appears to be common.
- Daniel Lukats marked this issue as related to #8 (closed)
- Daniel Lukats added In Review label and removed In Progress label
- Daniel Lukats mentioned in issue #6 (closed)
- Author Owner
Tests implemented in 3dbac4a4
- Daniel Lukats closed
- Daniel Lukats reopened
- Daniel Lukats closed
- Daniel Lukats removed In Review label
- Daniel Lukats changed milestone to %Official dates
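For reference, the two-headed layout mentioned in the comment on the Baselines policy can be sketched roughly as follows. This is only a minimal illustration in plain Python, not the code from this project or from Baselines: the class name `SharedActorCritic`, the single tanh hidden layer, and all dimensions are assumptions made for the example.

```python
import math
import random


class SharedActorCritic:
    """Hypothetical two-headed network: one shared hidden layer, two linear heads."""

    def __init__(self, obs_dim, hidden_dim, n_actions, seed=0):
        rng = random.Random(seed)

        def init(rows, cols):
            # Small uniform initialisation, scaled by the fan-in.
            scale = 1.0 / math.sqrt(cols)
            return [[rng.uniform(-scale, scale) for _ in range(cols)]
                    for _ in range(rows)]

        self.w_shared = init(hidden_dim, obs_dim)    # used by both heads
        self.w_policy = init(n_actions, hidden_dim)  # policy head only
        self.w_value = init(1, hidden_dim)           # value head only

    @staticmethod
    def _matvec(w, x):
        # Plain matrix-vector product on nested lists.
        return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]

    def forward(self, obs):
        # Shared features feed both heads.
        hidden = [math.tanh(h) for h in self._matvec(self.w_shared, obs)]

        # Policy head: softmax over action logits (max subtracted for stability).
        logits = self._matvec(self.w_policy, hidden)
        m = max(logits)
        exps = [math.exp(l - m) for l in logits]
        total = sum(exps)
        probs = [e / total for e in exps]

        # Value head: a single scalar state-value estimate.
        value = self._matvec(self.w_value, hidden)[0]
        return probs, value


net = SharedActorCritic(obs_dim=4, hidden_dim=8, n_actions=2)
probs, value = net.forward([0.1, -0.2, 0.3, 0.0])
```

Because `w_shared` sits in front of both heads, any update to it changes the action probabilities and the value estimate at once, which is the property a test for shared parameters would most likely check.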