Skip to content

[RLlib] Fix SAC bug (twin Q not used for min'ing over both Q-nets in loss func).#7354

Merged
ericl merged 1 commit intoray-project:masterfrom
sven1977:sac_twin_q_net_not_used_for_min_over_q_values
Feb 27, 2020
Merged

[RLlib] Fix SAC bug (twin Q not used for min'ing over both Q-nets in loss func).#7354
ericl merged 1 commit intoray-project:masterfrom
sven1977:sac_twin_q_net_not_used_for_min_over_q_values

Conversation

@sven1977
Copy link
Copy Markdown
Contributor

@sven1977 sven1977 commented Feb 27, 2020

In the SAC loss function, we do not use the twin-Q net's output to minimize over Q/twin-Q values (as described in the paper).

Related issue number

Checks

@sven1977 sven1977 requested a review from ericl February 27, 2020 18:50
@AmplabJenkins
Copy link
Copy Markdown

Can one of the admins verify this patch?

@AmplabJenkins
Copy link
Copy Markdown

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/22507/
Test PASSed.

@ericl ericl merged commit 0c9e5db into ray-project:master Feb 27, 2020
@sven1977 sven1977 deleted the sac_twin_q_net_not_used_for_min_over_q_values branch March 3, 2020 10:15
ffbin pushed a commit to antgroup/ant-ray that referenced this pull request Mar 20, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants