[rllib] Port DDPG to the build_tf_policy pattern by ericl · Pull Request #5242 · ray-project/ray

ericl · 2019-07-21T23:08:38Z

What do these changes do?

This ports DDPG to the policy builder pattern. This is the last major algorithm that needed to be ported.

Pendulum performance seems to be on par. @joneswong could you check if parameter noise exploration still works as expected? There was a lot of changes around handling in that code.

fyi @qxcv @gehring

Related issue number

Closes #4822
Closes #4788

Linter

I've run scripts/format.sh to lint the changes in this PR.

…nder

This reverts commit 5f64551.

AmplabJenkins · 2019-07-21T23:34:34Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/15530/
Test FAILed.

AmplabJenkins · 2019-07-22T00:22:06Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/15531/
Test FAILed.

AmplabJenkins · 2019-07-22T00:35:03Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/15532/
Test FAILed.

AmplabJenkins · 2019-07-22T00:48:08Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/15534/
Test FAILed.

AmplabJenkins · 2019-07-22T01:03:53Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/15535/
Test FAILed.

AmplabJenkins · 2019-07-22T01:33:29Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/15537/
Test FAILed.

AmplabJenkins · 2019-07-22T01:42:08Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/15538/
Test FAILed.

AmplabJenkins · 2019-07-22T03:14:12Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/15539/
Test PASSed.

AmplabJenkins · 2019-07-22T09:27:18Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-Perf-Integration-PRB/1750/
Test FAILed.

AmplabJenkins · 2019-07-22T09:27:23Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-Perf-Integration-PRB/1751/
Test FAILed.

AmplabJenkins · 2019-07-23T11:06:14Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/15590/
Test PASSed.

ericl added 27 commits July 5, 2019 18:03

port vtrace

40fcf6d

fix vf

476a3f6

fix vs

d947b28

fix the example

507bfe3

wip ddpg

1746360

fix tests

e1f3d5b

fix tests

7681677

Merge branch 'port-remainder' of github.com:ericl/ray into port-remai…

f4c579d

…nder

remove ddpg model

5f64551

Revert "remove ddpg model"

b9290e2

This reverts commit 5f64551.

comments

300035f

wip

c3c4ed9

set vf share layers True by default

8620837

typo

8b308b8

Merge branch 'port-remainder' into port-ddpg

c549dbe

wip

1687f8d

lint

e961cd3

wip

63b6e2c

lint

a8d36f7

Merge remote-tracking branch 'upstream/master' into port-ddpg

15a1489

Merge remote-tracking branch 'upstream/master' into port-ddpg

1a1d0f4

wip

b88b770

now runnable

a8c602e

now trains

c75a710

separate noop model

9750818

use keras layers

28d816a

fix param noise

3aae25c

ericl assigned joneswong Jul 21, 2019

no final linear

7da3a46

ericl force-pushed the port-ddpg branch 4 times, most recently from fd50692 to 10be568 Compare July 22, 2019 00:10

ericl force-pushed the port-ddpg branch from 10be568 to 8081588 Compare July 22, 2019 00:22

auto set update ops

c1a5a11

ericl force-pushed the port-ddpg branch from 8081588 to c1a5a11 Compare July 22, 2019 00:24

td error mixin

b95c87e

ericl force-pushed the port-ddpg branch from 565c3e3 to b95c87e Compare July 22, 2019 00:29

fix apex

3885e3e

ericl added the tests-ok The tagger certifies test failures are unrelated and assumes personal liability. label Jul 22, 2019

more seed

e83730a

richardliaw approved these changes Jul 24, 2019

View reviewed changes

ericl merged commit 60f5963 into ray-project:master Jul 24, 2019

This was referenced Sep 3, 2019

[rllib] mountaincarcontinous-ddpg regression #5604

Closed

[rllib] Revert [rllib] Port DDPG to the build_tf_policy pattern #5626

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[rllib] Port DDPG to the build_tf_policy pattern#5242

[rllib] Port DDPG to the build_tf_policy pattern#5242
ericl merged 32 commits intoray-project:masterfrom
ericl:port-ddpg

ericl commented Jul 21, 2019 •

edited

Loading

Uh oh!

AmplabJenkins commented Jul 21, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 23, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ericl commented Jul 21, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What do these changes do?

Related issue number

Linter

Uh oh!

AmplabJenkins commented Jul 21, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 22, 2019

Uh oh!

AmplabJenkins commented Jul 23, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ericl commented Jul 21, 2019 •

edited

Loading