-
Notifications
You must be signed in to change notification settings - Fork 7.4k
Description
Per discussion with several users, there are some possible renames we could do to clarify the internal architecture:
-
Rename
rllib.evaluation.PolicyGraphtorllib.policy.Policy. Similarly,rllib.evaluation.TFPolicyGraphbecomesrllib.policy.TFPolicyand so on. The files are moved to a newrllib/policydir. -
Move therllib/agentsdirectory torllib/train. -
Add TF qualifier for consistency with Torch policies
A3CPolicyGraph => A3CTFPolicy
A3CTorchPolicyGraph => A3CTorchPolicy -
Rename
PolicyEvaluatortoRolloutWorker
deprecatecompute_gradients()/apply_gradients() -
Consolidate
[local_evaluator, remote_evaluators]to singleWorkerSetobjectworkers = WorkerSet(...) local_evaluators => workers.local_worker() remote_evaluators => workers.remote_workers()
For the most part, I think we can do these renames without breaking backwards compatibility by leaving aliases behind. Though, I'm not sure how easy moving an entire directory is.