add more logging fields that can be set in construction time#51260
add more logging fields that can be set in construction time#51260zhaojuanmao wants to merge 12 commits intogh/zhaojuanmao/58/basefrom
Conversation
add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) [ghstack-poisoned]
💊 CI failures summary and remediationsAs of commit 0b04c41 (more details on the Dr. CI page):
This comment was automatically generated by Dr. CI (expand for details).Follow this link to opt-out of these comments for your Pull Requests.Please report bugs/suggestions to the (internal) Dr. CI Users group. |
add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) ghstack-source-id: 120541380 Pull Request resolved: #51260
add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) [ghstack-poisoned]
add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) [ghstack-poisoned]
Pull Request resolved: #51260 add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type ghstack-source-id: 120603119 Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/)
add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) [ghstack-poisoned]
Pull Request resolved: #51260 add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type ghstack-source-id: 120693261 Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/)
add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) [ghstack-poisoned]
Pull Request resolved: #51260 add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type ghstack-source-id: 120805134 Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/)
|
@rohan-varma thanks for reviewing the diff. I can see your major concern is about logging the env variables that depends on the specific backends. Here are the reasons why I log them as it is right now:
For now, I think we can either 1) log the small set of env variables as the starting pointing in this diff 2) remove NCCL/Gloo env variables right now, add them back later on once you fleshed out debug ability design and list the variables we really need. I prefer 1) though, as we can start with them and see how useful they are. How do you think? |
add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) [ghstack-poisoned]
Pull Request resolved: #51260 add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type ghstack-source-id: 120885387 Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/)
|
regarding separating logging codes from reducer core logic, I think we can have a logger that wraps ddp_logging_data pointer and also reducer pointer, then put all logging related methods into logger. Let me try that |
add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) [ghstack-poisoned]
add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) [ghstack-poisoned]
Pull Request resolved: #51260 add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type ghstack-source-id: 121021915 Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D26118245/)!
add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) [ghstack-poisoned]
Pull Request resolved: #51260 add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type ghstack-source-id: 121022069 Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D26118245/)!
add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) [ghstack-poisoned]
Pull Request resolved: #51260 add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type ghstack-source-id: 121045907 Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D26118245/)!
Sure, (1) sounds good to me for now and we can refactor later as needed. |
rohan-varma
left a comment
There was a problem hiding this comment.
LGTM, although I'm sort of confused about a line in the test, please double check that, and some other small comments.
add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) [ghstack-poisoned]
Pull Request resolved: #51260 add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type ghstack-source-id: 121102207 Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D26118245/)!
add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) [ghstack-poisoned]
Pull Request resolved: #51260 add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type ghstack-source-id: 121260224 Differential Revision: [D26118245](https://our.internmc.facebook.com/intern/diff/D26118245/) **NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D26118245/)!
|
This pull request has been merged in 18e0a61. |
…#51260) Summary: Pull Request resolved: pytorch#51260 add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type ghstack-source-id: 121260224 Test Plan: unit tests Reviewed By: rohan-varma Differential Revision: D26118245 fbshipit-source-id: ba48b7a11340bda1f5f3b24c8603545d346361e9
…#51260) Summary: Pull Request resolved: pytorch#51260 add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type ghstack-source-id: 121260224 Test Plan: unit tests Reviewed By: rohan-varma Differential Revision: D26118245 fbshipit-source-id: ba48b7a11340bda1f5f3b24c8603545d346361e9
…#51260) Summary: Pull Request resolved: pytorch#51260 add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type ghstack-source-id: 121260224 Test Plan: unit tests Reviewed By: rohan-varma Differential Revision: D26118245 fbshipit-source-id: ba48b7a11340bda1f5f3b24c8603545d346361e9
Stack from ghstack:
add more logging fields to DDPLoggingData, including param stats, bucket stats, environment variables, nccl version, data type
Differential Revision: D26118245