[core][telemetry/01] migrate python telemetry recorder to opentelemetry#53098
[core][telemetry/01] migrate python telemetry recorder to opentelemetry#53098can-anyscale merged 2 commits intomasterfrom
Conversation
aslonnie
left a comment
There was a problem hiding this comment.
approve for dependency change.
|
🙌 |
|
@MengjinYan can you review this pls? |
23327c7 to
3c4f326
Compare
3345501 to
1e0aa1e
Compare
|
@MengjinYan's comments |
a744a44 to
4ff61cc
Compare
|
@MengjinYan: I changed We can discuss more in person next week perhaps if this makes no sense. |
This is a series of PR to migrate metric collection from opencencus to openlemetry. For context on the existing components, see #53098. --------------- This PR - Use open telemetry to export gauge metric to grpc endpoint - Grpc endpoint also export received metrics to prometheus server - Add unit tests + e2e tests. Note that I create a test job separately for open telemetry, the grpc server needs to use the network host directly (it doesn't work in a docker-in-docker environment from CI for reasons I don't 100% understand yet). Overall, when `RAY_experimental_enable_open_telemetry_on_core` is True, gauge metric uses open telemetry and everything else keeps using opencensus. Otherwise, everything will use opencensus. This ensure correctness for both options. Test: - CI --------- Signed-off-by: can <can@anyscale.com> Signed-off-by: Cuong Nguyen <128072568+can-anyscale@users.noreply.github.com> Co-authored-by: Dhyey Shah <dhyey2019@gmail.com>
This is a series of PR to migrate metric collection from opencencus to openlemetry. For context on the existing components, see #53098. --------------- This PR - Support Counter metric for `open_telemetry_metric_recorder.cc` - Add a unittest to make sure it works - This feature is not used anywhere yet, since the counter part `open_telemetry_metric_recorder.py` needs to be able to consume the counter metric as well, if this part is used. I'll add that in the next PR Test: - CI --------- Signed-off-by: can <can@anyscale.com> Signed-off-by: Cuong Nguyen <128072568+can-anyscale@users.noreply.github.com> Co-authored-by: Dhyey Shah <dhyey2019@gmail.com>
This is a series of PR to migrate metric collection from opencencus to openlemetry. For context on the existing components, see #53098. --------------- This PR - Support Counter metric for `open_telemetry_metric_recorder.cc` - Add a unittest to make sure it works - This feature is not used anywhere yet, since the counter part `open_telemetry_metric_recorder.py` needs to be able to consume the counter metric as well, if this part is used. I'll add that in the next PR Test: - CI --------- Signed-off-by: can <can@anyscale.com> Signed-off-by: Cuong Nguyen <128072568+can-anyscale@users.noreply.github.com> Co-authored-by: Dhyey Shah <dhyey2019@gmail.com> Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
This is a series of PR to migrate metric collection from opencencus to openlemetry. For context on the existing components, see #53098. ------------ This PR - Support Counter metric on dashboard agent side - Support Counter metric e2e (from worker to dashboard agent) Test: - CI --------- Signed-off-by: can <can@anyscale.com> Signed-off-by: Cuong Nguyen <can@anyscale.com>
This is a series of PR to migrate metric collection from opencencus to openlemetry. For context on the existing components, see ray-project#53098. ------------ This PR - Support Counter metric on dashboard agent side - Support Counter metric e2e (from worker to dashboard agent) Test: - CI --------- Signed-off-by: can <can@anyscale.com> Signed-off-by: Cuong Nguyen <can@anyscale.com>
This is a series of PR to migrate metric collection from opencencus to openlemetry. For context on the existing components, see #53098. --------------- This PR - Support Counter metric for `open_telemetry_metric_recorder.cc` - Add a unittest to make sure it works - This feature is not used anywhere yet, since the counter part `open_telemetry_metric_recorder.py` needs to be able to consume the counter metric as well, if this part is used. I'll add that in the next PR Test: - CI --------- Signed-off-by: can <can@anyscale.com> Signed-off-by: Cuong Nguyen <128072568+can-anyscale@users.noreply.github.com> Co-authored-by: Dhyey Shah <dhyey2019@gmail.com> Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
This is a series of PR to migrate metric collection from opencencus to openlemetry. For context on the existing components, see #53098. ------------ This PR - Support Counter metric on dashboard agent side - Support Counter metric e2e (from worker to dashboard agent) Test: - CI --------- Signed-off-by: can <can@anyscale.com> Signed-off-by: Cuong Nguyen <can@anyscale.com> Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
This is a series of PR to migrate metric collection from opencencus to openlemetry. For context, ray has two main parts that instrument and record metric through opencencus: c++ and python part.
The python part consists of three components:
The c++ part consists of one component (to be deep dived more):
This PR migrates the TelemetryMetricRecorder from using opencencus to opentelemetry. The migration is also control via a flag (which turns on the new stuff by default). Fairly small PR (most files are auto-generated).
Test: