[Metrics] Add Prometheus counters for Model FLOPs Utilization (MFU)#30950
[Metrics] Add Prometheus counters for Model FLOPs Utilization (MFU)#30950markmc merged 2 commits intovllm-project:mainfrom
Conversation
|
Documentation preview: https://vllm--30950.org.readthedocs.build/en/30950/ |
There was a problem hiding this comment.
Code Review
This pull request successfully adds Prometheus counters for Model FLOPs Utilization (MFU) metrics, making them available for monitoring. The changes are well-integrated with the existing metrics system, including support for Ray environments. The documentation has also been updated accordingly. I've identified one area for improvement related to code duplication that would enhance maintainability.
|
Hi @markmc, the pre-commit checks have failed. Please run: uv pip install pre-commit
pre-commit install
pre-commit run --all-filesThen, commit the changes and push to your branch. For future commits, Tip Is
|
|
hi @markmc any updates? |
|
It looks like all the tests passed but it's not been reviewed |
|
I don't see anything that adds |
Yes, this is ready for review/merge
This was added by #30738 which just added console logging of these metrics. This PR follows-on to add Prometheus support |
c69c0c5 to
bd84989
Compare
bd84989 to
76f98f9
Compare
hmellor
left a comment
There was a problem hiding this comment.
Otherwise the docs build will complain that perf.md can't be navigated to
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: Mark McLoughlin <markbmc@gmail.com>
57df7a6 to
fb98350
Compare
…llm-project#30950) Export the existing Model FLOPs Utilization (MFU) metrics via Prometheus. `--enable-mfu-metrics` is required for these to be exposed. Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: Mark McLoughlin <markmc@redhat.com>
…llm-project#30950) Export the existing Model FLOPs Utilization (MFU) metrics via Prometheus. `--enable-mfu-metrics` is required for these to be exposed. Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: Mark McLoughlin <markmc@redhat.com>
…llm-project#30950) Export the existing Model FLOPs Utilization (MFU) metrics via Prometheus. `--enable-mfu-metrics` is required for these to be exposed. Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: Mark McLoughlin <markmc@redhat.com>
See #30738 - this is a follow-on to export these metrics via Prometheus in addition to the console logging
The metrics are only calculated and available with
--enable-mfu-metrics