Add plugin to monitor AMD GPUs using ROCm System Management Interface rocm-smi #9601

@mconcas

Feature Request

As is already done for NVIDIA GPUs, it would be nice to be able to query metrics from AMD devices using the rocm-smi [0] utility.

Proposal:

Write a plugin that parses the JSON output of the rocm-smi executable and extracts metrics to be sent via Telegraf.
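To make the proposal concrete, here is a rough sketch of the parsing step only. It assumes rocm-smi emits a JSON object keyed by device (e.g. "card0") whose values map metric names to strings. The sample payload, the key names in it, and the function name `parse_rocm_smi` are all hypothetical and only for illustration; the real output depends on the rocm-smi version and the flags used. An actual Telegraf plugin would be written in Go, so this is not meant as the implementation.

```python
import json

# Hypothetical sample payload. Real rocm-smi key names and structure
# depend on the tool version and on which flags are passed.
SAMPLE = """{
  "card0": {"GPU use (%)": "12",
            "Temperature (Sensor edge) (C)": "41.0",
            "Card series": "Example GPU"},
  "card1": {"GPU use (%)": "0",
            "Temperature (Sensor edge) (C)": "35.5"}
}"""


def parse_rocm_smi(payload):
    """Turn a rocm-smi style JSON string into one metric dict per device.

    Each result has a "tags" dict (the device name) and a "fields" dict
    holding only the values that can be read as numbers.
    """
    metrics = []
    for device, values in sorted(json.loads(payload).items()):
        if not isinstance(values, dict):
            continue  # skip top-level entries that are not per-device maps
        fields = {}
        for key, raw in values.items():
            try:
                fields[key] = float(raw)
            except (TypeError, ValueError):
                continue  # non-numeric values (e.g. model names) are dropped
        if fields:
            metrics.append({"tags": {"device": device}, "fields": fields})
    return metrics


if __name__ == "__main__":
    for metric in parse_rocm_smi(SAMPLE):
        print(metric["tags"], metric["fields"])
```

Dropping non-numeric values keeps the sketch short; a real plugin would probably expose strings such as the card model as tags instead.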

Current behavior:

I could not find any existing effort in this direction.

Desired behavior:

Use case:

This would be very convenient in any scenario where ROCm [1] is deployed as the computing environment for GPGPU applications.

Some pointers:

[0] https://rocmdocs.amd.com/en/latest/Installation_Guide/ROC-smi.html
[1] https://rocmdocs.amd.com/en/latest/index.html
