Skip to content

S.M.A.R.T input plugin: Power On Hours on NVME device not reported #10907

@coccobill1

Description

@coccobill1

Relevant telegraf.conf

[[inputs.smart]]
    path_smartctl = "/Program Files/smartmontools/bin/smartctl.exe"
    interval = "1m"
    enable_extensions = ["auto-on"]
    attributes = true

Logs from Telegraf

C:\Program Files\Telegraf>telegraf --input-filter=smart --test
2022-03-29T07:00:43Z I! Using config file: C:\Program Files\Telegraf\telegraf.conf
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Critical_Warning,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,id=194,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Temperature_Celsius,serial_no=xxx raw_value=47i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Available_Spare,serial_no=xxx raw_value=83i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Available_Spare_Threshold,serial_no=xxx raw_value=10i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Percentage_Used,serial_no=xxx raw_value=5i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,id=12,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Power_Cycle_Count,serial_no=xxx raw_value=44i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Unsafe_Shutdowns,serial_no=xxx raw_value=28i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Media_and_Data_Integrity_Errors,serial_no=xxx raw_value=43i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Warning_Temperature_Time,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Critical_Temperature_Time,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Temperature_Sensor_1,serial_no=xxx raw_value=47i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Temperature_Sensor_2,serial_no=xxx raw_value=48i 1648537244000000000
> smart_device,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,serial_no=xxx exit_status=0i,health_ok=true,temp_c=47i 1648537244000000000

> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Critical_Warning,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,id=194,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Temperature_Celsius,serial_no=xxx raw_value=39i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Available_Spare,serial_no=xxx raw_value=100i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Available_Spare_Threshold,serial_no=xxx raw_value=10i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Percentage_Used,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,id=12,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Power_Cycle_Count,serial_no=xxx raw_value=14i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,id=9,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Power_On_Hours,serial_no=xxx raw_value=496i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Unsafe_Shutdowns,serial_no=xxx raw_value=5i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Media_and_Data_Integrity_Errors,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Error_Information_Log_Entries,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Warning_Temperature_Time,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Critical_Temperature_Time,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Temperature_Sensor_1,serial_no=xxx raw_value=39i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Temperature_Sensor_2,serial_no=xxx raw_value=44i 1648537244000000000
> smart_device,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,serial_no=xxx exit_status=0i,health_ok=true,temp_c=39i 1648537244000000000

System info

Telegraf 1.22.0, smartctl 7.3, Windows 10 (19044,1620)

Docker

No response

Steps to reproduce

  1. Run telegraf with the input.smart plugin and a Samsung NVME drive (not sure about other vendors)

Expected behavior

Telegraf reports the Power On Hours smart_attribute correctly on all NVME drives.

Actual behavior

My Samsung 980 PRO:
Power On Hours: 496

My Samsung 970 EVO:
Power On Hours: 22 371

The 980 value is reported correctly, for the 970 the whole attribute is missing. Looks like smartctl adds an extra blank as the thousands separator in the reported value, which might be causing issues?

Additional info

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions