-
Notifications
You must be signed in to change notification settings - Fork 5.8k
S.M.A.R.T input plugin: Power On Hours on NVME device not reported #10907
Copy link
Copy link
Closed
Labels
area/smartbugunexpected problem or unintended behaviorunexpected problem or unintended behaviorplatform/windows
Description
Relevant telegraf.conf
[[inputs.smart]]
path_smartctl = "/Program Files/smartmontools/bin/smartctl.exe"
interval = "1m"
enable_extensions = ["auto-on"]
attributes = trueLogs from Telegraf
C:\Program Files\Telegraf>telegraf --input-filter=smart --test
2022-03-29T07:00:43Z I! Using config file: C:\Program Files\Telegraf\telegraf.conf
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Critical_Warning,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,id=194,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Temperature_Celsius,serial_no=xxx raw_value=47i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Available_Spare,serial_no=xxx raw_value=83i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Available_Spare_Threshold,serial_no=xxx raw_value=10i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Percentage_Used,serial_no=xxx raw_value=5i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,id=12,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Power_Cycle_Count,serial_no=xxx raw_value=44i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Unsafe_Shutdowns,serial_no=xxx raw_value=28i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Media_and_Data_Integrity_Errors,serial_no=xxx raw_value=43i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Warning_Temperature_Time,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Critical_Temperature_Time,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Temperature_Sensor_1,serial_no=xxx raw_value=47i 1648537244000000000
> smart_attribute,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,name=Temperature_Sensor_2,serial_no=xxx raw_value=48i 1648537244000000000
> smart_device,device=sde,host=CHRONUS,model=Samsung\ SSD\ 970\ EVO\ 500GB,serial_no=xxx exit_status=0i,health_ok=true,temp_c=47i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Critical_Warning,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,id=194,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Temperature_Celsius,serial_no=xxx raw_value=39i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Available_Spare,serial_no=xxx raw_value=100i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Available_Spare_Threshold,serial_no=xxx raw_value=10i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Percentage_Used,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,id=12,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Power_Cycle_Count,serial_no=xxx raw_value=14i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,id=9,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Power_On_Hours,serial_no=xxx raw_value=496i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Unsafe_Shutdowns,serial_no=xxx raw_value=5i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Media_and_Data_Integrity_Errors,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Error_Information_Log_Entries,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Warning_Temperature_Time,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Critical_Temperature_Time,serial_no=xxx raw_value=0i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Temperature_Sensor_1,serial_no=xxx raw_value=39i 1648537244000000000
> smart_attribute,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,name=Temperature_Sensor_2,serial_no=xxx raw_value=44i 1648537244000000000
> smart_device,device=sdd,host=CHRONUS,model=Samsung\ SSD\ 980\ PRO\ 2TB,serial_no=xxx exit_status=0i,health_ok=true,temp_c=39i 1648537244000000000
System info
Telegraf 1.22.0, smartctl 7.3, Windows 10 (19044,1620)
Docker
No response
Steps to reproduce
- Run telegraf with the input.smart plugin and a Samsung NVME drive (not sure about other vendors)
Expected behavior
Telegraf reports the Power On Hours smart_attribute correctly on all NVME drives.
Actual behavior
My Samsung 980 PRO:
Power On Hours: 496
My Samsung 970 EVO:
Power On Hours: 22 371
The 980 value is reported correctly, for the 970 the whole attribute is missing. Looks like smartctl adds an extra blank as the thousands separator in the reported value, which might be causing issues?
Additional info
No response
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
area/smartbugunexpected problem or unintended behaviorunexpected problem or unintended behaviorplatform/windows