[service-checker] Fix the service-checker issue which is caused by PR 17836 #18109

Merged
rlhui merged 1 commit into sonic-net:master from mlok-nokia:fix-service-checker
Feb 21, 2024

@mlok-nokia
Contributor

Why I did it

PR #17836 added container checking for database-chassis on the Supervisor, but the corresponding database-chassis entry in container_feature_dict[] is missing. This causes the following exception to be shown on the Supervisor:

admin@ixre-cpm-chassis13:~$ sudo show system-health sum 
System status summary

  System status LED  red
  Services:
    Status: OK
  Hardware:
    Status: Not OK
    Reasons: PSU 12 is missing or not available
	     PSU 11 is missing or not available
	     PSU 10 is missing or not available
	     PSU 9 is missing or not available
	     PSU 8 is missing or not available
	     Failed to perform health check for ServiceChecker due to exception - KeyError('database-chassis')

Work item tracking
  • Microsoft ADO (number only):

How I did it

Added the database-chassis feature entry to container_feature_dict[] to address this issue.
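
The failure mode can be illustrated with a minimal sketch. It assumes the checker looks up each expected container in a container-to-feature mapping. The helper `map_containers_to_features`, the list of expected containers, and the mapped values are hypothetical and are not taken from the actual service-checker code.

```python
# Hypothetical sketch: names and values are illustrative assumptions,
# not the actual SONiC service-checker implementation.

def map_containers_to_features(expected_containers, container_feature_dict):
    """Return the feature name for each expected container.

    Raises KeyError if a container has no entry in container_feature_dict.
    """
    return {name: container_feature_dict[name] for name in expected_containers}


expected = ["database", "database-chassis"]

# Before the fix: "database-chassis" is expected but has no mapping entry.
before_fix = {"database": "database"}
try:
    map_containers_to_features(expected, before_fix)
    error = None
except KeyError as exc:
    error = repr(exc)  # same form as the message in the health summary

# After the fix: the missing entry is added (the mapped value is assumed).
after_fix = dict(before_fix)
after_fix["database-chassis"] = "database"
features = map_containers_to_features(expected, after_fix)
```

In this sketch, the lookup fails only because the expected-container list and the mapping went out of sync, which matches the description above: one was extended by PR #17836 and the other was not.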

How to verify it

On the Supervisor, run "show system-health summary". No exception error should be shown:

admin@ixre-cpm-chassis13:~$ sudo show system-health sum 
System status summary

  System status LED  red
  Services:
    Status: OK
  Hardware:
    Status: Not OK
    Reasons: PSU 12 is missing or not available
	     PSU 11 is missing or not available
	     PSU 10 is missing or not available
	     PSU 9 is missing or not available
	     PSU 8 is missing or not available

Which release branch to backport (provide reason below if selected)

  • 201811
  • 201911
  • 202006
  • 202012
  • 202106
  • 202111
  • 202205
  • 202211
  • 202305

Tested branch (Please provide the tested image version)

Description for the changelog

Link to config_db schema for YANG module changes

A picture of a cute animal (not mandatory but encouraged)

…17836

Signed-off-by: mlok <marty.lok@nokia.com>
@mlok-nokia mlok-nokia requested a review from lguohan as a code owner February 16, 2024 16:47
@mlok-nokia
Contributor Author

@judyjoseph @abdosi The OC test failed on this service checker with MSFT build image .55. Please review this PR. Thanks.

@rlhui rlhui requested a review from abdosi February 20, 2024 19:11
@abdosi
Contributor

abdosi commented Feb 23, 2024

@mlok-nokia: please create a PR for 202305, as there is a conflict.

@mlok-nokia
Contributor Author

@judyjoseph Are we going to cherry-pick this to 202205 for the next build?

@judyjoseph
Contributor

judyjoseph commented Apr 3, 2024

@gechiang pls cherry-pick this PR to 202205 - thanks

@gechiang gechiang added the Included in Chassis for 202205 Branch Indicate PR is already in MSFT repo 202205 branch label Apr 3, 2024
@gechiang
Collaborator

gechiang commented Apr 3, 2024

picked up in 202205-msft repo:
Azure/sonic-buildimage-msft#317

mssonicbld pushed a commit to mssonicbld/sonic-buildimage that referenced this pull request Apr 5, 2024
…17836 (sonic-net#18109)

@mssonicbld
Collaborator

Cherry-pick PR to 202311: #18582

mssonicbld pushed a commit that referenced this pull request Apr 8, 2024
…17836 (#18109)

mlok-nokia added a commit to mlok-nokia/sonic-buildimage that referenced this pull request Jun 5, 2024
…17836 (sonic-net#18109)

@gechiang
Collaborator

@yxieca, who can help review/approve this PR for the 202311 branch backport?

@mlok-nokia mlok-nokia deleted the fix-service-checker branch September 27, 2024 15:05