[Elastic Agent] Add process error handling guidelines#29152
[Elastic Agent] Add process error handling guidelines#29152ruflin merged 2 commits intoelastic:masterfrom
Conversation
We don't have a good place yet to document expectations around services run by Elastic Agent. I'm starting this document to have a place to add more content to it but I expect long term we need to figure out a better place. This guideline comes out of recent issues we had in Cloud and local setups of fleet-server (elastic/fleet-server#883). We never set clear guidlines on what the expectation is of a service run by Elastic Agent and Elastic Agent itself. This PR is kick off the discussion.
|
This pull request does not have a backport label. Could you fix it @ruflin? 🙏
NOTE: |
|
@elastic/elastic-agent Would be great to get your input on this specific guideline but also on where and how we should document these things. @graphaelli Thanks for the inspiration to kick this off @simitt might be interested in this too. |
💚 Build Succeeded
Expand to view the summary
Build stats
❕ Flaky test reportNo test was executed to be analysed. 🤖 GitHub commentsTo re-run your PR in the CI, just comment with:
|
graphaelli
left a comment
There was a problem hiding this comment.
Would this be a good place to capture security guidelines? eg configuration of process capabilities (eg via seccomp filters), running as a specific user
|
@graphaelli Yes. We should start to capture here all the guidelines around Elastic Agent. |
Co-authored-by: Gil Raphaelli <g@raphaelli.com>
|
@graphaelli Should we get this initial PR merged so we have a future "place" to add more things to the list? |
|
Sure, that works, thanks for getting this started. |
* upstream/master: [libbeat] Fix add_labels flattening of arrays values (elastic#29211) Change elastic-agent pprof default to false (elastic#29155) elastic#28472 fix flaky tests in libbeat fmtstr to use time.UTC instead of time.Local (elastic#28473) Adopt `parsers` in Filebeat's journald input (elastic#29070) [Elastic Agent] Add process error handling guidelines (elastic#29152) winlogbeat/sys/winevent: use reflect IsZero method (elastic#29190) Remove Journalbeat (elastic#29131) Add note that there is no warranty or support for generator code (elastic#28797) packetbeat: preparation for npcap addition (elastic#29017) Use the generic helper for opening file to read in filestream (elastic#29180) Workflow for macos (elastic#29174) Fix `decode_json_fields` processor to always add error key (elastic#29107)
We don't have a good place yet to document expectations around services run by Elastic Agent. I'm starting this document to have a place to add more content to it but I expect long term we need to figure out a better place.
This guideline comes out of recent issues we had in Cloud and local setups of fleet-server (elastic/fleet-server#883). We never set clear guidlines on what the expectation is of a service run by Elastic Agent and Elastic Agent itself.
This PR is kick off the discussion.