[Meta] Investigate resource consumption of Elastic Agent with K8s Integration #3801

## Background
Recent issues such as [3863](https://github.com/elastic/sdh-beats/issues/3863), [3991](https://github.com/elastic/sdh-beats/issues/3991) and [4081](https://github.com/elastic/sdh-beats/issues/4081) showed that installing Elastic Agent with the default configuration of our Kubernetes Integration can leave customers in a bad state, sometimes even with a broken k8s cluster. Many details and variables affect the final setup and installation of our observability solution, and we try to summarise and list them here.

## Goals
This issue summarises the next actions we need to take in order to investigate:
- The current resource consumption of the default Elastic Agent with the K8s Integration
- Alternative options we can offer to minimise the impact on k8s cluster resource consumption across different k8s environments and customer setups

## Actions
### Current Actions
So far we have observed that:
a) Memory consumption of Elastic Agent has increased from version 8.8 to version 8.9 and later (relevant: https://github.com/elastic/sdh-beats/issues/3863#issuecomment-1733750863)
b) The number of API calls towards the Kubernetes API server has increased since version 8.9 (see Salesforce 01507229 regarding Elastic Agent overloading the Kubernetes API server: https://github.com/elastic/sdh-beats/issues/3991#issuecomment-1787648161)
c) CPU consumption has been [referred here](https://github.com/elastic/sdh-beats/issues/4081) as a concern, although it is not a big issue at the moment and not the first priority.

Until now:
- Since 8.11 we have updated [elastic-agent-autodiscover](https://github.com/elastic/elastic-agent-autodiscover/releases/tag/v0.6.4) to v0.6.4 ([beats PR](https://github.com/elastic/beats/pull/36879)). This disables metadata for deployment and cronjob: Pods created from Deployments or CronJobs will not have the extra `kubernetes.deployment` or `kubernetes.cronjob` metadata fields.
- We have merged [leader election configuration variables](https://github.com/elastic/elastic-agent/pull/3625)
- We have proposed a way to disable leader election in managed Elastic Agents (see [here](https://github.com/elastic/sdh-beats/issues/3863#issuecomment-1785308398))

### Next Planned Actions
- https://github.com/elastic/beats/issues/37243
- https://github.com/elastic/elastic-agent/issues/3594
- https://github.com/elastic/beats/issues/37179
- https://github.com/elastic/elastic-agent/issues/4670

### Future Plans/Actions

- [ ] Propose the use of Kubernetes API Priority and Fairness (flow control): https://kubernetes.io/docs/concepts/cluster-administration/flow-control/
- [ ] Run tests in real k8s clusters and retrieve Agent diagnostics in order to investigate memory consumption
- [ ] Check the number of API calls made by Agent, using audit logs or another relevant method. Try to suggest rate limiting of API calls on Agent startup or in case of errors
   - [ ] https://github.com/elastic/beats/issues/37922
- [ ] Any other solutions/fixes that come out of the investigation should be linked here
  - [x] https://github.com/elastic/beats/issues/37243
  - [x] https://github.com/elastic/beats/issues/30086
  - [x] https://github.com/elastic/beats/issues/34717
- [x] https://github.com/elastic/elastic-agent/issues/4122
- [ ] Automate the cluster creation and reproduction of specific issues  
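
For the audit log item above, a minimal sketch of how the API calls made by Agent could be counted is shown below. It assumes the API server writes audit events as JSON lines (the `audit.k8s.io/v1` `Event` format, with `stage`, `verb`, `user.username`, `userAgent` and `objectRef.resource` fields) and that the audit policy records the `ResponseComplete` stage. The function name `count_agent_requests` and the default marker `elastic-agent` are illustrative assumptions; the marker should match whatever service account or user agent the Agent actually uses in the cluster under test.

```python
import json
from collections import Counter
from typing import Iterable


def count_agent_requests(lines: Iterable[str], agent_marker: str = "elastic-agent") -> Counter:
    """Count finished API requests per (verb, resource) for one client.

    `agent_marker` is an assumed substring of the Agent's service account
    name or user agent string; adjust it to the actual deployment.
    """
    counts: Counter = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            event = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip truncated or non-JSON lines
        if not isinstance(event, dict):
            continue
        # A single request can emit several events (one per stage),
        # so only the final stage is counted to avoid double counting.
        if event.get("stage") != "ResponseComplete":
            continue
        username = (event.get("user") or {}).get("username") or ""
        user_agent = event.get("userAgent") or ""
        if agent_marker not in username and agent_marker not in user_agent:
            continue
        verb = event.get("verb") or "unknown"
        resource = (event.get("objectRef") or {}).get("resource") or "unknown"
        counts[(verb, resource)] += 1
    return counts
```

Running this over audit logs collected from clusters on different Agent versions, for example with `count_agent_requests(open(path))` where `path` is the audit log file, would give a per-verb and per-resource breakdown that can be compared between versions.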



