[Cloud Security] add misconfiguration latest transform to Wiz integration by maxcold · Pull Request #10965 · elastic/integrations

maxcold · 2024-09-02T13:36:04Z

Proposed commit message

Introducing the latest transform for Wiz Cloud Configuration Finding data to support CDR workflows as per [Guide] Adapting data stream to Cloud Security in Kibana

Checklist

I have reviewed tips for building integrations and this pull request is aligned with them.
I have verified that all data streams collect metrics or logs.
I have added an entry to my package's changelog.yml file.
I have verified that Kibana version constraints are current according to guidelines.

Author's Checklist

[ ]

How to test this PR locally

build the package elastic-package build
run stack elastic-package stack up -v -d
Install Wiz integration, enable Cloud Configuration Finding data stream
check that the transform is created and started. It should be healthy after some time
check that Wiz's findings are available in the Findings page

Related issues

contributes to https://github.com/elastic/security-team/issues/9664

Screenshots

maxcold · 2024-09-06T14:31:42Z

/ci

elasticmachine · 2024-09-09T07:13:23Z

🚀 Benchmarks report

To see the full report comment with /test benchmark fullreport

elasticmachine · 2024-09-12T15:49:58Z

Pinging @elastic/security-service-integrations (Team:Security-Service Integrations)

CohenIdo · 2024-09-16T06:31:11Z

packages/wiz/elasticsearch/transform/latest_cdr_misconfigurations/transform.yml

+      move_on_creation: true
+latest:
+  unique_key:
+    - rule.uuid


Why are we using rule.uuid instead of rule.id? The native Transform utilizes rule.id, and I noticed that it is also part of the ECS schema.

@CohenIdo this is a valid question, the reasons I chose rule.uuid for the unique key are the following:

our native integration doesn't provide rule.uuid but rule.id has the format fe083488-fa0f-5408-9624-ac27607ac2ff which is basically the uuid as per ECS schema

As per ECS rule.id is a unique id in some context and this context is narrower than for rule.uuid. In the context of Cloud Security it might be that the id is unique within the benchmark, but not unique across multiple benchmarks, therefore I thought using uuid allow for better uniqueness, taking into account that our native also uses uuid, while providing it in the rule.id field
I will reflect this in the guide, sharing that the id should be unique across benchmarks

For user convenience, I believe it's worth using the same field names across integrations.

However, if you decide to go with rule.uuid, let's at least create a ticket to migrate from rule.id to rule.uuid in the native integration.

For user convenience, I believe it's worth using the same field names across integrations.

Overall I agree that consistency is better, but do you have a specific influence on the UX if we go for a different field between Wiz and CSP? I could not think of any specific downside for the users

However, if you decide to go with rule.uuid, let's at least create a ticket to migrate from rule.id to rule.uuid in the native integration.

I summarised all current mismatches between Wiz and our native CSP integration in this document https://docs.google.com/document/d/1GG3st6wR-9lNh4HjHsNlam5lo2oWceRK67-IcvyIVjk/edit , the mismatch between rule.uuid and rule.id is one of the points there. I will create GitHub issues after I gather some feedback on these points. Can you also review the document?

One scenario is where a user is investigating misconfiguration documents and needs to determine the rule ID field for each integration. Or, when a user wants to add a rule ID column to the data grid and has both integrations installed, they would have to add two columns that represent the same information.

@CohenIdo both use cases related to the rule.id vs rule.uuid in general, and not to which field to use in the transform as far as I understand. Whatever field we choose for the transform, these problems will stay due to our native integration not following ECS and providing uiud in rule.id and not providing rule.uuid at all. I believe we should not make changes in Wiz to make it not compliant with ECS just for it to be consistent with our native integration. I think we rather need to fix our native integration to be compliant with ECS and then the problem will be resolved properly

kcreddy

The transform's manifest.yml could be defined as per this spec: https://github.com/elastic/package-spec/blob/main/spec/integration/elasticsearch/transform/manifest.spec.yml

It contains properties such as start, but more importantly destination_index_template.settings.index.sort which defines the sort order of documents within the index. According to Elasticsearch doc, we don't seem to have default field for index.sort.

By default Lucene does not apply any sort. The index.sort.* settings define which fields should be used to sort the documents inside each Segment.

So I think we have to define this manually. Example: https://github.com/elastic/integrations/blob/main/packages/ti_threatconnect/elasticsearch/transform/latest/manifest.yml

kcreddy · 2024-09-17T14:14:11Z

packages/wiz/elasticsearch/transform/latest_cdr_misconfigurations/fields/ecs.yml

+- name: event.outcome
+  external: ecs
+- name: observer.vendor
+  external: ecs


There are few more ECS fields being populated in the source datastream's ingest pipeline, like cloud.region, event.category, etc. Shouldn't we be defininig them as well?

maxcold · 2024-09-18T21:05:03Z

@kcreddy thanks for the code review. Regarding the start, we decided not to specify it as true seems to be a default, meaning the transform starts right away anyway.
As for destination_index_template.settings.index.sort, it's a good point, though we don't have this setting on our native latest transform (the one transforming data coming from cloud_security_posture integration). I wonder what is the effect of this sort setting not being set as we haven't experienced any downside yet with our native integration. Do you know what exactly is the effect for the users if we leave it out? If it requires further investigation, I would leave this setting out to be consistent with our native integration

elastic-sonarqube · 2024-09-18T21:37:12Z

Quality Gate failed

Failed conditions
0.5% Coverage on New Code (required ≥ 80%)

See analysis details on SonarQube

elasticmachine · 2024-09-18T21:37:13Z

💚 Build Succeeded

Buildkite Build
Commit: a7863a1

History

💚 Build #15880 succeeded 9c42875
💚 Build #15840 succeeded 6bb05f0
💚 Build #15826 succeeded 6a6f6c2
💚 Build #15635 succeeded c75def5
💔 Build #15310 failed 05332a5ca8d09c5932ef7d9321a450dda3a26280

kcreddy · 2024-09-19T06:59:14Z

@maxcold,

I wonder what is the effect of this sort setting not being set as we haven't experienced any downside yet without native integration. Do you know what exactly is the effect for the users if we leave it out? If it requires further investigation, I would leave this setting out to be consistent with our native integration

Based on the docs, I can see that when index.sort is set, the documents are stored in sorted order inside the segments within a shard. This makes searches such as Get first N documents faster. Even for such queries, note that track_total_hits needs to be false to disable the aggregation being performed on all documents.
Another benefit that docs talk about it having efficient conjunctions, but they only seem to work on low-cardinality fields.

There are also caveats mentioned regarding performance while indexing the documents with index.sort enabled:

Index sorting also has a cost in terms of indexing throughput since documents must be sorted at flush and merge time.

The default source data-stream backed indices of logs-* also don't seem to contain the setting index.sort.

I agree that we should not set this property especially since you haven't experienced any downside yet in native integration.

elastic-vault-github-plugin-prod · 2024-10-30T10:25:39Z

Package wiz - 2.0.0 containing this change is available at https://epr.elastic.co/search?package=wiz

…tion (elastic#10965) * add misconfiguration latest transform to Wiz integration * add PR link to changelog * add ecs mapping to wiz latest cdr misconfigurations transform * add missing ecs fields to wiz misconfiguration transform mapping

maxcold added enhancement New feature or request Team:Cloud Security Cloud Security team [elastic/cloud-security-posture] labels Sep 2, 2024

andrewkroh added the Integration:wiz Wiz label Sep 2, 2024

maxcold mentioned this pull request Sep 3, 2024

[Cloud Security] add privileges required for CDR misconfiguration features to work elastic/elasticsearch#112456

Merged

maxcold added 2 commits September 9, 2024 08:52

add misconfiguration latest transform to Wiz integration

a0bc3e0

add PR link to changelog

c75def5

maxcold added 2 commits September 12, 2024 14:08

Merge branch 'main' into csp-add-wiz-misconfigruation-findings-transform

6a6f6c2

add ecs mapping to wiz latest cdr misconfigurations transform

6bb05f0

maxcold marked this pull request as ready for review September 12, 2024 15:41

maxcold requested a review from a team as a code owner September 12, 2024 15:41

maxcold requested review from a team and CohenIdo September 12, 2024 15:41

andrewkroh added the Team:Security-Service Integrations Security Service Integrations team [elastic/security-service-integrations] label Sep 12, 2024

maxcold changed the title ~~[WIP][Cloud Security] add misconfiguration latest transform to Wiz integration~~ [Cloud Security] add misconfiguration latest transform to Wiz integration Sep 13, 2024

Merge branch 'main' into csp-add-wiz-misconfigruation-findings-transform

9c42875

CohenIdo reviewed Sep 16, 2024

View reviewed changes

maxcold requested a review from CohenIdo September 17, 2024 09:30

kcreddy reviewed Sep 17, 2024

View reviewed changes

add missing ecs fields to wiz misconfiguration transform mapping

a7863a1

maxcold requested a review from kcreddy September 18, 2024 21:17

kcreddy approved these changes Sep 19, 2024

View reviewed changes

maxcold merged commit d9da904 into elastic:main Sep 19, 2024

maxcold deleted the csp-add-wiz-misconfigruation-findings-transform branch September 19, 2024 11:47

Conversation

maxcold commented Sep 2, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Proposed commit message

Checklist

Author's Checklist

How to test this PR locally

Related issues

Screenshots

Uh oh!

maxcold commented Sep 6, 2024

Uh oh!

elasticmachine commented Sep 9, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🚀 Benchmarks report

Uh oh!

elasticmachine commented Sep 12, 2024

Uh oh!

CohenIdo Sep 16, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

maxcold Sep 16, 2024

Choose a reason for hiding this comment

Uh oh!

CohenIdo Sep 17, 2024

Choose a reason for hiding this comment

Uh oh!

maxcold Sep 17, 2024

Choose a reason for hiding this comment

Uh oh!

CohenIdo Sep 17, 2024

Choose a reason for hiding this comment

Uh oh!

maxcold Sep 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kcreddy left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kcreddy Sep 17, 2024

Choose a reason for hiding this comment

Uh oh!

maxcold commented Sep 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elastic-sonarqube bot commented Sep 18, 2024

Quality Gate failed

Uh oh!

elasticmachine commented Sep 18, 2024

💚 Build Succeeded

History

Uh oh!

kcreddy commented Sep 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elastic-vault-github-plugin-prod bot commented Oct 30, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

maxcold commented Sep 2, 2024 •

edited

Loading

elasticmachine commented Sep 9, 2024 •

edited

Loading

CohenIdo Sep 16, 2024 •

edited

Loading

maxcold Sep 18, 2024 •

edited

Loading

kcreddy left a comment •

edited

Loading

maxcold commented Sep 18, 2024 •

edited

Loading

kcreddy commented Sep 19, 2024 •

edited

Loading