feat!: Unified variables and adds support for IAM policies by adlersantos · Pull Request #341 · GoogleCloudPlatform/public-datasets-pipelines

adlersantos · 2022-04-12T15:08:07Z

Description

Unified variables

This PR supports an optional variables file (.vars.{ENV}.yaml) under every dataset folder, that can contain all the values of variables needed for that dataset's infra and the pipeline configurations.

The namespaces supported in the YAML file are

infra: a set of key-value pairs that are copied as Terraform variables under infra/terraform.tfvars
pipelines: a JSON object that contains dataset-specific variables (Airflow variables), copied to .{env}/datasets/{DATASET}/pipelines/{dataset}_variables.json

Because these YAML files might contain sensitive information, they aren't checked into the repo.

Support for IAM policies for GCS buckets and BQ datasets

This PR also adds support for adding IAM policies into the Terraform resource definitions of GCS buckets and BQ datasets using the YAML variables above. The convention is to use

infra:
  iam_policies:
    storage_buckets:
      bucket_name:
        - role: roles/storage.objectViewer
          members:
            - user:some-user@google.com
    bigquery_datasets:
      dataset_name:
        - role: roles/bigquery.dataViewer
          members:
            - allAuthenticatedUsers
            - user:another-user@example.com

in the YAML variables file to associate a list of IAM roles to a specific GCS bucket or BQ dataset. These IAM roles will be included in the generated Terraform files as IAM policy resources.

Checklist

Note: If an item applies to you, all of its sub-items must be fulfilled

happyhuman

LGTM. Just one comment about open(...) usage.

happyhuman · 2022-04-12T17:05:31Z

-    gcs_uri = f"gs://{composer_bucket}/data/variables/{filename}"
+    pipeline_vars_file = f"{dataset_id}_variables.json"
+    env_vars_file = DATASETS_PATH / dataset_id / f".vars{env_path.name}.yaml"
+    env_vars = yaml.load(open(env_vars_file)) if env_vars_file.exists() else {}


It may be better to open the file using the with clause.

adlersantos added 5 commits April 11, 2022 18:57

support infra vars and IAM policies for TF generation

78b11cb

support IAM policies in Terraform templates

0b745b5

wait for black formatting to finish

d9df1eb

Merge branch 'main' into infra-vars

082a2d6

use pipeline vars and removed shared vars

c3de2bb

adlersantos added feature request New feature or request cleanup Cleanup or refactor code revision: feature Modify an existing feature labels Apr 12, 2022

adlersantos requested a review from happyhuman April 12, 2022 15:08

happyhuman approved these changes Apr 12, 2022

View reviewed changes

happyhuman merged commit c4a45a0 into main Apr 12, 2022

happyhuman deleted the infra-vars branch April 12, 2022 17:16

release-please Bot mentioned this pull request Apr 12, 2022

chore(main): release 4.0.0 #321

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat!: Unified variables and adds support for IAM policies#341

feat!: Unified variables and adds support for IAM policies#341
happyhuman merged 5 commits into
mainfrom
infra-vars

adlersantos commented Apr 12, 2022 •

edited

Loading

Uh oh!

happyhuman left a comment

Uh oh!

happyhuman Apr 12, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

adlersantos commented Apr 12, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Unified variables

Support for IAM policies for GCS buckets and BQ datasets

Checklist

Uh oh!

happyhuman left a comment

Choose a reason for hiding this comment

Uh oh!

happyhuman Apr 12, 2022

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

adlersantos commented Apr 12, 2022 •

edited

Loading