Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Terraform sync tool #290

Draft
wants to merge 53 commits into
base: master
Choose a base branch
from

Conversation

danieldeleo
Copy link
Collaborator

No description provided.

@google-cla
Copy link

google-cla bot commented Jun 8, 2022

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

Comment on lines +77 to +87
├── modules # Terraform modules directory
│ ├── bigquery # Example Terraform BigQuery Setup
│ └── ... # Other modules setup you have
├── qa # qa environment directory
│ ├── terragrunt.hcl
│ └── terraform-sync-tool # Tool terraform-sync-tool
│ ├── json_schemas # Terraform schema files
│ ├── terragrunt.hcl
│ └── ...
├── cloudbuild.yaml # Cloud Build configuration file
├── deploy.sh # Build Step 0 - contains terragrunt commands
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

All of these should go inside an example/ directory

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated in 5496102

Comment on lines 3 to 6
This directory contains the setup for the Terraform Sync Tool. Terraform Sync Tool was designed to address the schema drifts in BigQuery tables and keep the
Terraform schemas up-to-date with the BigQuery table schemas in production environment. Schema drifts occurred when BigQuery Table schemas are updated by newly
ingested data while Terraform schema files contain the outdated schemas. Therefore, this tool will detect the schema drifts, trace the origins of the drifts, and alert
developers/data engineers.
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
This directory contains the setup for the Terraform Sync Tool. Terraform Sync Tool was designed to address the schema drifts in BigQuery tables and keep the
Terraform schemas up-to-date with the BigQuery table schemas in production environment. Schema drifts occurred when BigQuery Table schemas are updated by newly
ingested data while Terraform schema files contain the outdated schemas. Therefore, this tool will detect the schema drifts, trace the origins of the drifts, and alert
developers/data engineers.
This directory contains the Terraform Sync Tool. This tool intentionally fails your CI/CD pipeline when schema drifts occur between what your BigQuery Terraform resources declare and what's actually present in your BigQuery environment. Theses schema drifts happen when BigQuery tables are updated by processes outside of Terraform (ETL process may dynamically add new columns when loading data into BigQuery). When drifts occur, you end up with outdated BigQuery Terraform resource files. This tool detects the schema drifts, traces the origins of the drifts, and alerts developers/data engineers (by failing the CI/CD pipeline) so they can patch the Terraform in their current commit.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

updated in 7f469e4

ingested data while Terraform schema files contain the outdated schemas. Therefore, this tool will detect the schema drifts, trace the origins of the drifts, and alert
developers/data engineers.

The Terraform Schema Sync Tool fails the build attemps if resource drifts are detected and notifies the latest resource information. Developers and data engineers should be able to update the Terraform resources accordingly.
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
The Terraform Schema Sync Tool fails the build attemps if resource drifts are detected and notifies the latest resource information. Developers and data engineers should be able to update the Terraform resources accordingly.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated in 7f469e4

The Terraform Schema Sync Tool fails the build attemps if resource drifts are detected and notifies the latest resource information. Developers and data engineers should be able to update the Terraform resources accordingly.

Terraform Sync Tool can be integrated into your CI/CD pipeline. You'll need to add two steps to CI/CD pipeline.
- Step 0: Use Terraform/erragrunt command to detect resource drifts and write output into a JSON file
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- Step 0: Use Terraform/erragrunt command to detect resource drifts and write output into a JSON file
- Step 0: Run the Terraform plan command (using either Terraform/Terragrunt) with the `-json` option and write the output into a JSON file using the caret operator `> output.json`

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated in 7f469e4


## How to run Terraform Schema Sync Tool

#### Use Terraform/Terragrunt commands to test if any resources drifts existed
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

All these should be 3rd level headers (###), not 4 (####)

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated in 5496102

- Step 0: Use Terraform/erragrunt command to detect resource drifts and write output into a JSON file
- Step 1: Use Python scripts to identify and investigate the drifts

## How to run Terraform Schema Sync Tool
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
## How to run Terraform Schema Sync Tool
## How to run Terraform Schema Sync Tool
```bash
###############
# Using Terragrunt
###############
terragrunt run-all plan -json --terragrunt-non-interactive > plan_output.json
python3 terraform_sync.py plan_output.json
##############
# Using Terraform
##############
terraform plan -json > plan_output.json
python3 terraform_sync.py plan_output.json

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated in 5496102 and 34f1461

Comment on lines 18 to 46
Terragrunt/Terraform commands:
```
terragrunt run-all plan -json --terragrunt-non-interactive

# Terraform Command
terraform plan -json
```

After running the Terrform plan command, **the event type "resource_drift"("type": "resource_drift") indicates a drift has occurred**.
If drifts detected, please update your terraform configurations and address the resource drifts based on the event outputs.


#### Add Could Build Steps to your configuration file

Please check cloud build steps in `cloudbuild.yaml` file, and add these steps to your Cloud Build Configuration File.

- step 0: run terraform commands in `deploy.sh` to detects drifts

Add `deploy.sh` to your project directory.

- step 1: run python scripts to investigate terraform output

Add `requirements.txt` and `terraform_sync.py` to your project directory.

#### (Optional if you haven't created Cloud Build Trigger) Create and configure a new Trigger in Cloud Build
Make sure to indicate your cloud configuration file location correctly.

#### That's all you need! Let's commit and test in CLoud Build!

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Terragrunt/Terraform commands:
```
terragrunt run-all plan -json --terragrunt-non-interactive
# Terraform Command
terraform plan -json
```
After running the Terrform plan command, **the event type "resource_drift"("type": "resource_drift") indicates a drift has occurred**.
If drifts detected, please update your terraform configurations and address the resource drifts based on the event outputs.
#### Add Could Build Steps to your configuration file
Please check cloud build steps in `cloudbuild.yaml` file, and add these steps to your Cloud Build Configuration File.
- step 0: run terraform commands in `deploy.sh` to detects drifts
Add `deploy.sh` to your project directory.
- step 1: run python scripts to investigate terraform output
Add `requirements.txt` and `terraform_sync.py` to your project directory.
#### (Optional if you haven't created Cloud Build Trigger) Create and configure a new Trigger in Cloud Build
Make sure to indicate your cloud configuration file location correctly.
#### That's all you need! Let's commit and test in CLoud Build!

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These are not required to run this tool. This is more related to running the example you provide with this tool.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated in 5496102

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants