Link Search Menu Expand Document

Use case guides

Last modified on 09-Sep-24

Use the following guides as example implementations based on how you intend to use Soda for data quality testing. For standard set up instructions, see Get started.

Guide Description Soda product
requirements
Test data in an Airflow pipeline Use this guide as an example for how to set up Soda to test the quality of your data in an Airflow pipeline that uses dbt transformations. Soda Library
Soda Cloud
Test data quality in an ADF pipeline Learn how to invoke Soda data quality tests in an ETL pipeline in Azure Data Factory. Soda Library
Soda Cloud
Test data quality in a Dagster pipeline Learn how to invoke Soda data quality tests in a Dagster pipeline. Soda Library
Soda Cloud
Test data quality in Databricks pipeline Learn how to use Databricks notesbooks with Soda to test data quality before feeding a machine learning model. Soda Library
Soda Cloud
Test data before migration Use this guide to set up Soda to test before and after data migration between data sources. Soda Library
Soda Cloud
Self-serve Soda Use this guide to set up Soda Cloud to enable users across your organization to serve themselves when it comes to testing data quality. Soda Cloud
Soda Agent
Test data during development Use this guide to set up Soda to test the quality of your data during your development lifecycle in a GitHub Workflow. Soda Library
Soda Cloud
Automate monitoring Use this guide to set up Soda to automatically monitor data quality. Soda Cloud
Soda Agent


Use the following How tos for practical advice, examples, and instructions for using Soda.

How to Description Soda product
requirements
Invoke Soda in Databricks Learn how to invoke Soda data quality tests in a Databricks notebook. Soda Library
Soda Cloud
Use a Secrets Manager Learn how to set up a Soda Agent to use an External Secrets Manager to retrieve frequently-rotated data source passwords. Soda Cloud
Self-hosted Agent
Generate API keys Learn how to use Soda Cloud API keys to securely communicate with other entities such as Soda Library and self-hosted Soda Agents, and to provide secure access to Soda Cloud via API. Soda Cloud
Manage sensitive data Learn how to adjust several configurable settings that help you manage access to sensitive data in Soda Cloud. Soda Cloud
Reroute failed row samples Learn how to programmatically set up Soda Library to display failed row samples in the command-line. Soda Library
Soda Cloud
Double-onboard a data source Learn how to onboard a data source in Soda Cloud that you have already onboarded via Soda Library. Soda Library
Soda Cloud

Need help? Join the Soda community on Slack.


Was this documentation helpful?

What could we do to improve this page?

Documentation always applies to the latest version of Soda products
Last modified on 09-Sep-24