Loading...
「ツール」は右上に移動しました。
利用したサーバー: wtserver1
7いいね 185 views回再生

How to PREVENT incomplete data using dbt tests and Orchestra | dbt test tutorial #dataquality

Ever wondered how you can check for incomplete data? Now with just a few lines of SQL you can prevent ANY of your datasets that are customer-facing having incomplete tests.

In this video we use dbt tests / dbt custom macros to define a dbt test that is parameterisable and applicable across any dataset.

We place this in the dbt tests/generic folder for use across different models.

By implementing this kind of data completeness test using dbt, you can ensure that if there is EVER any INCOMPLETE or MISSING data in ANY dataset, you as the data team are the first to know.

Of course - the final thing to understand is how you run this and get visibility of what is going on. This is where Orchestra comes in, which is a unified control plane for data / all round awesome data platform.

Here you can connect your git-repository with your dbt code and dbt testing code, and in a few minutes have your automated checks running either within an end-to-end data pipeline or outside of it.

Check out the video to see how you can use dbt tests and Orchestra to improve your data quality and STOP stakeholders asking you "Hey why is there data missing?"

#dataquality #dbt #dbttesting #dataqualitytesting #dataengineering #datanegineeringpipeline

Try dbt testing in Orchestra here: https://app.getorchestra.io/signup

コメント