What is the best practice to test a ETL pipeline?
In traditional software development practice, before going into production, a piece of code should go through various stages of testing (unit test, integration test, user acceptance test) to secure the stability of the software.
A ETL pipeline, as a piece of code, should also go through these testing steps to build a healthy system.
However due to the nature of ETL process, traditional testing technique may not be applicable.
Is there any reference or guideline specifically focus on testing on ETL pipeline?
Topic etl reference-request
Category Data Science