Bujar Bakiu · Oct 14, 2022 · 5 min read
Dockerizing dbt Transformations for Managed Airflow: Docker, dbt, and GCP Cloud Composer
Airflow is one of the most popular pipeline orchestration tools out there. It has been around for more than 8 years, and it is used...
Kejdi Tako · Sep 14, 2022 · 3 min read
Distributed Machine Learning Model Training with Spark (PySpark)
GitHub repo: https://github.com/data-max-hq/pyspark-3-ways What is Spark? Apache Spark was designed to function as a simple API for...
Igli · Aug 24, 2022 · 4 min read
Deploy Airflow and Metabase in Kubernetes using Infrastructure-as-Code
A step-by-step guide to deploying Airflow and Metabase in GCP with Terraform and Helm providers. With the extensive usage of cloud...