Building a Data Warehouse — Naming Conventions: Naming conventions are one of the first steps toward ensuring proper cataloging, development consistency, and faster onboarding… (Jul 9, 2023)
Downloading files from Databricks’ DBFS: A quick tutorial on how to access your DBFS instance to download files solely via your browser. (Jan 11, 2023)
Querying Synapse Analytics Delta Lake from Databricks: A step-by-step guide on how to connect (query) Azure Synapse Analytics Delta Lake data from Databricks for both dedicated and serverless… (Published in Towards AI, Dec 8, 2022)
Airflow Production Tips — Grouped failures and retries: Apache Airflow has become the de facto standard for Data Orchestration. However, throughout the years and versions, it accumulated a set… (Published in Towards AI, Oct 25, 2022)
Airflow Production Tips — Proper Task (Not DAG) Catchup: This series of articles aims at walking Apache Airflow users through the process of overcoming Production issues. (Published in Towards AI, Oct 15, 2022)
From OLTP to Data Lakehouse: A layman’s overview of the Data ecosystem in a few paragraphs (Published in Towards AI, Jun 4, 2022)
SQL Performance Tips #2: Avoiding running on the heap and CTEs vs Temporary Tables (Published in Towards Data Science, Jan 4, 2021)
SQL Performance Tips #1: Avoiding self joins and join on operations (Published in Towards Data Science, Nov 27, 2020)
Exploring the NoSQL Family: A (long) primer on a growing requirement for Data Scientist interviews (Published in Towards AI, Aug 12, 2020)
Data-Driven T-SQL Business Rules: Creating a data validator with dynamic and interchangeable business rules or why (T-) SQL is more powerful than we like to admit (Published in Towards Data Science, May 7, 2020)