Guilherme Banhudo. Building a Data Warehouse — Naming Conventions. Naming conventions are one of the first steps toward ensuring proper cataloging, development consistency, and faster onboarding… Jul 9, 2023
Guilherme Banhudo. Downloading files from Databricks’ DBFS. A quick tutorial on how to access your DBFS instance to download files solely via your browser. Jan 11, 2023
Guilherme Banhudo in Towards AI. Querying Synapse Analytics Delta Lake from Databricks. A step-by-step guide on how to connect (query) Azure Synapse Analytics Delta Lake data from Databricks for both dedicated and serverless… Dec 8, 2022
Guilherme Banhudo in Towards AI. Airflow Production Tips — Grouped failures and retries. Apache Airflow has become the de facto standard for Data Orchestration. However, throughout the years and versions, it accumulated a set… Oct 25, 2022
Guilherme Banhudo in Towards AI. Airflow Production Tips — Proper Task (Not DAG) Catchup. This series of articles aims at walking Apache Airflow users through the process of overcoming Production issues. Oct 15, 2022
Guilherme Banhudo in Towards AI. From OLTP to Data Lakehouse. A layman’s overview of the Data ecosystem in a few paragraphs. Jun 4, 2022
Guilherme Banhudo in Towards Data Science. SQL Performance Tips #2. Avoiding running on the heap and CTEs vs Temporary Tables. Jan 4, 2021
Guilherme Banhudo in Towards Data Science. SQL Performance Tips #1. Avoiding self joins and join on operations. Nov 27, 2020
Guilherme Banhudo in Towards AI. Exploring the NoSQL Family. A (long) primer on a growing requirement for Data Scientist interviews. Aug 12, 2020
Guilherme Banhudo in Towards Data Science. Data-Driven T-SQL Business Rules. Creating a data validator with dynamic and interchangeable business rules or why (T-) SQL is more powerful than we like to admit. May 7, 2020