Guilherme Banhudo · Building a Data Warehouse — Naming Conventions · Naming conventions are one of the first steps toward ensuring proper cataloging, development consistency, and faster onboarding… · 4 min read · Jul 9, 2023
Guilherme Banhudo · Downloading files from Databricks’ DBFS · A quick tutorial on how to access your DBFS instance to download files solely via your browser. · 3 min read · Jan 11, 2023
Guilherme Banhudo in Towards AI · Querying Synapse Analytics Delta Lake from Databricks · A step-by-step guide on how to connect (query) Azure Synapse Analytics Delta Lake data from Databricks for both dedicated and serverless… · 6 min read · Dec 8, 2022
Guilherme Banhudo in Towards AI · Airflow Production Tips — Grouped failures and retries · Apache Airflow has become the de facto standard for Data Orchestration. However, throughout the years and versions, it accumulated a set… · 4 min read · Oct 25, 2022
Guilherme Banhudo in Towards AI · Airflow Production Tips — Proper Task (Not DAG) Catchup · This series of articles aims at walking Apache Airflow users through the process of overcoming Production issues. · 4 min read · Oct 15, 2022
Guilherme Banhudo in Towards AI · From OLTP to Data Lakehouse · A layman’s overview of the Data ecosystem in a few paragraphs · 4 min read · Jun 4, 2022
Guilherme Banhudo in Towards Data Science · SQL Performance Tips #2 · Avoiding running on the heap and CTEs vs Temporary Tables · 5 min read · Jan 4, 2021
Guilherme Banhudo in Towards Data Science · SQL Performance Tips #1 · Avoiding self joins and join on operations · 6 min read · Nov 27, 2020
Guilherme Banhudo in Towards AI · Exploring the NoSQL Family · A (long) primer on a growing requirement for Data Scientist interviews · 9 min read · Aug 12, 2020
Guilherme Banhudo in Towards Data Science · Data-Driven T-SQL Business Rules · Creating a data validator with dynamic and interchangeable business rules or why (T-) SQL is more powerful than we like to admit · 5 min read · May 7, 2020