BLOGS

An Ultimate Guide to Databricks Unity Catalog February 6, 2024

Databricks Unity Catalog (UC) has gained significant attention lately, with Databricks making huge investments and shifting to make it the default choice for all new Databricks accounts.

Introduction to Databricks SQL March 4, 2023

Databricks SQL is a powerful tool for querying and analysing data in the Databricks Lakehouse.

Databricks SQL in VSCode January 25, 2023

Recently, I had the opportunity to explore the Databricks SQL extension for VSCode, and I was thoroughly impressed.

Write a single JSON file in Databricks March 23, 2022

When writing to a JSON destination using the DataFrameWriter, the dataset is split into multiple files, one per in-memory partition of the DataFrame. This is the most efficient way for Spark to write data out.

Databricks Execution Plans October 4, 2021

Execution plans in Databricks allow you to understand how code will actually be executed across a cluster, and they are useful for optimising queries.

Databricks Delta and Spark Cache September 10, 2021

As data sizes and demand increase over time, you often see slowness on Databricks. This can be due to a number of factors, including security, network transfers, read/write requests, and memory space.