Video explains - What are Deletion Vectors in Delta Tables? What is Liquid Clustering in Delta Table? How Liquid Clustering improved performance in Delta Tables? How to Optimize a Delta table with High Cardinality Column? How to read a file using SQL in Databricks? How to enable Liquid Clustering on a Delta Table? What is Delta Clustering?
This video is also available as a part of playlist : Databricks Zero to Hero ([ Ссылка ])
Chapters
00:00 - Introduction
00:44 - What are Deletion Vector in Delta Tables?
01:58 - How Deletion Vector works in Delta Tables?
08:13 - What is Liquid Clustering in Delta Tables?
09:13 - How to enable Liquid Clustering in Delta Tables?
For Local PySpark Jupyter Lab setup just run the command - docker pull jupyter/pyspark-notebook
Python Basics - [ Ссылка ]
GitHub URL for code - [ Ссылка ]
Delta Lake Optimization Documentation - [ Ссылка ]
The series provides a step-by-step guide to learning PySpark, a popular open-source distributed computing framework that is used for big data processing.
Ещё видео!