Beyond the basics: API integrations, multithreading, robust logging and reusable ETL libraries.
Production-grade SQL — tuning, partitioning, indexing, warehouse-aware patterns.
Pipelines, datasets, triggers, parameterization. The orchestration backbone.
Databricks end-to-end — workspaces, Delta Lake, medallion architecture, Unity Catalog.
Distributed compute under the hood — RDDs, DataFrames, optimization and tuning.
Modeling the layer everything else depends on. Dimensions, SCDs, CDC, real-time loads.