Databricks Q&A
On May 5 th , I had the pleasure of presenting “Data Cleansing using Databricks ( https://www.meetup.com/ohio-north-database-training/events/314363881/ ). During the meeting, many good questions were raised. Listed below are the answers to these questions: What are jobs in databricks? Jobs are workloads that can be scheduled, managed, and automated without manual intervention. Workloads can be notebooks, SQL queries, or pipelines on a cluster. How do jobs compare to other automation tools? What is a “Delta Live Table”? Delta Live Tables (DLT) is a Databricks feature that makes it much easier to build and run data pipelines, either batch or streaming. Simply write the transformations in SQL or Python, and DLT takes care of setting up the infrastructure, tracking dependencies, handling errors, and enforcing data quality rules through its built ‑ in expectations. ...