-
Book Overview & Buying
-
Table Of Contents
-
Feedback & Rating

Data Engineering with Databricks Cookbook
By :

In this recipe, you will learn how to write the output of Apache Spark Structured Streaming to a sink such as Delta Lake. Delta Lake can serve as a unified storage layer for various data types, reducing data silos within organizations. By using Delta Lake as a sink for streaming data, you can simplify data pipelines, reducing complexity and streamlining data architecture. Delta Lake enables unified analytics, allowing you to leverage a wide range of analytics tools and frameworks within a single environment, including Apache Spark, Databricks, SQL, and machine learning (ML) libraries. This versatility makes Delta Lake a valuable choice for real-time data processing and analytics pipelines, enhancing data reliability, durability, and consistency while simplifying data management and supporting compliance requirements.
Before we start, we need to make sure that we have a Kafka cluster...