Get Free Shipping on orders over $49
Real-Time Analytics with Apache Spark : Master Structured Streaming, Kafka, Databricks, Real-Time Data Pipelines, Stateful Processing, and Production-Scale Stream Engineering (English Edition) - Subhadip Chanda

Real-Time Analytics with Apache Spark

Master Structured Streaming, Kafka, Databricks, Real-Time Data Pipelines, Stateful Processing, and Production-Scale Stream Engineering (English Edition)

By: Subhadip Chanda

eBook | 17 June 2026

At a Glance

eBook


$35.00

or 4 interest-free payments of $8.75 with

 or 

Instant Digital Delivery to your Kobo Reader App

Turn Data in Motion into Decisions in Real

Key Features

? Get a free one-month digital subscription to www.avaskillshelf.com.

? Master Spark Structured Streaming from windowed aggregations and stateful processing to sub-second latency.

? Build production ingestion pipelines using Kafka, Kinesis, Event Hubs, and Auto Loader at scale.

? Deploy, monitor, and integrate ML inference into streaming workflows using CI/CD and Declarative Automation Bundles.

Book Description

The Next Generation of Data Platforms Will Be Real-Time, Intelligent, and Always On

Real-time Analytics with Apache Spark is your complete, comprehensive guide to building production-grade streaming systems using Apache Spark Structured Streaming on the Databricks platform, from first principles to enterprise-scale deployment.

You begin with Spark fundamentals and streaming concepts, then progressively advance through windowed aggregations, stateful processing with transformWithState, stream-stream joins, and the new Real-time Mode for sub-second latency. Every chapter combines clear explanations with production-ready code, preparing you to handle real-world challenges including late data, state management, and performance tuning across Kafka, Kinesis, Event Hubs, and Auto Loader.

What you will learn

? Build fault-tolerant streaming pipelines with exactly-once guarantees on Apache Spark.

? Apply windowed aggregations, watermarks, and stateful processing for real-time data workflows.

? Ingest streaming data from Kafka, Kinesis, Event Hubs, and Auto Loader at scale.

? Deploy streaming pipelines using Declarative Automation Bundles and CI/CD on Databricks.

Table of Contents

  1. Real-Time Analytics Landscape and Use Cases

  2. Apache Spark Fundamentals (with a Streaming Mindset)

  3. Structured Streaming

  4. Deep Dive into Sources and Sinks

  5. Windowed and Stateful Operations

  6. Writing Streaming Queries with Spark SQL

  7. Low-Latency Streaming with Spark Real-Time Mode

  8. Machine Learning for Streaming Applications

  9. Monitoring, Debugging, and Performance Tuning

  10. Packaging, Orchestration, and CI/CD Using Declarative Automation Bundles.

  11. End-to-End Real-Time Analytics Project

Index

on

More in Data Capture & Analysis

China's Megatrends : The 8 Pillars of a New Society - John Naisbitt

eBOOK

AI Model Evaluation - Leemay Nassery

eBOOK