Get Free Shipping on orders over $49
Real-Time Analytics with Apache Spark : Master Structured Streaming, Kafka, Databricks, Real-Time Data Pipelines, Stateful Processing, and Production-Scale Stream Engineering (English Edition) - Subhadip Chanda

Real-Time Analytics with Apache Spark

Master Structured Streaming, Kafka, Databricks, Real-Time Data Pipelines, Stateful Processing, and Production-Scale Stream Engineering (English Edition)

By: Subhadip Chanda, Harsha Pasala

eText | 15 June 2026 | Edition Number 1

At a Glance

eText


$38.85

or 4 interest-free payments of $9.71 with

 or 

Instant online reading in your Booktopia eTextbook Library *

Why choose an eTextbook?

Instant Access *

Purchase and read your book immediately

Read Aloud

Listen and follow along as Bookshelf reads to you

Study Tools

Built-in study tools like highlights and more

* eTextbooks are not downloadable to your eReader or an app and can be accessed via web browsers only. You must be connected to the internet and have no technical issues with your device or browser that could prevent the eTextbook from operating.
Turn Data in Motion into Decisions in Real

Key Features ? Get a free one-month digital subscription to www.avaskillshelf.com. ? Master Spark Structured Streaming from windowed aggregations and stateful processing to sub-second latency. ? Build production ingestion pipelines using Kafka, Kinesis, Event Hubs, and Auto Loader at scale. ? Deploy, monitor, and integrate ML inference into streaming workflows using CI/CD and Declarative Automation Bundles.

Book Description The Next Generation of Data Platforms Will Be Real-Time, Intelligent, and Always On

Real-time Analytics with Apache Spark is your complete, comprehensive guide to building production-grade streaming systems using Apache Spark Structured Streaming on the Databricks platform, from first principles to enterprise-scale deployment.

You begin with Spark fundamentals and streaming concepts, then progressively advance through windowed aggregations, stateful processing with transformWithState, stream-stream joins, and the new Real-time Mode for sub-second latency. Every chapter combines clear explanations with production-ready code, preparing you to handle real-world challenges including late data, state management, and performance tuning across Kafka, Kinesis, Event Hubs, and Auto Loader.

The final section teaches you to think like a production engineer by packaging pipelines with Declarative Automation Bundles, automating deployments with CI/CD, integrating ML inference into streaming workflows, and building monitoring dashboards with custom alerts. By the end of the book, you will have a proven blueprint for delivering scalable, fault-tolerant streaming solutions on Apache Spark and Databricks.

What you will learn ? Build fault-tolerant streaming pipelines with exactly-once guarantees on Apache Spark. ? Apply windowed aggregations, watermarks, and stateful processing for real-time data workflows. ? Ingest streaming data from Kafka, Kinesis, Event Hubs, and Auto Loader at scale. ? Deploy streaming pipelines using Declarative Automation Bundles and CI/CD on Databricks. ? Integrate real-time ML inference into production streaming data workflows with confidence. ? Monitor, debug, and tune streaming jobs for production performance and operational reliability.

Table of Contents 1. Real-Time Analytics Landscape and Use Cases 2. Apache Spark Fundamentals (with a Streaming Mindset) 3. Structured Streaming 4. Deep Dive into Sources and Sinks 5. Windowed and Stateful Operations 6. Writing Streaming Queries with Spark SQL 7. Low-Latency Streaming with Spark Real-Time Mode 8. Machine Learning for Streaming Applications 9. Monitoring, Debugging, and Performance Tuning 10. Packaging, Orchestration, and CI/CD Using Declarative Automation Bundles. 11. End-to-End Real-Time Analytics Project Index

About the Author Subhadip Chanda and Harsha Pasala are experts in real-time data engineering, specializing in scalable Spark and Databricks streaming architectures. Combining deep production experience with practical design insight, they guide readers beyond prototypes to build resilient, low-latency, and future-ready analytics pipelines that operate reliably at enterprise scale.
on
Desktop
Tablet
Mobile

More in Data Capture & Analysis

China's Megatrends : The 8 Pillars of a New Society - John Naisbitt

eBOOK

AI Model Evaluation - Leemay Nassery

eBOOK