Get Free Shipping on orders over $79
Apache Spark 3 for Data Engineering and Analytics with Python - David Mngadi

Apache Spark 3 for Data Engineering and Analytics with Python

By: David Mngadi

eText | 30 August 2021 | Edition Number 1

At a Glance

eText


$29.69

or 4 interest-free payments of $7.42 with

Instant online reading in your Booktopia eTextbook Library *

Why choose an eTextbook?

Instant Access *

Purchase and read your book immediately

Read Aloud

Listen and follow along as Bookshelf reads to you

Study Tools

Built-in study tools like highlights and more

* eTextbooks are not downloadable to your eReader or an app and can be accessed via web browsers only. You must be connected to the internet and have no technical issues with your device or browser that could prevent the eTextbook from operating.

Apache Spark 3 is an open-source distributed engine for querying and processing data. This course will provide you with a detailed understanding of PySpark and its stack. This course is carefully developed and designed to guide you through the process of data analytics using Python Spark. The author uses an interactive approach in explaining keys concepts of PySpark such as the Spark architecture, Spark execution, transformations and actions using the structured API, and much more. You will be able to leverage the power of Python, Java, and SQL and put it to use in the Spark ecosystem.



You will start by getting a firm understanding of the Apache Spark architecture and how to set up a Python environment for Spark. Followed by the techniques for collecting, cleaning, and visualizing data by creating dashboards in Databricks. You will learn how to use SQL to interact with DataFrames. The author provides an in-depth review of RDDs and contrasts them with DataFrames.



There are multiple problem challenges provided at intervals in the course so that you get a firm grasp of the concepts taught in the course.



The code bundle for this course is available here: https://github.com/PacktPublishing/Apache-Spark-3-for-Data-Engineering-and-Analytics-with-Python-

on
Desktop
Tablet
Mobile

More in Programming & Scripting Languages

Investing for Programmers - Stefan Papp

eBOOK

The Rust Programming Language, 3rd Edition - Carol Nichols

eBOOK

The Debugging Handbook - Johannes Kuhlmann

eBOOK

RRP $67.77

$54.99

19%
OFF