Apache Iceberg: The Definitive Guide : Data Lakehouse Functionality, Performance, and Scalability on the Data Lake - Tomer Shiran

By: Tomer Shiran, Jason Hughes, Alex Merced, Dipankar Mazumdar

Paperback | 22 March 2024

RRP $133.00

$58.25

56% OFF

Aims to ship in 15 to 25 business days

Traditional data architecture patterns are severely limited. To use these patterns, you have to ETL data into each tool, a cost-prohibitive process for making warehouse features available to all of your data. This lack of flexibility forces you to adjust your workflow to the tool your data is locked in, which creates data silos and data drift. This book shows you a better way.

Apache Iceberg provides the capabilities, performance, scalability, and savings that fulfill the promise of an open data lakehouse. By following the lessons in this book, you'll be able to achieve interactive, batch, machine learning, and streaming analytics with this lakehouse. Authors Tomer Shiran, Jason Hughes, and Alex Merced from Dremio guide you through the process.

With this book, you'll learn:
  • The architecture of Apache Iceberg tables
  • What happens under the hood when you perform operations on Iceberg tables
  • How to further optimize Apache Iceberg tables for maximum performance
  • How to use Apache Iceberg with popular data engines such as Apache Spark, Apache Flink, and Dremio Sonar
  • How Apache Iceberg can be used in streaming and batch ingestion

Discover why Apache Iceberg is a foundational technology for implementing an open data lakehouse.
