Master the critical skills required to deploy and use Databricks SQL and elevate your Business Intelligence from the Warehouse to the Lakehouse with confidence.
Key Features
- Learn about business intelligence on the Lakehouse with features and functions of Databricks SQL
- Make the most of Databricks SQL by learning the enablers of its data warehousing capabilities
- Learn new techniques and concepts that will allow you to supercharge your Business Intelligence with Databricks SQL analytics
Book Description
It is a new era in the design of data platform systems. Disparate data lakes and data warehouses are giving way to a new type of data platform system - the Lakehouse. It promises to unify all data analytics into a single platform. Databricks with its Databricks SQL product suite is the hottest Lakehouse platform out there. It harnesses the power of Apache Spark™, Delta Lake™ and other innovations to enable data warehousing capabilities on the Lakehouse with data lake economics.
This book is a comprehensive hands-on guide that lets you explore all the advanced features, use cases and technology components of Databricks SQL. You will start with the fundamentals of the Lakehouse architecture and how Databricks SQL fits into it. Next, you will learn how to use the platform - exploring data, executing queries, building reports and dashboards. Moving on, learn the administrative aspects of the Lakehouse - data security, governance and managing the computation power of the Lakehouse. You will delve into the core technology enablers of Databricks SQL - Delta Lake™ and Photon. Finally, you will get hands on with advanced SQL commands for ingesting data and maintaining the Lakehouse.
By the end of this book, you will have mastered Databricks SQL and be able to deploy and deliver fast, scalable business intelligence on the Lakehouse.
What you will learn
- Implement the Data Lakehouse architecture using Databricks
- Perform everyday analytics with Databricks SQL workbench and BI tools
- Organize and catalog your data assets
- Program the data security model to protect and govern your data
- Tune SQL Endpoints (computing clusters) for optimal query experience
- Tune the Delta Lake™ storage format for maximum query performance
- Achieve extreme performance with Photon query execution engine
- Implement advanced data ingestion patterns with Databricks SQL
Who This Book Is For
This book is for business intelligence practitioners, data warehouse administrators, and data engineers who are new to Databrick SQL and want to learn how to deliver high-quality insights unhindered by the scale of data or infrastructure. This book is also perfect for anyone who wants to study the advanced technologies that power Databricks SQL.
Basic knowledge of data warehouses, SQL-based analytics, and ETL processes is recommended to effectively learn the concepts introduced in this book and appreciate the innovation behind the platform.
Table of Contents
- Introduction
- Databricks Product Suite: A Visual Tour
- The Data Catalog
- The Data Security Model
- The Workbench
- The SQL Endpoints
- Using Business Intelligence tools with Databricks SQL
- The Delta Engine
- The Photon Engine
- Warehouse on the Lakehouse
- SQL Commands Part 1
- SQL Commands Part 2
- Data Manipulation Language
- Ask me Anything