Scaling Up with R and Apache Arrow : Bigger Data, Easier Workflows - Nic Crane

Scaling Up with R and Apache Arrow

Bigger Data, Easier Workflows

By: Nic Crane, Jonathan Keane, Neal Richardson

Hardcover | 2 June 2025 | Edition Number 1

At a Glance

Hardcover


$446.90

or 4 interest-free payments of $111.72 with

 or 

Available: 2nd June 2025

Preorder. Will ship when available.

Analyze large datasets directly from R. Scaling Up With R and Arrow provides a guide to working efficiently with larger-than-memory datasets using the arrow R package. As data grows in size and complexity, traditional data analysis methods in R often hit technical limitations. In this book, you'll learn how to overcome these hurdles without needing to set up complex infrastructure.

You'll learn about the Apache Arrow project's origins, goals, and its significance in bridging the gap between data science and big data ecosystems. You'll also learn how to leverage the arrow R package to work directly with files in various formats, such as CSV and Parquet, using familiar dplyr syntax. This book explores practical topics like data manipulation, file formats, working with larger datasets, and optimizing workflows for data in cloud storage. Advanced chapters examine user-defined functions, integration with other tools like DuckDB, and extending Arrow's capabilities to work with geospatial data.

Written by developers of the Arrow R package, this guide is essential for anyone looking to scale their data processing capabilities in R.

More in Computer Science

The Shortest History of AI - Toby Walsh

RRP $27.99

$22.50

20%
OFF
Sea of Thieves : The Art of Piracy - Chris Allcock

RRP $59.99

$47.75

20%
OFF
AI Engineering : Building Applications with Foundation Models - Chip Huyen
Windows 11 For Seniors For Dummies, 2nd Edition - Curt Simmons
Windows 11 For Dummies, 2nd Edition : Windows 11 For Dummies - Alan Simpson