Get Free Shipping on orders over $79
Exploratory Data Analysis Using R : Chapman & Hall/CRC Data Mining and Knowledge Discovery Series - Ronald K. Pearson

Exploratory Data Analysis Using R

By: Ronald K. Pearson

Hardcover | 18 May 2026 | Edition Number 2

At a Glance

Hardcover


$387.75

or 4 interest-free payments of $96.94 with

 or 

Available: 18th May 2026

Preorder. Will ship when available.

Exploratory Data Analysis Using R provides a classroom-tested introduction to exploratory data analysis (EDA), and this revised edition is accompanied by the R package ExploreTheData that implements many of the approaches described. As before, the primary focus of the book is on identifying "interesting" features - good, bad, and ugly - in a dataset, why it is important to find them, how to treat them, and more generally, the use of R to explore and explain datasets and the analysis results derived from them.

The book begins with a brief overview of exploratory data analysis using R, followed by a detailed discussion of creating various graphical data summaries in R. Then comes a thorough introduction to exploratory data analysis, and a detailed treatment of 13 data anomalies, why they are important, how to find them, and some options for addressing them. Subsequent chapters introduce the mechanics of working with external data, structured query language (SQL) for interacting with relational databases, linear regression analysis (the simplest and historically most important class of predictive models), and crafting data stories to explain our results to others. These chapters use R as an interactive data analysis platform, while Chapter 9 turns to writing programs in R, focusing on creating custom functions that can greatly simplify repetitive analysis tasks. Further chapters expand the scope to more advanced topics and techniques: special considerations for working with text data, a second look at exploratory data analysis, and more general predictive models.

The book is designed for both advanced undergraduate, entry-level graduate students, and working professionals with little to no prior exposure to data analysis, modeling, statistics, or programming. It keeps the treatment relatively non-mathematical, even though data analysis is an inherently mathematical subject. Exercises are included at the end of most chapters, and an instructor's solution manual is available.

More in Database Design & Theory

Python All-in-One For Dummies : 3rd Edition - Alan Simpson

RRP $74.95

$55.75

26%
OFF
Think Stats : Exploratory Data Analysis - Allen B. Downey

RRP $152.00

$73.75

51%
OFF
Scaling Python with Dask : From Data Science to Machine Learning - Holden Karau