Get Free Shipping on orders over $79
Tree-Based Methods for Statistical Learning in R : Chapman & Hall/CRC Data Science Series - Brandon M. Greenwell

Tree-Based Methods for Statistical Learning in R

By: Brandon M. Greenwell

Hardcover | 23 June 2022 | Edition Number 1

At a Glance

Hardcover


RRP $200.00

$176.75

12%OFF

or 4 interest-free payments of $44.19 with

 or 

Ships in 3 to 5 business days

Tree-based Methods for Statistical Learning in R provides a thorough introduction to both individual decision tree algorithms (Part I) and ensembles thereof (Part II). Part I of the book brings several different tree algorithms into focus, both conventional and contemporary. Building a strong foundation for how individual decision trees work will help readers better understand tree-based ensembles at a deeper level, which lie at the cutting edge of modern statistical and machine learning methodology.

The book follows up most ideas and mathematical concepts with code-based examples in the R statistical language; with an emphasis on using as few external packages as possible. For example, users will be exposed to writing their own random forest and gradient tree boosting functions using simple for loops and basic tree fitting software (like rpart and party/partykit), and more. The core chapters also end with a detailed section on relevant software in both R and other opensource alternatives (e.g., Python, Spark, and Julia), and example usage on real data sets. While the book mostly uses R, it is meant to be equally accessible and useful to non-R programmers.

Consumers of this book will have gained a solid foundation (and appreciation) for tree-based methods and how they can be used to solve practical problems and challenges data scientists often face in applied work.

Features:

  • Thorough coverage, from the ground up, of tree-based methods (e.g., CART, conditional inference trees, bagging, boosting, and random forests).

  • A companion website containing additional supplementary material and the code to reproduce every example and figure in the book.
  • A companion R package, called treemisc, which contains several data sets and functions used throughout the book (e.g., there's an implementation of gradient tree boosting with LAD loss that shows how to perform the line search step by updating the terminal node estimates of a fitted rpart tree).
  • Interesting examples that are of practical use; for example, how to construct partial dependence plots from a fitted model in Spark MLlib (using only Spark operations), or post-processing tree ensembles via the LASSO to reduce the number of trees while maintaining, or even improving performance.
Industry Reviews

Tree-based algorithms have been a workhorse for data science teams for decades, but the data science field has lacked an all-encompassing review of trees - and their modern variants like XGBoost - until now. Greenwell has written the ultimate guide for tree-based methods: how they work, their pitfalls, and alternative solutions. He puts it all together in a readable and immediately usable book. You're guaranteed to learn new tips and tricks to help your data science team.

-Alex Gutman, Director of Data Science, Author: Becoming a Data Head


"Here's a new title that is a "must have" for any data scientist who uses the R language. It's a wonderful learning resource for tree-based techniques in statistical learning, one that's become my go-to text when I find the need to do a deep dive into various ML topic areas for my work."

Daniel D. Gutierrez, Editor-in-Chief for insideBIGDATA, USA, insideBIGDATA, February 2023

More in Economic Statistics

Principles of Human Physiology, Global Edition : 6th edition - Cindy Stanfield
Accounting : 9th Edition - Tracie Miller-Nobles

RRP $206.95

$155.75

25%
OFF
Business Analytics : 5th Cengage International Edition - Jeffrey D. Camm
Operations and Supply Chain Management : 3rd Edition - David Collier
Basic Business Statistics : 5th Edition - Mark Berenson

RRP $167.95

$126.75

25%
OFF
Quantitative Methods for Business (Custom Edition) : 3rd Edition - Mark Berenson
Data Analysis for Business, Economics, and Policy - Gábor Békés
The Art of Statistics : Learning from Data - David Spiegelhalter

RRP $26.99

$22.99

15%
OFF
Capital and Ideology - Thomas Piketty

RRP $70.95

$52.99

25%
OFF
Lean Analytics : Use Data to Build a Better Startup Faster - Alistair Croll
Ecco! due Activity Book : Ecco! - Carla Catanzariti
Basic Business Statistics + PHStat for Statistics : 5th Edition - Mark Berenson

RRP $159.95

$127.75

20%
OFF
Introduction to Econometrics : 5th edition - Christopher Dougherty

RRP $140.95

$116.75

17%
OFF
Naked Statistics : Stripping the Dread from the Data - Charles Wheelan
SPSS Statistics For Dummies : 4th edition - Jesus Salcedo

RRP $65.95

$46.17

30%
OFF
Business Statistics : 4th Global Edition - Norean Sharpe

RRP $154.30

$117.75

24%
OFF