Get Free Shipping on orders over $79
Data Clustering with Python : From Theory to Implementation - Guojun Gan

Data Clustering with Python

From Theory to Implementation

By: Guojun Gan

Hardcover | 15 September 2025 | Edition Number 1

At a Glance

Hardcover


$168.00

or 4 interest-free payments of $42.00 with

 or 

Ships in 5 to 7 business days

Data clustering, an interdisciplinary field with diverse applications, has gained increasing popularity since its origins in the 1950s. Over the past six decades, researchers from various fields have proposed numerous clustering algorithms. In 2011, I wrote a book on implementing clustering algorithms in C++ using object-oriented programming. While C++ offers efficiency, its steep learning curve makes it less ideal for rapid prototyping. Since then, Python has surged in popularity, becoming the most widely used programming language since 2022. Its simplicity and extensive scientific libraries make it an excellent choice for implementing clustering algorithms.

Features:

· Introduction to Python programming fundamentals

· Overview of key concepts in data clustering

· Implementation of popular clustering algorithms in Python

· Practical examples of applying clustering algorithms to datasets

· Access to associated Python code on GitHub

This book extends my previous work by implementing clustering algorithms in Python. Unlike the object-oriented approach in C++, this book uses a procedural programming style, as Python allows many clustering algorithms to be implemented concisely. The book is divided into two parts: the first introduces Python and key libraries like NumPy, Pandas, and Matplotlib, while the second covers clustering algorithms, including hierarchical and partitional methods. Each chapter includes theoretical explanations, Python implementations, and practical examples, with comparisons to scikit-learn where applicable.

This book is ideal for anyone interested in clustering algorithms, with no prior Python experience required. The complete source code is available at: https://github.com/ganml/dcpython.

More in Mathematical & Statistical Software

SPSS Statistics : 5th Edition - A Practical Guide - Kellie Bennett

RRP $104.95

$85.75

18%
OFF
Statistics Using Stata : 3rd Edition - An Integrative Approach - Sharon Lawner Weinberg
Understanding Statistics in Psychology with SPSS : 8th Edition - Dennis Howitt
Time Series : A Data Analysis Approach Using R - David S.  Stoffer

RRP $94.99

$85.75

10%
OFF
Time Series : A Data Analysis Approach Using R - David S.  Stoffer

RRP $145.00

$130.75

10%
OFF
Applied Statistics with Python : Volume II: Multivariate Models - Leon  Kaganovskiy
SPSS Basics : Techniques for a First Course in Statistics - Deborah Mikyo Oh
IBM SPSS for Intermediate Statistics : Use and Interpretation - George A.  Morgan
IBM SPSS for Intermediate Statistics : Use and Interpretation - George A.  Morgan
Statistics in Corpus Linguistics Research : A New Approach - Sean Wallis
Statistics in a Nutshell : In a Nutshell - Sarah Boslaugh

RRP $104.75

$51.75

51%
OFF