Big Data Glossary

A Guide to the New Generation of Data Tools

By: Pete Warden

Paperback | 7 October 2011

There's been a massive amount of innovation in data tools over the last few years, thanks to a few key trends:

* *Learning from the web*. Techniques originally developed by website developers coping with scaling issues are increasingly being applied to other domains.
* *CS+?=$$$*. Google have proven that research techniques from computer science can be effective at solving problems and creating value in many real-world situations. That's led to increased interest in cross-pollination and investment in academic research from commercial organizations.
* *Cheap hardware*. Now that machines with a decent amount of processing power can be hired for just a few cents an hour, many more people can afford to do large-scale data processing. They can't afford the traditional high prices of professional data software though, so they've turned to open-source alternatives.

These trends have led to a Cambrian Explosion of new tools, which means when you're planning a new data project you have a lot to choose from. This guide aims to help you make those choices by describing each tool from the perspective of a developer looking to use it in an application. Wherever possible, this will be from my first-hand experiences, or from colleagues who have used the systems in production environments. I've made a deliberate choice to include my own opinions and impressions, so you should see this guide as a starting point for exploring the tools, not the final word. I'll do my best to explain what I like about each service, but your tastes and requirements may well be quite different.

Since the goal is to help experienced engineers navigate the new data landscape, the guide only covers tools that have been created or risen to prominence in the last few years. For example, Postgres is not covered because it's been widely used for over a decade, but its Greenplum derivative is newer and less well-known, so it is included.
