
At a Glance
ePUB
eBook
$5.05
or 4 interest-free payments of $1.26 with
Instant Digital Delivery to your Kobo Reader App
Industries are using Hadoop extensively to analyze their data sets. The reason is that Hadoop framework is based on a simple programming model (MapReduce) and it enables a computing solution that is scalable, flexible, fault-tolerant and cost effective. Here, the main concern is to maintain speed in processing large datasets in terms of waiting time between queries and waiting time to run the program.
Spark was introduced by Apache Software Foundation for speeding up the Hadoop computational computing software process.
As against a common belief, Spark is not a modified version of Hadoop and is not, really, dependent on Hadoop because it has its own cluster management. Hadoop is just one of the ways to implement Spark.
Spark uses Hadoop in two ways - one is storage and second is processing. Since Spark has its own cluster management computation, it uses Hadoop for storage purpose only.
on
ISBN: 1230003598597
Published: 28th February 2020
Format: ePUB
Language: English
Publisher: Hoang Tran
























