Learning Spark: Lightning-Fast Data Analysis (Paperback) | Released: 27 Feb 2015

Name: Learning Spark: Lightning-Fast Data Analysis
Brand: Bookswagon
Price: 2108 INR
Availability: OutOfStock
Rating: 4 (1 reviews)

By: Patrick Wendell (Author), Andy Konwinski (Author), Holden Karau (Author), Mark Hamstra (Author) | Publisher: O'Reilly Media | Publisher Imprint: O'Reilly Media

4 | 6 Reviews

₹2,108 M.R.P.: ~~₹3,100~~
Save: ₹992 (32%)

There is a newer edition of this item:

Learning Spark: Lightning-Fast Data Analytics
₹4,340 ~~₹7,000~~
International Edition

Data is bigger, arrives faster, and comes in a variety of formats, and it all needs to be processed at scale for analytics or machine learning. But how can you process such varied workloads efficiently? Enter Apache Spark.

Updated to include Spark 3.0, this second edition shows data engineers and data scientists why structure and unification in Spark matter. Specifically, this book explains how to perform simple and complex data analytics and employ machine learning algorithms. Through step-by-step walk-throughs, code snippets, and notebooks, you'll be able to:

Learn Python, SQL, Scala, or Java high-level Structured APIs
Understand Spark operations and SQL Engine
Inspect, tune, and debug Spark operations with Spark configurations and Spark UI
Connect to data sources: JSON, Parquet, CSV, Avro, ORC, Hive, S3, or Kafka
Perform analytics on batch and streaming data using Structured Streaming
Build reliable data pipelines with open source Delta Lake and Spark
Develop machine learning pipelines with MLlib and productionize models using MLflow

ISBN-10: 1449358624
ISBN-13: 9781449358624
No of Pages: 276
Language: English
Imprint: O'Reilly Media
Weight (g): 453
Dimensions (mm): 226x13x183

Premium quality

Bookswagon upholds quality by delivering books in pristine condition. Quality, service, and satisfaction are everything to us!

Easy return

Not satisfied with this product? Keep it in its original condition and packaging to use the easy return policy.

Certified product

First impression is the last impression! Check the book's certification page, ISBN, publisher's name, copyright page, and print quality.

Secure checkout

Security at its finest! Log in, browse, purchase, and pay: every step is safe and secure.

Money-back guarantee

It's all about customers! If you have a bad experience with the product, get the amount you paid back after returning it.

On-time delivery

At your doorstep on time! Get this book delivered without any delay.

About the Book

Data in all domains is getting bigger. How can you work with it efficiently? Recently updated for Spark 1.3, this book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. With Spark, you can tackle big datasets quickly through simple APIs in Python, Java, and Scala. This edition includes new information on Spark SQL, Spark Streaming, setup, and Maven coordinates.

Written by the developers of Spark, this book will have data scientists and engineers up and running in no time. You'll learn how to express parallel jobs with just a few lines of code, and cover applications from simple batch jobs to stream processing and machine learning.

Quickly dive into Spark capabilities such as distributed datasets, in-memory caching, and the interactive shell
Leverage Spark's powerful built-in libraries, including Spark SQL, Spark Streaming, and MLlib
Use one programming paradigm instead of mixing and matching tools like Hive, Hadoop, Mahout, and Storm
Learn how to deploy interactive, batch, and streaming applications
Connect to data sources including HDFS, Hive, JSON, and S3
Master advanced topics like data partitioning and shared variables

About the Author:

Holden Karau is a transgender Canadian and an active open source contributor. When not in San Francisco working as a software development engineer at IBM's Spark Technology Center, Holden talks internationally on Spark and holds office hours at coffee shops at home and abroad. She makes frequent contributions to Spark, specializing in PySpark and Machine Learning. Prior to IBM she worked on a variety of distributed, search, and classification problems at Alpine, Databricks, Google, Foursquare, and Amazon. She graduated from the University of Waterloo with a Bachelor of Mathematics in Computer Science. Outside of software she enjoys playing with fire, welding, scooters, poutine, and dancing.

Most recently, Andy Konwinski co-founded Databricks. Before that he was a PhD student and then postdoc in the AMPLab at UC Berkeley, focused on large scale distributed computing and cluster scheduling. He co-created and is a committer on the Apache Mesos project. He also worked with systems engineers and researchers at Google on the design of Omega, their next generation cluster scheduling system. More recently, he developed and led the AMP Camp Big Data Bootcamps and first Spark Summit, and has been contributing to the Spark project.

Patrick Wendell is an engineer at Databricks as well as a Spark Committer and PMC member. In the Spark project, Patrick has acted as release manager for several Spark releases, including Spark 1.0. Patrick also maintains several subsystems of Spark's core engine. Before helping start Databricks, Patrick obtained an M.S. in Computer Science at UC Berkeley. His research focused on low latency scheduling for large scale analytics workloads. He holds a B.S.E. in Computer Science from Princeton University.

Matei Zaharia is the creator of Apache Spark and CTO at Databricks. He holds a PhD from UC Berkeley, where he started Spark as a research project. He now serves as its Vice President at Apache. Apart from Spark, he has made research and open source contributions to other projects in the cluster computing area, including Apache Hadoop (where he is a committer) and Apache Mesos (which he also helped start at Berkeley).

Product Details

ISBN-13: 9781449358624
Publisher: O'Reilly Media
Publisher Imprint: O'Reilly Media
Depth: 19
Language: English
Returnable: Y
Spine Width: 13 mm
Weight: 453 gr

ISBN-10: 1449358624
Publisher Date: 27 Feb 2015
Binding: Paperback
Height: 226 mm
No of Pages: 276
Series Title: English
Sub Title: Lightning-Fast Data Analysis
Width: 183 mm
