Learning Spark

Learning Spark

Data is bigger, arrives faster, and comes in a variety of formats—and it all needs to be processed at scale for analytics or machine learning. But how can you process such varied workloads efficiently? Enter Apache Spark. Updated to include Spark 3.0, this second edition shows data engineers and data scientists why structure and unification in Spark matters. Specifically, this book explains how to perform simple and complex data analytics and employ machine learning algorithms. Through step-by-step walk-throughs, code snippets, and notebooks, you’ll be able to: Learn Python, SQL, Scala, or Java high-level Structured APIs Understand Spark operations and SQL Engine Inspect, tune, and debug Spark operations with Spark configurations and Spark UI Connect to data sources: JSON, Parquet, CSV, Avro, ORC, Hive, S3, or Kafka Perform analytics on batch and streaming data using Structured Streaming Build reliable data pipelines with open source Delta Lake and Spark Develop machine learning pipelines with MLlib and productionize models using MLflow


Author
Publisher "O'Reilly Media, Inc."
Release Date
ISBN 1492049999
Pages 400 pages
Rating 4/5 (99 users)

More Books:

Learning Spark
Language: en
Pages: 400
Authors: Jules S. Damji
Categories: Computers
Type: BOOK - Published: 2020-07-16 - Publisher: "O'Reilly Media, Inc."

Data is bigger, arrives faster, and comes in a variety of formats—and it all needs to be processed at scale for analytics or machine learning. But how can you
Learning Spark
Language: en
Pages: 276
Authors: Holden Karau
Categories: Computers
Type: BOOK - Published: 2015-01-28 - Publisher: "O'Reilly Media, Inc."

This book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. You'll learn how to express
Learning Spark SQL
Language: en
Pages: 452
Authors: Aurobindo Sarkar
Categories: Computers
Type: BOOK - Published: 2017-09-07 - Publisher: Packt Publishing Ltd

Design, implement, and deliver successful streaming applications, machine learning pipelines and graph applications using Spark SQL API About This Book Learn ab
Apache Spark 2.x Machine Learning Cookbook
Language: en
Pages: 666
Authors: Siamak Amirghodsi
Categories: Computers
Type: BOOK - Published: 2017-09-22 - Publisher: Packt Publishing Ltd

Simplify machine learning model implementations with Spark About This Book Solve the day-to-day problems of data science with Spark This unique cookbook consist
Hands-On Deep Learning with Apache Spark
Language: en
Pages: 322
Authors: Guglielmo Iozzia
Categories: Computers
Type: BOOK - Published: 2019-01-31 - Publisher: Packt Publishing Ltd

Deep Learning is a subset of Machine Learning where data sets with several layers of complexity can be processed. This book teaches you the different techniques
Apache Spark Deep Learning Cookbook
Language: en
Pages: 474
Authors: Ahmed Sherif
Categories: Computers
Type: BOOK - Published: 2018-07-13 - Publisher: Packt Publishing Ltd

A solution-based guide to put your deep learning models into production with the power of Apache Spark Key Features Discover practical recipes for distributed d
Apache Spark Machine Learning Blueprints
Language: en
Pages: 252
Authors: Alex Liu
Categories: Computers
Type: BOOK - Published: 2016-05-30 - Publisher: Packt Publishing Ltd

Develop a range of cutting-edge machine learning projects with Apache Spark using this actionable guide About This Book Customize Apache Spark and R to fit your
Machine Learning with Apache Spark Quick Start Guide
Language: en
Pages: 240
Authors: Jillur Quddus
Categories: Computers
Type: BOOK - Published: 2018-12-26 - Publisher: Packt Publishing Ltd

Combine advanced analytics including Machine Learning, Deep Learning Neural Networks and Natural Language Processing with modern scalable technologies including
Practical Apache Spark
Language: en
Pages: 280
Authors: Subhashini Chellappan
Categories: Computers
Type: BOOK - Published: 2018-12-12 - Publisher: Apress

Work with Apache Spark using Scala to deploy and set up single-node, multi-node, and high-availability clusters. This book discusses various components of Spark
Apache Spark 2.x for Java Developers
Language: en
Pages: 350
Authors: Sourav Gulati
Categories: Computers
Type: BOOK - Published: 2017-07-26 - Publisher: Packt Publishing Ltd

Unleash the data processing and analytics capability of Apache Spark with the language of choice: Java About This Book Perform big data processing with Spark—