Stream Processing with Apache Spark: Best Practices for Scaling and Optimizing Apache Spark

Before you can build analytics tools to gain quick insights, you first need to know how to process data in real time. With this practical guide, developers familiar with Apache Spark will learn how to put this in-memory framework to use for streaming data. You'll discover how Spark enables you to write streaming jobs in almost the same way you write batch jobs.

Authors Gerard Maas and François Garillot help you explore the theoretical underpinnings of Apache Spark. This comprehensive guide features two sections that compare and contrast the streaming APIs Spark now supports: the original Spark Streaming library and the newer Structured Streaming API.


Learn fundamental stream processing concepts and examine different streaming architectures
Explore Structured Streaming through practical examples; learn different aspects of stream processing in detail
Create and operate streaming jobs and applications with Spark Streaming; integrate Spark Streaming with other Spark APIs
Learn advanced Spark Streaming techniques, including approximation algorithms and machine learning algorithms
Compare Apache Spark to other stream processing projects, including Apache Storm, Apache Flink, and Apache Kafka Streams

452 pages, Kindle Edition

Published June 5, 2019

16 people are currently reading
69 people want to read

About the author

Gerard Maas

5 books

Ratings & Reviews

Community Reviews

5 stars: 2 (11%)
4 stars: 7 (38%)
3 stars: 5 (27%)
2 stars: 4 (22%)
1 star: 0 (0%)
Displaying 1 - 3 of 3 reviews
Alex Ott
Author, 3 books, 209 followers
December 31, 2019
Very good description of Spark Streaming & Spark Structured Streaming, with many examples and useful tips. The book is quite new and covers the latest developments (Spark 2.3, and mentions 2.4 in several places) in the area of streaming data processing in Spark.

I really wish I had this book ~4 years ago when we built our system on top of Spark Streaming. One star was taken because of the errors...
114 reviews, 1 follower
August 10, 2023
A complete manual on Apache Spark streaming components. It is detailed and contains a few practical examples. I didn't check the online resources but they seem a good supplement. I skipped part 3 as it covers the "legacy" streaming component.