O'Reilly book: Data Algorithms with Spark

Mahmoud Parsian edited this page Aug 1, 2022 · 1 revision

The goals of the Data Algorithms with Spark book are to:

  • provide simple examples that developers can adapt (cut, paste, and modify) for use in production environments
  • express working data algorithms as PySpark transformations

"... This book will be a great resource for
both readers looking to implement existing
algorithms in a scalable fashion and readers
who are developing new, custom algorithms
using Spark. ..."

Dr. Matei Zaharia
Original Creator of Apache Spark

FOREWORD by Dr. Matei Zaharia

Book Chapters

The book has


Chapter Solutions in PySpark and Scala

Solutions for each chapter are provided in Python (PySpark) and Scala. Note that the Scala solutions are still in progress.

