O'Reilly book: Data Algorithms with Spark
Mahmoud Parsian edited this page Aug 1, 2022
Welcome to the Data Algorithms with Spark wiki!
The goals of the Data Algorithms with Spark book are to:
- provide simple examples that developers can cut, paste, and modify for use in production environments
- express working data algorithms with PySpark transformations
"... This book will be a great resource for both readers looking to implement existing algorithms in a scalable fashion and readers who are developing new, custom algorithms using Spark. ..."
Dr. Matei Zaharia, Original Creator of Apache Spark (from the book's foreword)
The book has:
- 12 chapters
- bonus chapters (such as DataFrame tutorials, word count, TF-IDF, anagrams, ...)
Solutions for the chapters are provided in Python (PySpark) and Scala. Note that the Scala solutions are still in progress.