This repository provides ressources used by Alliage Academy.
It is intended to become an Ansible collection to interface more easily with the future releases of TDP.
The following datasets are used across the various labs from Alliage Academy. Each script download the data into the HDFS file system.
PySpark program to test the gain in speed of query execution on an Iceberg table.
A notebook to visualize this gain, on the docker-compose image proposed by Apache Iceberg.