- Extract: Data is extracted from a SuperHero API by Yoann Cribier
- Transform: It will be transformed in Python from its JSON format into a form that fits the MySQL database
- Load: Then loaded into a MySQL database
- Before I execute the ETL, I need to take more time transforming the data. The data was transformed but I need to replace the redundant data that was loaded. There were entities that were empty values, that needed to be NULL, and names that were repeated in the "alias" and "name" column. This will take time away from analysis that needs to be made. Instead of making those analysis, data cleaning would have to be done before; which is what I have done in MySQL.