This is the repository that contains my bachelor thesis for the Artificial Intelligence program of the University of Amsterdam. The thesis lead to this paper. The code may be reproduced by referring to the paper. Questions and/or comments may be sent to my email.
This folder contains the data sets obtained from the UCI machine learning repository, separated using two different folders. The cleaning and preprocessing scripts have to be executed in order to use the data for the baseline/anonymisation/synthetic data.
The benchmark folder contains the Python scripts for three different machine learning models and one script (run_results.py) that combines these three models to obtain results.
The folders for Anonymisation and Synthetic data have separate readme files with introduction to the code.
The gitignore file is set up to ignore processed data sets and results to keep the repository small in size. It also ignores .ipynb files and the checkpoints of these as these were used for development purposes only.