It parses files from https://developer.imdb.com/non-commercial-datasets/ in C++ and saves it to a postgres Database
Inspired by https://github.com/Totto16/imdb-dataset / https://github.com/Totto16/imdb-dataset-parser/ / https://github.com/andreivinaga/imdb-dataset
This preseeds a postgresql database with the whole data to a docker image, that can be used easily.
Use the docker images on GitHub as base database image, e.g latest or also specific dates 20241126. Use it like a normal postgres:16-alpine
docker image, and you have the table imdb
, which already has the data.
The code under this repo is under MIT License, but the data from IMDb is under a Non-Commercial License, see here