GPT-2 Neel Nanda's implementation from scratch Here's my version of GPT-2 following Neel Nanda's tutorial. WIP