Note: This is a base model, so you do raw text completion with it.
Model arch is customized GPT2 small (124M) trained to reproduce the speedrun, see also this.
Model repo is here.