GPUburnout Models

Compare language models I trained from scratch, ranging from 3.2M to 3.12B parameters.


Inference runs on CPU. The large models are served as Q4_K_M-quantized GGUF files. The first load can take 1-2 minutes.
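To give a feel for why the large models are quantized for CPU use, here is a rough back-of-the-envelope sketch of weight storage at 16-bit precision versus Q4_K_M. The figure of about 4.8 bits per weight for Q4_K_M is an assumed approximation (the format mixes block types, so the true average varies by model), and the function name is only illustrative. The estimate covers weights alone, not the KV cache or runtime overhead.

```python
def weight_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate storage for the weights alone, in gigabytes (1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumed average bits per weight. FP16 is exact; the Q4_K_M value is an
# approximation, since the format mixes several block layouts.
FP16_BPW = 16.0
Q4_K_M_BPW = 4.8

# Largest model size mentioned on this page: 3.12 billion parameters.
N_PARAMS = 3.12e9

fp16_gb = weight_size_gb(N_PARAMS, FP16_BPW)
q4_gb = weight_size_gb(N_PARAMS, Q4_K_M_BPW)

print(f"FP16 weights:   ~{fp16_gb:.2f} GB")
print(f"Q4_K_M weights: ~{q4_gb:.2f} GB (assuming {Q4_K_M_BPW} bits/weight)")
```

Under these assumptions the quantized weights take a bit under a third of the FP16 size, which is the main reason a multi-billion-parameter model fits comfortably in the memory of a CPU-only host.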
