Compare language models I trained from scratch, 3.2M to 3.12 billion parameters.
Read the blog · GitHub · About
CPU inference, Q4_K_M GGUF for large models. First load can take 1-2 minutes.