A curated collection of interesting GitHub repositories
View the project on GitHub: tom-doerr/repo_posts
Minimal vLLM engine for fast offline LLM inference with a clean Python codebase