A curated collection of interesting GitHub repositories
View the Project on GitHub
Minimal vLLM engine for fast offline LLM inference with a clean Python codebase
View on index