A curated collection of interesting GitHub repositories
LLM inference in C/C++ for running and serving models locally on various hardware