turboderp-org/exllamav2
run local LLMs fast on consumer GPUs with flexible quantization

View on index · View in 3D Map
// SURVEILLANCE FEED
Discovered repositories from the open source frontier
run local LLMs fast on consumer GPUs with flexible quantization

View on index · View in 3D Map