A tool for running inference on GGUF formatted models for efficient text-generation results.
Discovered on HuggingFace via HuggingFace:unknown