A framework designed for efficient inference of text generation models.
Discovered on HuggingFace via HuggingFace:unknown