Tool that converts images and text into generated text outputs.
Discovered on HuggingFace via HuggingFace:unknown