A pipeline for converting images to text and then processing that text further.
Discovered on HuggingFace via HuggingFace:unknown