A pipeline for converting images into textual representations, integrating multiple models.
Discovered on HuggingFace via HuggingFace:unknown