A pipeline that converts images to text using advanced reasoning models.
Discovered on Hugging Face (source: HuggingFace:unknown).
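
The description above does not say which models or libraries the pipeline uses, so the following is only a minimal sketch of the general image-to-text shape: read image bytes, identify the format, hand the image and a prompt to a model, and return the cleaned text. Every name here (`ImageInput`, `detect_mime_type`, `image_to_text`, `stub_model`) is hypothetical, and `stub_model` is a stand-in for whatever reasoning model the real pipeline calls.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class ImageInput:
    """Raw image bytes plus the detected MIME type (hypothetical container)."""
    data: bytes
    mime_type: str


# Assumption: a model is any callable that takes an image and a prompt
# and returns text. The real pipeline's interface is not documented here.
TextModel = Callable[[ImageInput, str], str]


def detect_mime_type(data: bytes) -> str:
    """Identify PNG or JPEG from the file's leading magic bytes."""
    if data.startswith(b"\x89PNG\r\n\x1a\n"):
        return "image/png"
    if data.startswith(b"\xff\xd8\xff"):
        return "image/jpeg"
    raise ValueError("unsupported image format")


def image_to_text(data: bytes, model: TextModel,
                  prompt: str = "Describe this image.") -> str:
    """Wrap the bytes, call the model, and strip surrounding whitespace."""
    image = ImageInput(data=data, mime_type=detect_mime_type(data))
    return model(image, prompt).strip()


def stub_model(image: ImageInput, prompt: str) -> str:
    """Placeholder model: echoes metadata instead of reading the image."""
    return f" [{image.mime_type}, {len(image.data)} bytes] {prompt} "


if __name__ == "__main__":
    fake_png = b"\x89PNG\r\n\x1a\n" + b"\x00" * 4
    print(image_to_text(fake_png, stub_model))
```

Passing the model in as a plain callable keeps the sketch independent of any particular model provider; a real model client could be substituted for `stub_model` without changing `image_to_text`.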