The latest version of xAI's Grok can process images
Briefly

Grok-1.5V is xAI's first-generation multimodal AI model. It can process text, diagrams, charts, screenshots, and photographs, for tasks such as translating a flow chart into Python code or explaining a meme.
xAI introduced Grok-1.5V shortly after Grok-1.5, an update that improved coding and math capabilities and added the ability to process longer contexts, drawing on more sources to better understand inquiries.
xAI released its RealWorldQA dataset alongside Grok-1.5V. The dataset contains 700 images for evaluating AI models, and in xAI's tests its technology outperformed OpenAI's GPT-4V and Google's Gemini Pro 1.5.
Read at Engadget