## xAI’s Grok Gains Sight: New Vision Feature Brings Real-World Understanding
xAI’s Grok chatbot has gained a real-time vision feature, putting it in direct competition with Google’s Gemini and OpenAI’s ChatGPT. Announced Tuesday, Grok Vision lets users ask questions about the world around them through their smartphone camera.
The new capability, currently available on the iOS version of the Grok app, lets users point their phone at objects like products, signs, and documents, enabling Grok to analyze the image and provide context or answer questions. Imagine pointing your phone at a foreign street sign and asking Grok for a translation, or scanning a product label to learn more about its ingredients.
By recognizing the real-world objects in view, Grok can interpret a user’s question in context. The feature mirrors functionality already present in rival chatbots, signalling that real-time vision is quickly becoming an expected feature in the competitive AI landscape.
Beyond Vision, xAI also announced the launch of multilingual audio and real-time search within Grok’s voice mode. Android users can access these features, provided they are subscribed to the $30-per-month SuperGrok plan. The multilingual audio supports Spanish, French, Turkish, Japanese, and Hindi.
Grok has been gaining new features at a steady pace. Earlier this month, xAI integrated a “memory” component, allowing the chatbot to draw upon details from previous conversations. Grok also recently gained a canvas-like tool for document and app creation. Together, these additions position Grok as an increasingly versatile AI assistant.