Elon Musk’s company xAI has announced a new feature for its Grok AI model called Vision. The feature lets the chatbot scan real-world objects, text, and environments through the user’s device camera in real time.
Summary:
1. Elon Musk’s xAI chatbot Grok has launched a new feature called Vision, which will allow users to scan real-world objects, text, and environments through their device camera.
2. The update also brings multilingual audio and real-time search in voice mode, letting Grok interact with users in languages such as Hindi, Spanish, and Japanese.
3. Grok also has a memory feature that lets it recall past interactions and give more personalized, context-aware responses.
The update is available free on Android and iOS, and Grok Vision works much like existing features in OpenAI’s ChatGPT and Google’s Gemini.
According to xAI, Grok Vision will let users scan anything in real time, whether an object, a symbol, text, or a document, simply by pointing the camera at it and asking a few simple questions.
For example, if you show Grok an office gate and ask for information about it, it will respond with relevant answers in real time as the camera moves. Sounds impressive, right? That real-time responsiveness is what makes the update stand out.
The update was also officially announced on X (formerly Twitter) with the statement: “I’m all ears and eyes, share your world with me.”
Additionally, Grok has launched two more features: multilingual audio and real-time search in voice mode. The multilingual feature lets the AI model interact with users in their own languages, including Hindi, Spanish, and Japanese. The voice search tool lets users ask their questions verbally; Grok then retrieves the information and provides it to the user.
As of now, these features are available to paid subscribers of the SuperGrok plan, which costs $30 per month. It has not yet been confirmed whether the features will also be available to free users.
In related news, Grok has added a memory feature that lets it recall past interactions and give more personalized, context-aware responses. According to sources, this functionality improves Grok’s ability to tailor its responses based on a more accurate understanding of each user’s preferences as they evolve.
Although this is a huge step forward for Grok, the concept is far from original in the AI space. ChatGPT has long had a comparable memory feature, which was recently updated to let users access their entire chat history. Similarly, Google’s Gemini, which is also an AI model, uses memory retention to provide personalized responses based on individual interactions.
Similarly, the vision feature is nothing novel. Major competitors such as OpenAI, Google (with its Gemini AI model), and Apple released comparable features for their AI models before Grok did. However, the good news is that Grok is constantly evolving and working to close most of the gaps. For example, Telegram has partnered with Grok to extend its reach. So, in addition to the Grok app and X (formerly Twitter), users will now be able to communicate with the AI model through Telegram. Overall, it will be interesting to see how xAI’s Grok develops and what changes come next.