Musk’s xAI reveals Grok 1.5 Vision, claims top spatial understanding
By subscribing, you agree to our Terms of Use and Policies You may unsubscribe at any time.
Elon Musk’s artificial intelligence (AI) company, xAI, has unveiled its first multimodal model, Grok 1.5 Vision, as it looks to compete with OpenAI.
As per the preview, in addition to understanding text, the AI model can also work with documents, charts, diagrams, screenshots, and photos.
One of OpenAI’s funders, Musk advocates that AI can help humanity in unimaginable ways. However, after falling out with the vision of how OpenAI should proceed, Musk started xAI last year with a group of influential AI researchers keen on developing AI models more openly.
Featured Video RelatedLast November, the company rolled out the first iteration of its AI model, Grok. Further, it emphasized its push for openness by making its base model weights and network architecture open-sourced last month. The pace at which the company is working is evident, and its first multimodal AI model arrived barely a month after its architecture was made open-source.
What can Grok 1.5V do?
According to its website, the Grok 1.5V connects the physical and digital worlds. The company has highlighted seven examples of its capabilities to explain how the multimodal model works.
A user can share a picture of a flowchart with Grok, and the AI model can translate it into Python code. By simply showing the model a nutrition label, a user can inquire how many calories one would consume by consuming certain portions of the product.
While this might seem like an easy case of multiplication, the AI model can also take a child’s drawing and build an entire bedtime story using it. The model can do the converse, too. Show it a meme, and it will explain why it is funny and provide the context needed to understand it.
The AI model isn’t just built for play. It can convert a table into CSV format or help you correct a piece of code that might not be working. But if you need home repair advice, just share images of the affected area, and the model is designed to help you with that as well, the company lists on its website.
xAI has also released a new benchmark dubbed RealWorldQA to evaluate the spatial understanding shown by multimodal models. From examples shared by the company, Grok 1.5V can look at images and differentiate between objects that are comparatively bigger or give driving advice as well.
Grok 1.5V also handsomely beats other AI models on this benchmark as well as others, according to the company’s data shared in this chart.
What’s in store for the future?
With Elon Musk stating in a recent interview that he expects AI to be smarter than any human by the end of 2025, all eyes are on what improvements his company will bring to the AI race in the upcoming months.
xAI has said that in its aim to build beneficial artificial general intelligence (AGI) that can understand the universe, the company will make significant improvements to the capabilities of its models in other areas, such as audio, voice, and video, in the coming months.
Grok 1.5V will soon become available for the company’s testers and existing users, the company added in its blog.
(责任编辑:新闻中心)
- Scientists detect water sloshing on Mars. There could be a lot.
- Barty wary of US Open return
- Very good husband Prince Harry flattens Meghan Markle's hair in the wind
- North Korean state media airs footage of military parade
- Ford can make your Mustang Mach
- New Grok response directs users to Vote.gov for election questions
- South Korea asks North Korea to explain fire at Kaesong industrial complex: ministry
- Amazon has quietly introduced a cool delivery feature
- The Wednesday Slatest newsletter.
- Naver, Kakao strive to combat deepfake porn spreading online
- 带领乡亲订报读报,发展产业富民兴村
- “飞机坝”地名的由来
- Inauguration singer and her trans sister would like to talk to President Trump
-
21 College and University Museums
University campuses have no shortage of knowledge—from eclectic libraries filled to the brim with ra ...[详细] -
The Georgian port city of Poti wants to be a tourist destination.
Each week, Roads & Kingdoms and Slate publish a new dispatch from around the globe. For more foreign ...[详细] -
Jony Ive based the design of Apple AirPods on Star Wars Stormtroopers
Your Apple AirPods have a darker origin than you might guess.Jony Ive, the design mastermind behind ...[详细] -
Your phone could charge in seconds with this magic battery
It typically takes around two hours to charge your iPhone from dead to fully juiced up. But what if ...[详细] -
The vast majority of our portable electronic gadgets, and the new wave of electric transportation, a ...[详细]
-
Brie Larson brought Emma Stone to tears at the Oscars, but in a good way
We wish we could be in that hug.The 2017 Academy Awards were filled with emotional moments from Viol ...[详细] -
GE designs massive floating turbine to take wind energy into deep water
While offshore wind farms continue to play a growing part in the renewable energy mix, particularly ...[详细] -
Hackers stole $85 million in Ether to save it from *the real crooks
*The clock was ticking. Thieves stole $32 million worth of ether out of a popular Ethereum wallet, a ...[详细] -
PCB official under probe for conflict of interest
ListentoarticleThe Pakistan Cricket Board (PCB) has launched an investigation into a senior official ...[详细] -
S. Korea, US decide to postpone upcoming joint air exercises for diplomacy
South Korea and the United States decided to put off their wintertime combined air exercises to supp ...[详细]
A Journey Into the Mind of Stephen King
This is how Mark Zuckerberg's Oculus VR gloves actually work
- Students get free entry at second Rawalpindi Test but what’s the catch?
- Kind RA fulfills student's birthday dream of hearing a bedtime story
- The Wednesday Slatest newsletter.
- The 10 alien species we'd most like to invade Earth right now
- Students get free entry at second Rawalpindi Test but what’s the catch?
- South Korea asks North Korea to explain fire at Kaesong industrial complex: ministry
- UN aviation agency chief voices concern over North Korea's unannounced missile launches