Tech giant Google has officially rolled out Gemini, its latest artificial intelligence model that it claims has surpassed OpenAI’s GPT-4.
On Dec. 6, Google CEO Sundar Pichai and Google DeepMind CEO and co-founder Demis Hassabis announced the launch of Gemini in a company blog post
The AI model has been optimized for different sizes and use cases (Ultra, Pro, Nano) and built to be multimodal to understand and combine different types of information.
The model is also advanced in math and specialized coding, as compared to OpenAI’s GPT-4, which cannot perform math.
Meanwhile, Google claims its Ultra version achieves “state-of-the-art performance” across 30 out of 32 academic benchmarks used in LLM (large language model) development.
Furthermore, it scores 90% on a massive multitask language understanding (MMLU) test, surpassing human expert performance, according to Google.
Google’s chief scientist, Jeff Dean, said Gemini Ultra is the first model “to achieve human-expert performance on MMLU across 57 subjects with a score above 90%.”
I’m very excited to share our work on Gemini today! Gemini is a family of multimodal models that demonstrate really strong capabilities across the image, audio, video, and text domains. Our most-capable model, Gemini Ultra, advances the state of the art in 30 of 32 benchmarks,… pic.twitter.com/sQfxBy9tpT
— Jeff Dean (@) (@JeffDean) December 6, 2023
The system has also been designed from the ground up to seamlessly reason across text, images, audio, video, which puts it a step ahead of its competitors.
“We designed Gemini to be multimodal from the start,” said Dean before adding, “rather than starting with a purely text model and then grafting on vision and audio encoders after the fact.”
Gemini also has advanced programming skills, including generating high-quality code using AlphaCode 2, an advanced code generation system. It can also solve complex programming problems and collaborate with developers.
According to AI expert Rowan Cheung, Gemini Pro outperformed GPT-3.5 in six out of eight benchmarks, “making it the most powerful free chatbot on the market today.”
For those wanting to take the new AI model for a spin, a fine-tuned version of Gemini Pro has already been rolled out to Google’s version of ChatGPT — known as Bard — according to Google.
“This is the biggest upgrade to Bard since it launched. It will be available in English in more than 170 countries and territories, and we plan to expand to different modalities and support new languages and locations in the near future,” said the company.
Gemini is also rolling out on Google’s flagship phone, the Pixel 8 Pro.
“Pixel 8 Pro is the first smartphone engineered to run Gemini Nano, which is powering new features like Summarize in the Recorder app and rolling out in Smart Reply in Gboard, starting with WhatsApp — with more messaging apps coming next year,” it said.
It will be deployed across more Google products and services like Search, Ads, and Chrome “in the coming months,” it added.
The tech giant has also started experimenting with Gemini to power its web-dominant search engine to make searching a generative experience.
Google unveiled Gemini earlier this year touting its capabilities and claiming that it would be more powerful than ChatGPT.