WED, DEC 06 2023-theGBJournal|First teased in May as its next-generation foundation model, Google on Wednesday launched its Gemini 1.0, and is making it available through Bard.
The Gemini is considered Google’s “most capable and general model,” which can “understand, operate across, and combine” text, code, audio, images, and video. Being “natively multimodal” allows for better understanding, reasoning, and coding capabilities.
Gemini 1.0 is available in three different sizes that span from data centers to phones: the Nano which it will use for specific tasks and mobile devices, Pro which scales across a wide range of tasks, and Ultra sizes, its most capable and largest model.
The current approach to creating multimodal models involves “training separate components for different modalities and then stitching them together.” While good at certain tasks, Google says these models “struggle with more conceptual and complex reasoning.”
In terms of performance, Google showed Gemini Ultra surpassing GPT-4 in text-based benchmarks that measure reasoning, math, and code.
The company is particularly touting how Gemini Ultra is the “first model to outperform human experts on MMLU (massive multitask language understanding)” at 90.0%.
That benchmark “uses a combination of 57 subjects such as math, physics, history, law, medicine, and ethics for testing both world knowledge and problem-solving abilities,” with OpenAI’s offering scoring 86.4%.
Google says Bard with Gemini Pro is rolling out today in English for 170 countries/territories, with UK and European availability “in the near future.” Initially, Gemini Pro will power text-based prompts, with support for “other modalities coming soon.”
Meanwhile, Gemini Ultra is coming early next year. Google is currently “completing extensive trust and safety checks,” as well as model refinements, before broader availability for developers and enterprise customers.
It will be available through a new “Bard Advanced” offering, which Google positions as providing early access to its most advanced models and capabilities, like Gemini Ultra.
Over the coming months, Gemini is coming to Google Search, Chrome, Duet AI, and Ads. Early testing has shown Gemini reducing SGE (Search Generative Experience) latency by 40%.
X-@theGBJournal|Facebook-the Government and Business Journal|email:gbj@govbusinessjournal.com| govandbusinessj@gmail.com