Google launches two new open LLMs

Date:

Share post:


Barely a week after launching the latest iteration of its Gemini models, Google today announced the launch of Gemma, a new family of lightweight open-weight models. Starting with Gemma 2B and Gemma 7B, these new models were “inspired by Gemini” and are available for commercial and research usage.

Google did not provide us with a detailed paper on how these models perform against similar models from Meta and Mistral, for example, and only noted that they are “state-of-the-art.” The company did note that these are dense decoder-only models, though, which is the same architecture it used for its Gemini models (and its earlier PaLM models) and that we will see the benchmarks later today on Hugging Face’s leaderboard.

To get started with Gemma, developers can get access to ready-to-use Colab and Kaggle notebooks, as well as integrations with Hugging Face, MaxText and Nvidia’s NeMo. Once pre-trained and tuned, these models can then run everywhere.

While Google highlights that these are open models, it’s worth noting that they are not open-source. Indeed, in a press briefing ahead of today’s announcement, Google’s Janine Banks stressed the company’s commitment to open source but also noted that Google is very intentional about how it refers to the Gemma models.

“[Open models] has become pretty pervasive now in the industry,” Banks said. “And it often refers to open weights models, where there is wide access for developers and researchers to customize and fine-tune models but, at the same time, the terms of use — things like redistribution, as well as ownership of those variants that are developed — vary based on the model’s own specific terms of use. And so we see some difference between what we would traditionally refer to as open source and we decided that it made the most sense to refer to our Gemma models as open models.”

That means developers can use the model for inferencing and fine-tune them at will and Google’s team argues that even though these model sizes are a good fit for a lot of use cases.

“The generation quality has gone significantly up in the last year,” Google DeepMind product management director Tris Warkentin said. “things that previously would have been the remit of extremely large models are now possible with state-of-the-art smaller models. This unlocks completely new ways of developing AI applications that we’re pretty excited about, including being able to run inference and do tuning on your local developer desktop or laptop with your RTX GPU or on a single host in GCP with Cloud TPUs, as well.”

That is true of the open models from Google’s competitors in this space as well, so we’ll have to see how the Gemma models perform in real-world scenarios.

In addition to the new models, Google is also releasing a new responsible generative AI toolkit to provide “guidance and essential tools for creating safer AI applications with Gemma,” as well as a debugging tool.



Source link

Lisa Holden
Lisa Holden
Lisa Holden is a news writer for LinkDaddy News. She writes health, sport, tech, and more. Some of her favorite topics include the latest trends in fitness and wellness, the best ways to use technology to improve your life, and the latest developments in medical research.

Recent posts

Related articles

AI chip startup DEEPX secures $80M Series C at a $529M valuation 

DEEPX is a South Korean on-device AI chip (NPU, or neural processing unit) startup that makes hardware...

Infighting among fintech players has caused TabaPay to ‘pull out’ from buying bankrupt Synapse

TabaPay has abandoned its plans to purchase the assets of troubled banking-as-a-service startup Synapse, TabaPay confirmed to...

Google built some of the first social apps for Android, including Twitter and others

Here’s a tidbit of startup history that may not be widely known outside of the tech firms...

Plinky is an app for you to collect and organize links easily

The internet is full of cool websites, and some of them are so interesting and useful, it’s...

WhatsApp’s latest update streamlines navigation and adds a ‘darker dark mode’

WhatsApp is updating its mobile apps for a fresh and more streamlined look, while also introducing a...

Google I/O 2024: How to watch

Google I/O kicks off on Tuesday with a 10 a.m. PT keynote. As ever, the presentation will...

Triomics raises $15M Series A to automate cancer clinical trials matching

For cancer patients, medicines administered in clinical trials can help save or extend lives. But despite thousands of...

Tesla drives Luminar lidar sales and Motional pauses robotaxi plans

Welcome back to TechCrunch Mobility — your central hub for news and insights on the future of...