Meta unveils biggest Llama 3 AI model, touting language and math gains

By Katie Paul

NEW YORK (Reuters) – Meta Platforms released the biggest version of its mostly free Llama 3 artificial intelligence models on Tuesday, boasting multilingual skills and general performance metrics that nip at the heels of paid models from rivals like OpenAI.

The new Llama 3 model can converse in eight languages, write higher-quality computer code and solve more complex math problems than previous versions, the Facebook parent company said in blog posts and a research paper announcing the release.

Its 405 billion parameters, or variables that the algorithm takes into account to generate responses to user queries, dwarfs the previous version released last year though is still smaller than leading models offered by competitors.

OpenAI’s GPT-4 model, by contrast, is reported to have one trillion parameters and Amazon is investing in a model with 2 trillion parameters.

The release comes as tech companies are racing to show that their growing portfolios of resource-hungry large language models can deliver significant enough gains in known problem areas such as advanced reasoning to justify the gargantuan sums that have been invested in them.

In addition to its flagship 405 billion parameter model, Meta is also releasing updated versions of its lighter-weight 8 billion and 70 billion parameter Llama 3 models initially introduced in the spring, the company said.

All three new models are multilingual and can handle larger user requests via an expanded “context window,” which Meta’s head of generative AI, Ahmad Al-Dahle, said would improve the experience of generating computer code in particular.

“That was the number one feedback we got from the community,” Al-Dahle told Reuters in an interview, noting that bigger context windows give the models something akin to a longer memory that aids in processing multi-step requests.

Meta releases its Llama models largely free-of-charge for use by developers, a strategy Chief Executive Mark Zuckerberg says will pay off in the form of innovative products and greater engagement on the company’s core social networks. Some investors have raised their eyebrows at the costs entailed, however.

The company also stands to gain if developers opt to use its free models over paid ones, which would undercut the business models of its rivals. With its announcement, Meta touted gains on key math and knowledge tests that may make that prospect more appealing.

Although progress on AI development is notoriously difficult to measure, test results provided by Meta appeared to suggest that its largest Llama 3 model was nearly matching and in some cases besting Anthropic’s Claude 3.5 Sonnet and OpenAI’s GPT-4o, which are widely regarded as the two most powerful frontier models on the market.

On the MATH benchmark of competition level math word problems, for example, Meta’s model posted a score of 73.8, compared to GPT-4o’s 76.6 and Claude 3.5 Sonnet’s 71.1.

The model scored 88.6 on MMLU, a benchmark that covers dozens of subjects across math, science and the humanities, while GPT-4o scored 88.7 and Claude 3.5 Sonnet scored 88.3.

In their paper, Meta researchers also teased upcoming “multimodal” versions of the models due out later this year that layer image, video and speech capabilities on top of the core Llama 3 text model.

Early experiments indicate those models can perform “competitively” with other multimodal models such as Google’s Gemini 1.5 and Anthropic’s Claude 3.5 Sonnet, they said.

(Reporting by Katie Paul, Editing by Louise Heavens)

Source link

Meta unveils biggest Llama 3 AI model, touting language and math gains

Recent posts

Why Intel Stock Popped on Friday

Nearly half of dementia cases could be avoided or delayed by tackling 14 risk factors

20 Funny Pet Tweets That Prove They're Saints Until Their Owners Mess Something Up

US arrests of Mexican drug lords could bring fresh charges in home country

Susan Smith furious at parole denial, 30 years after killing kids: Insider | Banfield

Rivian Just Forecast This Important Metric Will Turn Positive. Is Now the Time to Buy the Stock?

Nurses Are Confessing Their Most Closely Guarded Secrets That Patients Don't Know

50 "Before And After" Photos That Will Completely And Totally Change Your Perspective On The World

CNN commentator Scott Jennings joins Los Angeles Times editorial board after owner calls for more conservative voices

Scientists hopeful after sighting of rare whale not seen in over a century: 'Such an unbelievable occurrence'

50 Pictures That Prove The American Education System Is 100% Totally And Completely Doomed

Japanese researchers test pioneering drug to regrow teeth

'1 in 100 million': Watch as beautiful, rare, cotton candy lobster explores new home

Man gets 226-year prison sentences for killing 2 Alaska Native women. He filmed the torture of one

There’s finally a retro PC emulator on the App Store

Related articles

I'm A Colorectal Cancer Doctor — Here Are 5 Things I'd Never, Ever Do

New York's governor orders firing of prison staffers involved in inmate's fatal beating

Ten Palestinians killed in airstrikes on houses in central Gaza, medics say

AP Top Stories December 21 P

What your peeing frequency can say about your health

Ex-OpenAI engineer who raised legal concerns about the technology he helped build has died

Thousands Attend 'Liberation Festival' in Post-Assad Aleppo

Christmas market becomes scene of horror after fatal gap left in security bollards

Company

Follow us