Meta execs obsessed over beating OpenAI's GPT-4 internally, court filings reveal

Date:

Share post:


Executives and researchers leading Meta’s AI efforts obsessed over beating OpenAI’s GPT-4 model while developing Llama 3, according to internal messages unsealed by a court on Tuesday in one of the company’s ongoing AI copyright cases, Kadrey v. Meta.

“Honestly… Our goal needs to be GPT-4,” said Meta’s VP of Generative AI, Ahmad Al-Dahle, in an October 2023 message to Meta researcher Hugo Touvron. “We have 64k GPUs coming! We need to learn how to build frontier and win this race.”

Though Meta releases open AI models, the company’s AI leaders were far more focused on beating competitors that don’t typically release their model’s weights, like Anthropic and OpenAI, and instead gate them behind an API. Meta’s execs and researchers held up Anthropic’s Claude and OpenAI’s GPT-4 as a gold standard to work toward.

The French AI startup Mistral, one of the biggest open competitors to Meta, was mentioned several times in the internal messages, but the tone was dismissive.

“Mistral is peanuts for us,” Al-Dahle said in a message. “We should be able to do better,” he said later.

Tech companies are racing to upstage each other with cutting-edge AI models these days, but these court filings reveal just how competitive Meta’s AI leaders truly were – and seemingly still are. At several points in the message exchanges, Meta’s AI leads talked about how they were “very aggressive” in obtaining the right data to train Llama; at one point, an exec even said that “Llama 3 is literally all I care about,” in a message to coworkers.

Prosecutors in this case allege that Meta’s executives occasionally cut corners in their mad race to shipping AI models, training on copyrighted books in the process.

Touvron noted in a message that the mix of datasets used for Llama 2 “was bad,” and talked about how Meta could use a better mix of data sources to improve Llama 3. Touvron and Al-Dahle then talked about clearing the path to use the LibGen dataset, which contains copyrighted works from Cengage Learning, Macmillan Learning, McGraw Hill, and Pearson Education.

“Do we have the right datasets in there[?]” said Al-Dahle. “Is there anything you wanted to use but couldn’t for some stupid reason?”

Meta CEO Mark Zuckerberg has previously said he’s trying to close the performance gap between Llama’s AI models and closed models from OpenAI, Google, and others. The internal messages reveal the intense pressure within the company to do so.



Source link

Alexandra Williams
Alexandra Williams
Alexandra Williams is a writer and editor. Angeles. She writes about politics, art, and culture for LinkDaddy News.

Recent posts

Related articles

Las Vegas woman allegedly drugged, fleeced older men she met on dating apps

A Las Vegas woman accused of luring men via dating sites, drugging them, and taking money was...

Shooting in south Tallahassee injures person, but 'nobody's talking' to police

A shooting in south Tallahassee has left one man with a bullet wound and police with few...

Author of Upcoming Elon Musk Biography Says ‘There Is No Evidence’ Billionaire Has Any ‘Intellectual Achievements’

Attorney, journalist, and Elon Musk biographer Seth Abramson eviscerated both Elon Musk and his “fanboys” who have...

Endangered bighorn sheep spotted in downtown Palm Springs. Here's what to do if they return

Things you may spot in downtown Palm Springs: Tourists and local residents strolling along Palm Canyon Drive,...

Trump fires chairman of the Joint Chiefs of Staff

WASHINGTON (AP) — President Donald Trump abruptly fired Air Force Gen. CQ Brown Jr. as chairman of...

Negotiations remain fragile as the future of Gaza is under debate | Elizabeth Vargas Reports

There are concerns that the second phase of the ceasefire agreement between Hamas and Israel may not...

Delta Passenger Who Survived Upside Down Crash Landing in Toronto Sues Airline: 'Drenched With Jet Fuel'

Eighty people aboard a Delta aircraft survived when their flight crash-landed at Toronto Pearson International Airport on...

Woman accused of drugging and robbing older men in a deadly romance scheme

LAS VEGAS (AP) — A woman used online dating apps to lure at least four older men...