Nvidia’s CEO defends his moat as AI labs change how they improve their AI models

Nvidia raked in more than $19 billion in net income during the last quarter, the company reported on Wednesday, but that did little to assure investors that its rapid growth would continue. On its earnings call, analysts prodded CEO Jensen Huang about how Nvidia would fare if tech companies start using new methods to improve their AI models.

The method that underpins OpenAI’s o1 model, known as “test-time scaling,” came up repeatedly. It’s the idea that AI models will give better answers if you give them more time and computing power to “think” through questions. Specifically, it adds more compute to the AI inference phase, which is everything that happens after a user hits enter on their prompt.

Nvidia’s CEO was asked whether he was seeing AI model developers shift over to these new methods, and how Nvidia’s older chips would work for AI inference.

Huang told investors that o1, and test-time scaling more broadly, could play a larger role in Nvidia’s business moving forward, calling it “one of the most exciting developments” and “a new scaling law.” Huang did his best to assure investors that Nvidia is well positioned for the change.

The Nvidia CEO’s remarks aligned with what Microsoft CEO Satya Nadella said onstage at a Microsoft event on Tuesday: o1 represents a new way for the AI industry to improve its models.

This is a big deal for the chip industry because it places a greater emphasis on AI inference. While Nvidia’s chips are the gold standard for training AI models, there’s a broad set of well-funded startups creating lightning-fast AI inference chips, such as Groq and Cerebras. It could be a more competitive space for Nvidia to operate in.

Despite recent reports that improvements in generative models are slowing, Huang told analysts that AI model developers are still improving their models by adding more compute and data during the pretraining phase.

Anthropic CEO Dario Amodei also said on Wednesday during an onstage interview at the Cerebral Valley summit in San Francisco that he is not seeing a slowdown in model development.

“Foundation model pretraining scaling is intact and it’s continuing,” Huang said on Wednesday. “As you know, this is an empirical law, not a fundamental physical law, but the evidence is that it continues to scale. What we’re learning, however, is that it’s not enough.”

That’s certainly what Nvidia investors wanted to hear, since the chipmaker’s stock has soared more than 180% in 2024 by selling the AI chips that OpenAI, Google, and Meta train their models on. However, Andreessen Horowitz partners and several other AI executives have previously said that these methods are already starting to show diminishing returns.

Huang noted that most of Nvidia’s computing workloads today are around the pretraining of AI models, not inference, but he attributed that more to where the AI world is today. He said that one day, there will simply be more people running AI models, meaning more AI inference will happen. Huang noted that Nvidia is the largest inference platform in the world today and that the company’s scale and reliability give it a huge advantage over startups.

“Our hopes and dreams are that someday, the world does a ton of inference, and that’s when AI has really succeeded,” said Huang. “Everybody knows that if they innovate on top of CUDA and Nvidia’s architecture, they can innovate more quickly, and they know that everything should work.”


Lisa Holden
