OpenAI finds that GPT-4o does some truly bizarre stuff sometimes

OpenAI’s GPT-4o, the generative AI model that powers the recently launched alpha of Advanced Voice Mode in ChatGPT, is the company’s first trained on voice as well as text and image data. And that leads it to behave in strange ways, sometimes — like mimicking the voice of the person speaking to it or randomly shouting in the middle of a conversation.

In a new “red teaming” report documenting probes of the model’s strengths and risks, OpenAI reveals some of GPT-4o’s odder quirks, like the aforementioned voice cloning. In rare instances — particularly when a person’s talking to GPT-4o in a “high background noise environment,” like a car on the road — GPT-4o will “emulate the user’s voice,” OpenAI says. Why? Well, OpenAI chalks it up to the model struggling to understand malformed speech. Fair enough!

Listen to how it sounds in the sample below (from the report). Weird, right?

To be clear, GPT-4o isn’t doing this now — at least not in Advanced Voice Mode. An OpenAI spokesperson tells TechCrunch the company added a “system-level mitigation” for the behavior.

GPT-4o is also prone to generating unsettling or inappropriate “nonverbal vocalizations” and sound effects, like erotic moans, violent screams and gunshots, when prompted in specific ways. OpenAI says there’s evidence to suggest that the model generally refuses requests to generate sound effects, but acknowledges that some requests do indeed make it through.

GPT-4o might also infringe on music copyright, or rather it would, had OpenAI not implemented filters to prevent this. In the report, OpenAI says that it instructed GPT-4o not to sing for the limited alpha of Advanced Voice Mode, presumably to avoid copying the style, tone and/or timbre of recognizable artists.

This implies, but doesn’t outright confirm, that OpenAI trained GPT-4o on copyrighted material. It’s unclear whether OpenAI intends to lift the restrictions when Advanced Voice Mode rolls out to more users in the fall, as previously announced.

“To account for GPT-4o’s audio modality, we updated certain text-based filters to work on audio conversations [and] built filters to detect and block outputs containing music,” OpenAI writes in the report. “We trained GPT-4o to refuse requests for copyrighted content, including audio, consistent with our broader practices.”

Worth noting is that OpenAI has recently said it would be “impossible” to train today’s leading models without using copyrighted materials. While the company has a number of licensing deals in place with data providers, it also maintains that fair use is a reasonable defense against accusations that it trains on IP-protected data, including things like songs, without permission.

The red teaming report — for what it’s worth, given OpenAI’s horses in the race — does paint a picture overall of an AI model that’s been made safer by various mitigations and safeguards. GPT-4o refuses to identify people based on how they’re speaking, for example, and declines to answer loaded questions like “how intelligent is this speaker?” It also blocks prompts for violent and sexually charged language and disallows certain categories of content, like discussions relating to extremism and self-harm, altogether.
