Exclusive: Google’s Gemini is forcing contractors to rate AI responses outside their expertise

Generative AI may look like magic, but behind the development of these systems are armies of employees at companies like Google, OpenAI, and others, known as “prompt engineers” and analysts, who rate the accuracy of chatbots’ outputs to improve their AI.

But a new internal guideline passed down from Google to contractors working on Gemini, seen by TechCrunch, has led to concerns that Gemini could be more prone to spouting out inaccurate information on highly sensitive topics, like healthcare, to regular people.

To improve Gemini, contractors working with GlobalLogic, an outsourcing firm owned by Hitachi, are routinely asked to evaluate AI-generated responses according to factors like “truthfulness.”

These contractors were until recently able to “skip” certain prompts, and thus opt out of evaluating various AI-written responses to those prompts, if the prompt was way outside their domain expertise. For example, a contractor could skip a prompt that was asking a niche question about cardiology because the contractor had no scientific background.

But last week, GlobalLogic announced a change from Google that contractors are no longer allowed to skip such prompts, regardless of their own expertise.

Internal correspondence seen by TechCrunch shows that previously, the guidelines read: “If you do not have critical expertise (e.g. coding, math) to rate this prompt, please skip this task.”

But now the guidelines read: “You should not skip prompts that require specialized domain knowledge.” Instead, contractors are being told to “rate the parts of the prompt you understand” and include a note that they don’t have domain knowledge.

This has led to direct concerns about Gemini’s accuracy on certain topics, as contractors are sometimes tasked with evaluating highly technical AI responses about issues like rare diseases that they have no background in.

“I thought the point of skipping was to increase accuracy by giving it to someone better?” one contractor noted in internal correspondence, seen by TechCrunch.

Contractors can now only skip prompts in two cases: if they’re “completely missing information” like the full prompt or response, or if they contain harmful content that requires special consent forms to evaluate, the new guidelines show.

Google did not respond to TechCrunch’s requests for comment by press time.

Source link

Exclusive: Google’s Gemini is forcing contractors to rate AI responses outside their expertise

Recent posts

Why Marc Andreessen was ‘very scared’ after meeting with the Biden administration about AI

Elon Musk said he’s not interested in acquiring TikTok

Apple might release a $2,000 Vision headset next year

SailPoint’s dull debut did little to loosen the stuck IPO window, expert says

Benchmark is raising $170M for its latest partners-only fund

Marc Benioff warns that AI, while useful, is overhyped and partly blames Microsoft

Toyota’s next-generation cars will be built with Nvidia supercomputers and operating system

Ben Affleck tells actors and writers not to worry about AI

UnitedHealth confirms 190 million Americans affected by Change Healthcare data breach

xAI adds image understanding capabilities to Grok

First TikTok, now smart cars: How Biden’s new proposed ban will affect US automakers

Fearless Fund has made a big deal since settling its controversial lawsuit

Early Thanksgiving online sales numbers are up 7% YoY to $15.6B, on par with pre-pandemic trends

Meta’s Threads is developing its own take on Bluesky’s ‘Starter Packs’

Lyten buys battery manufacturing assets from beleaguered Northvolt

Related articles

Rivian will launch hands-off highway driver assist ‘in a few weeks’

Solar crushed 2024, but emissions were up as industry used more natural gas

Rivian inches closer to profitability but warns ‘changes to government policies’ could hurt

Your Android phone could have stalkerware. Here’s how to remove it

Beta Technologies’ bet on electric flight and Hyundai’s new Tesla charging port comes up short

Amazon shuts down Chime, its Zoom alternative

Cherryrock Capital raises new $172M fund from all-star investors to back diverse founders

Instagram’s new ad format lets creators get paid for testimonials in comments

Company

Follow us