NIST releases a tool for testing AI model risk

The National Institute of Standards and Technology (NIST), the U.S. Commerce Department agency that develops and tests tech for the U.S. government, companies and the broader public, has re-released a testbed designed to measure how malicious attacks — particularly attacks that “poison” AI model training data — might degrade the performance of an AI system.

Called Dioptra (after the classical astronomical and surveying instrument), the modular, open source web-based tool, first released in 2022, seeks to help companies training AI models — and the people using these models — assess, analyze and track AI risks. Dioptra can be used to benchmark and research models, NIST says, as well as to provide a common platform for exposing models to simulated threats in a “red-teaming” environment.

“Testing the effects of adversarial attacks on machine learning models is one of the goals of Dioptra,” NIST wrote in a press release. “The open source software, like generating child available for free download, could help the community, including government agencies and small to medium-sized businesses, conduct evaluations to assess AI developers’ claims about their systems’ performance.”

A screenshot of Diatropa’s interface.

Image Credits: NIST

Dioptra debuted alongside documents from NIST and NIST’s recently created AI Safety Institute that lay out ways to mitigate some of the dangers of AI, like how it can be abused to generate nonconsensual pornography. It follows the launch of the U.K. AI Safety Institute’s Inspect, a toolset similarly aimed at assessing the capabilities of models and overall model safety. The U.S. and U.K. have an ongoing partnership to jointly develop advanced AI model testing, announced at the U.K.’s AI Safety Summit in Bletchley Park in November of last year.

Dioptra is also the product of President Joe Biden’s executive order (EO) on AI, which mandates (among other things) that NIST help with AI system testing. The EO, relatedly, also establishes standards for AI safety and security, including requirements for companies developing models (e.g. Apple) to notify the federal government and share results of all safety tests before they’re deployed to the public.

As we’ve written about before, AI benchmarks are hard — not least of which because the most sophisticated AI models today are black boxes whose infrastructure, training data and other key details are kept under wraps by the companies creating them. A report out this month from the Ada Lovelace Institute, a U.K.-based nonprofit research institute that studies AI, found that evaluations alone aren’t sufficient to determine the real-world safety of an AI model in part because current policies allow AI vendors to selectively choose which evaluations to conduct.

NIST doesn’t assert that Dioptra can completely de-risk models. But the agency does propose that Dioptra can shed light on which sorts of attacks might make an AI system perform less effectively and quantify this impact to performance.

In a major limitation, however, Dioptra only works out-of-the-box on models that can be downloaded and used locally, like Meta’s expanding Llama family. Models gated behind an API, such as OpenAI’s GPT-4o, are a no-go — at least for the time being.

Source link

NIST releases a tool for testing AI model risk

Recent posts

Transport for London outages drag into weekend after cyberattack

Privacy app maker Proton transitions to non-profit foundation structure

Mitti Labs aims to make rice farming less harmful to the climate, starting in India

Apple signs the White House’s commitment to AI safety

Kiteworks captures $456M at a $1B+ valuation to help secure sensitive data

India’s Zomato to raise $1B ahead of rival Swiggy IPO

Fintech shutdowns, Klarna’s move into banking and which companies are hiring

Moxie, which helps nurses launch medspas, raises a preemptive Series B from Lachy Groom

Forestay, Europe’s newest $220M growth-stage VC fund, will focus on AI

With this latest deal, Flipboard looks to build a news ecosystem beyond X

Microsoft’s Mustafa Suleyman says he loves Sam Altman, believes he’s sincere about AI safety

VW taps Rivian in $5B EV deal and the fight over Fisker’s assets

Elon Musk’s X still struggles to grow subscription revenue

Mark Zuckerberg says he’s done apologizing

This Week in AI: The fate of generative AI is in the courts’ hands

Related articles

WhatsApp rolls out voice message transcripts

Threads adjusts its algorithm to show you more content from accounts you follow

Spotify tests a video feature for audiobooks as it ramps up video expansion

Candela brings its P-12 electric ferry to Tahoe and adds another $14M to build more

OneRail’s software helps solve the last-mile delivery problem

Bill to ban social media use by under-16s arrives in Australia’s parliament

Lighthouse, an analytics provider for the hospitality sector, lights up with $370M at a $1B valuation

DOJ: Google must sell Chrome to end monopoly

Company

Follow us