NIST releases a tool for testing AI model risk

The National Institute of Standards and Technology (NIST), the U.S. Commerce Department agency that develops and tests tech for the U.S. government, companies and the broader public, has re-released a testbed designed to measure how malicious attacks — particularly attacks that “poison” AI model training data — might degrade the performance of an AI system.

Called Dioptra (after the classical astronomical and surveying instrument), the modular, open source web-based tool, first released in 2022, seeks to help companies training AI models — and the people using these models — assess, analyze and track AI risks. Dioptra can be used to benchmark and research models, NIST says, as well as to provide a common platform for exposing models to simulated threats in a “red-teaming” environment.

“Testing the effects of adversarial attacks on machine learning models is one of the goals of Dioptra,” NIST wrote in a press release. “The open source software, available for free download, could help the community, including government agencies and small to medium-sized businesses, conduct evaluations to assess AI developers’ claims about their systems’ performance.”

A screenshot of Dioptra’s interface.

Image Credits: NIST

Dioptra debuted alongside documents from NIST and NIST’s recently created AI Safety Institute that lay out ways to mitigate some of the dangers of AI, like how it can be abused to generate nonconsensual pornography. It follows the launch of the U.K. AI Safety Institute’s Inspect, a toolset similarly aimed at assessing the capabilities of models and overall model safety. The U.S. and U.K. have an ongoing partnership to jointly develop advanced AI model testing, announced at the U.K.’s AI Safety Summit in Bletchley Park in November of last year.

Dioptra is also the product of President Joe Biden’s executive order (EO) on AI, which mandates (among other things) that NIST help with AI system testing. The EO, relatedly, also establishes standards for AI safety and security, including requirements for companies developing models (e.g. Apple) to notify the federal government and share results of all safety tests before the models are deployed to the public.

As we’ve written about before, AI benchmarks are hard, not least because the most sophisticated AI models today are black boxes whose infrastructure, training data and other key details are kept under wraps by the companies creating them. A report out this month from the Ada Lovelace Institute, a U.K.-based nonprofit research institute that studies AI, found that evaluations alone aren’t sufficient to determine the real-world safety of an AI model, in part because current policies allow AI vendors to selectively choose which evaluations to conduct.

NIST doesn’t assert that Dioptra can completely de-risk models. But the agency does propose that Dioptra can shed light on which sorts of attacks might make an AI system perform less effectively and quantify the impact on performance.

In a major limitation, however, Dioptra only works out-of-the-box on models that can be downloaded and used locally, like Meta’s expanding Llama family. Models gated behind an API, such as OpenAI’s GPT-4o, are a no-go — at least for the time being.
