TechCrunch Minute: How Anthropic found a trick to get AI to give you answers it’s not supposed to

Date:

Share post:


If you build it, people will try to break it. Sometimes even the people building stuff are the ones breaking it. Such is the case with Anthropic and its latest research which demonstrates an interesting vulnerability in current LLM technology. More or less if you keep at a question, you can break guardrails and wind up with large language models telling you stuff that they are designed not to. Like how to build a bomb.

Of course given progress in open-source AI technology, you can spin up your own LLM locally and just ask it whatever you want, but for more consumer-grade stuff this is an issue worth pondering. What’s fun about AI today is the quick pace it is advancing, and how well — or not — we’re doing as a species to better understand what we’re building.

If you’ll allow me the thought, I wonder if we’re going to see more questions and issues of the type that Anthropic outlines as LLMs and other new AI model types get smarter, and larger. Which is perhaps repeating myself. But the closer we get to more generalized AI intelligence, the more it should resemble a thinking entity, and not a computer that we can program, right? If so, we might have a harder time nailing down edge cases to the point when that work becomes unfeasible? Anyway, let’s talk about what Anthropic recently shared.



Source link

Lisa Holden
Lisa Holden
Lisa Holden is a news writer for LinkDaddy News. She writes health, sport, tech, and more. Some of her favorite topics include the latest trends in fitness and wellness, the best ways to use technology to improve your life, and the latest developments in medical research.

Recent posts

Related articles

Senate study proposes ‘at least’ $32B yearly for AI programs

A long-running working group in the Senate has issued its policy recommendation for federal funding for AI:...

FBI seizes hacking forum BreachForums — again

The FBI along with a coalition of international law enforcement agencies seized the notorious cybercrime forum BreachForums...

Netflix to take on Google and Amazon by building its own ad server

Netflix announced during its Upfronts presentation on Wednesday that it’s launching its own advertising technology platform only...

Matt Garman taking over as CEO with AWS at crossroads

It’s tough to say that a $100 billion business finds itself at a critical juncture, but that’s...

Google still hasn’t fixed Gemini’s biased image generator

Back in February, Google paused its AI-powered chatbot Gemini’s ability to generate images of people after users complained of...

Google’s call-scanning AI could dial up censorship by default, privacy experts warn

A feature Google demoed at its I/O confab yesterday, using its generative AI technology to scan voice...

The top AI announcements from Google I/O

Google’s going all in on AI — and it wants you to know it. During the company’s...

Uber has a new way to solve the concert traffic problem

Uber is taking a shuttle product it developed for commuters in India and Egypt and converting it...