Anthropic’s Claude adds a prompt playground to quickly improve your AI apps

Date:

Share post:


Prompt engineering became a hot job last year in the AI industry, but it seems Anthropic is now developing tools to at least partially automate it.

Anthropic released several new features on Tuesday to help developers create more useful applications with the startup’s language model, Claude, according to a company blog post. Developers can now use Claude 3.5 Sonnet to generate, test and evaluate prompts, using prompt engineering techniques to create better inputs and improve Claude’s answers for specialized tasks.

Language models are pretty forgiving when you ask them to perform some tasks, but sometimes small changes to the wording of a prompt can lead to big improvements in the results. Normally you’d have to figure out that wording yourself, or hire a prompt engineer to do it, but this new feature offers quick feedback that could make finding improvements easier.

The features are housed within Anthropic Console under a new Evaluate tab. Console is the startup’s test kitchen for developers, created to attract businesses looking to build products with Claude. One of the features, unveiled in May, is Anthropic’s built-in prompt generator; this takes a short description of a task and constructs a much longer, fleshed out prompt, utilizing Anthropic’s own prompt engineering techniques. While Anthropic’s tools may not replace prompt engineers altogether, the company said it would help new users, and save time for experienced prompt engineers.

Within Evaluate, developers can test how effective their AI application’s prompts are in a range of scenarios. Developers can upload real-world examples to a test suite or ask Claude to generate an array of AI-generated test cases. Developers can then compare how effective various prompts are side-by-side, and rate sample answers on a five-point scale.

A prompt being fed generated data to find good and bad responses.
Image Credits: Anthropic

In an example from Anthropic’s blog post, a developer identified that their application was giving answers that were too short across several test cases. The developer was able to tweak a line in their prompt to make the answers longer, and apply it simultaneously to all their test cases. That could save developers lots of time and effort, especially ones with little or no prompt engineering experience.

Anthropic CEO and co-founder Dario Amodei said prompt engineering was one of the most important things for widespread enterprise adoption of generative AI in an interview from Google Cloud Next earlier this year. “It sounds simple, but 30 minutes with a prompt engineer can often make an application work when it wasn’t before,” said Amodei.



Source link

Lisa Holden
Lisa Holden
Lisa Holden is a news writer for LinkDaddy News. She writes health, sport, tech, and more. Some of her favorite topics include the latest trends in fitness and wellness, the best ways to use technology to improve your life, and the latest developments in medical research.

Recent posts

Related articles

India weighs easing market share limits for UPI payment operators

The governing body overseeing India’s popular UPI payments rail is considering easing its proposed market share cap...

Palmer Luckey returns to headsets as Anduril partners with Microsoft on U.S. military tech

Palmer Luckey, the Hawaiian-shirt wearing founder who sold Oculus VR for $2 billion before co-founding the military...

CEO of self-driving startup Motional is stepping down

Motional, the autonomous vehicle startup backed by Hyundai, is shaking up its leadership ranks. Karl Iagnemma, an...

Craig Newmark pledges $100M to fight hacking by foreign governments

Craigslist founder Craig Newmark plans to donate $100 million to further strengthen U.S. cybersecurity, addressing what he...

Bluesky addresses trust and safety concerns around abuse, spam, and more

Social networking startup Bluesky, which is building a decentralized alternative to X (formerly Twitter), offered an update...

Fal.ai, which hosts media-generating AI models, raises $23M from a16z and others

Fal.ai, a dev-focused platform for AI-generated audio, video, and images, today revealed that it’s raised $23 million...

Bill requiring AM radio in new cars gets closer to law

A House committee overwhelmingly voted to approve a bill that would require new cars to be built...

HTC takes on Apple’s Vision Pro and PC Gaming with $1,000 Vive Focus Vision

TechCrunch spent some time with the $1,119 Vive XR Elite portable headset that had Meta’s Quest Pro...