Anthropic’s Claude adds a prompt playground to quickly improve your AI apps


Prompt engineering became a hot job last year in the AI industry, but it seems Anthropic is now developing tools to at least partially automate it.

Anthropic released several new features on Tuesday to help developers create more useful applications with the startup’s language model, Claude, according to a company blog post. Developers can now use Claude 3.5 Sonnet to generate, test and evaluate prompts, using prompt engineering techniques to create better inputs and improve Claude’s answers for specialized tasks.

Language models are pretty forgiving when you ask them to perform some tasks, but sometimes small changes to the wording of a prompt can lead to big improvements in the results. Normally you’d have to figure out that wording yourself, or hire a prompt engineer to do it, but this new feature offers quick feedback that could make finding improvements easier.

The features are housed within Anthropic Console under a new Evaluate tab. Console is the startup’s test kitchen for developers, created to attract businesses looking to build products with Claude. One of the features, unveiled in May, is Anthropic’s built-in prompt generator; this takes a short description of a task and constructs a much longer, fleshed-out prompt using Anthropic’s own prompt engineering techniques. While Anthropic’s tools may not replace prompt engineers altogether, the company said they would help new users and save time for experienced prompt engineers.

Within Evaluate, developers can test how effective their AI application’s prompts are in a range of scenarios. Developers can upload real-world examples to a test suite or ask Claude to generate an array of test cases. They can then compare the effectiveness of various prompts side by side and rate sample answers on a five-point scale.

A prompt being fed generated data to find good and bad responses.
Image Credits: Anthropic

In an example from Anthropic’s blog post, a developer identified that their application was giving answers that were too short across several test cases. The developer was able to tweak a line in their prompt to make the answers longer, and apply the change to all their test cases at once. That could save developers lots of time and effort, especially those with little or no prompt engineering experience.
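To make that workflow concrete, here is a minimal Python sketch of running two prompt variants over the same test suite and comparing average scores on a five-point scale. It is an illustration only and does not use Anthropic Console or any Anthropic API: `fake_model`, `rate_length`, `evaluate`, the prompts and the test questions are all hypothetical stand-ins, and the length-based grader is an assumption that replaces the human ratings the article describes.

```python
from statistics import mean

# Hypothetical stand-in for a language model call (an assumption, not a real API).
# It returns a longer answer only when the prompt asks for detail.
def fake_model(prompt: str) -> str:
    if "detail" in prompt.lower():
        return "A longer answer with several supporting sentences. " * 3
    return "Short answer."

# Toy grader (an assumption): map the answer's word count onto a 1-to-5 score.
# In the workflow described above, a developer would assign these ratings.
def rate_length(answer: str) -> int:
    words = len(answer.split())
    return max(1, min(5, 1 + words // 5))

# Run one prompt template against every test case and average the scores.
def evaluate(template: str, cases: list[str], model=fake_model) -> float:
    scores = [rate_length(model(template.format(question=q))) for q in cases]
    return mean(scores)

# Hypothetical test suite and two prompt variants differing by one tweak.
TEST_CASES = [
    "How do I reset my password?",
    "What is your refund policy?",
    "Can I change my delivery address?",
]
PROMPT_A = "Answer the question: {question}"
PROMPT_B = "Answer the question in detail: {question}"

if __name__ == "__main__":
    for name, template in [("A", PROMPT_A), ("B", PROMPT_B)]:
        print(f"Prompt {name}: average score {evaluate(template, TEST_CASES)}")
```

Because both variants run against the same cases, a single-line change to the prompt can be judged across the whole suite at once rather than one example at a time.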

In an interview from Google Cloud Next earlier this year, Anthropic CEO and co-founder Dario Amodei said prompt engineering was one of the most important factors in widespread enterprise adoption of generative AI. “It sounds simple, but 30 minutes with a prompt engineer can often make an application work when it wasn’t before,” said Amodei.




Lisa Holden
Lisa Holden is a news writer for LinkDaddy News. She writes health, sport, tech, and more. Some of her favorite topics include the latest trends in fitness and wellness, the best ways to use technology to improve your life, and the latest developments in medical research.
