Anthropic publishes the ‘system prompts’ that make Claude tick

Generative AI models aren’t actually human-like. They have no intelligence or personality — they’re simply statistical systems predicting the likeliest next words in a sentence. But like interns at a tyrannical workplace, they do follow instructions without complaint — including initial “system prompts” that prime the models with their basic qualities, and what they should and shouldn’t do.

Every generative AI vendor, from OpenAI to Anthropic, uses system prompts to prevent (or at least try to prevent) models from behaving badly, and to steer the general tone and sentiment of the models’ replies. For instance, a prompt might tell a model it should be polite but never apologetic, or to be honest about the fact that it can’t know everything.

But vendors usually keep system prompts close to the chest — presumably for competitive reasons, but also perhaps because knowing the system prompt may suggest ways to circumvent it. The only way to expose GPT-4o‘s system prompt, for example, is through a prompt injection attack. And even then, the system’s output can’t be trusted completely.

However, Anthropic, in its continued effort to paint itself as a more ethical, transparent AI vendor, has published the system prompts for its latest models (Claude 3.5 Opus, Sonnet and Haiku) in the Claude iOS and Android apps and on the web.

Alex Albert, head of Anthropic’s developer relations, said in a post on X that Anthropic plans to make this sort of disclosure a regular thing as it updates and fine-tunes its system prompts.

We’ve added a new system prompts release notes section to our docs. We’re going to log changes we make to the default system prompts on Claude dot ai and our mobile apps. (The system prompt does not affect the API.) pic.twitter.com/9mBwv2SgB1

— Alex Albert (@alexalbert__) August 26, 2024

The latest prompts, dated July 12, outline very clearly what the Claude models can’t do — e.g. “Claude cannot open URLs, links, or videos.” Facial recognition is a big no-no; the system prompt for Claude 3.5 Opus tells the model to “always respond as if it is completely face blind” and to “avoid identifying or naming any humans in [images].”

But the prompts also describe certain personality traits and characteristics — traits and characteristics that Anthropic would have the Claude models exemplify.

The prompt for Opus, for instance, says that Claude is to appear as if it “[is] very smart and intellectually curious,” and “enjoys hearing what humans think on an issue and engaging in discussion on a wide variety of topics.” It also instructs Claude to treat controversial topics with impartiality and objectivity, providing “careful thoughts” and “clear information” — and never to begin responses with the words “certainly” or “absolutely.”

It’s all a bit strange to this human, these system prompts, which are written like an actor in a stage play might write a character analysis sheet. The prompt for Opus ends with “Claude is now being connected with a human,” which gives the impression that Claude is some sort of consciousness on the other end of the screen whose only purpose is to fulfill the whims of its human conversation partners.

But of course that’s an illusion. If the prompts for Claude tell us anything, it’s that without human guidance and hand-holding, these models are frighteningly blank slates.

With these new system prompt changelogs — the first of their kind from a major AI vendor — Anthropic’s exerting pressure competitors to publish the same. We’ll have see if the gambit works.

Source link

Anthropic publishes the ‘system prompts’ that make Claude tick

Recent posts

French biotech Generare speeds up hunt for new drugs by cloning natural molecules

SpaceX alums find traction on Earth with their Mars-inspired CO2-to-fuel tech

US government urges high-ranking officials to lock down mobile devices following telecom breaches

Affirm launches in the UK, as ‘buy now, pay later’ market faces regulatory overhaul

Moxie, which helps nurses launch medspas, raises a preemptive Series B from Lachy Groom

Instagram’s latest feature is a digital business card for your profile

Londoners will soon see drones ferrying blood between hospitals

Audible recruits voice actors to train audiobook-generating AI

X rolls out its real-time search tool, Radar, to Premium+ subscribers

Nodal connects hopeful parents with surrogates as reproductive freedom hangs in limbo

Apple Intelligence is coming to the EU in April 2025

Pavel Durov says Telegram is now profitable

YouTube Shorts’ collaborative Add Yours sticker is now available to all users

Bluesky tops app charts and sees ‘all-time-highs’ after Brazil bans X

Glint Solar grabs $8M to help accelerate solar energy adoption across Europe

Related articles

HPE investigating security breach after hacker claims theft of sensitive data

MoneyHash, which provides single access to payment services in MENA, banks $5.2M

Karmen secures $9.4 million for its revenue-based financing products

President Trump signs exec order to make Musk’s DOGE commission more official

Trump signs exec order delaying TikTok enforcement action for 75 days

President Trump repeals Biden’s AI executive order

UK to unveil ‘Humphrey’ assistant for civil servants with other AI plans to cut bureaucracy

OpenAI’s agent tool may be nearing release

Company

Follow us