Mistral releases Codestral, its first generative AI model for code

Date:

Share post:


Mistral, the French AI startup backed by Microsoft and valued at $6 billion, has released its first generative AI model for coding, dubbed Codestral.

Like other code-generating models, Codestral is designed to help developers write and interact with code. It was trained on over 80 programming languages, including Python, Java, C++ and JavaScript, explains Mistral in a blog post. Codestral can complete coding functions, write tests and “fill in” partial code, as well as answer questions about a codebase in English.

Mistral describes the model as “open,” but that’s up for debate. The startup’s license prohibits the use of Codestral and its outputs for any commercial activities. There’s a carve-out for “development,” but even that has caveats: The license goes on to explicitly ban “any internal usage by employees in the context of the company’s business activities.”

The reason could be that Codestral was trained partly on copyrighted content. Mistral didn’t confirm or deny this in the blog post, but it wouldn’t be surprising; there’s evidence that the startup’s previous training datasets contained copyrighted data.

Codestral might not be worth the trouble, in any case. At 22 billion parameters, the model requires a beefy PC in order to run. (Parameters essentially define the skill of an AI model on a problem, like analyzing and generating text.) And while it beats the competition according to some benchmarks (which, as we know, are unreliable), it’s hardly a blowout.

Image Credits: Mistral

While impractical for most developers and incremental in terms of performance improvements, Codestral is sure to fuel the debate over the wisdom of relying on code-generating models as programming assistants.

Developers are certainly embracing generative AI tools for at least some coding tasks. In a Stack Overflow poll from June 2023, 44% of developers said that they use AI tools in their development process now while 26% plan to soon. Yet these tools have obvious flaws.

An analysis of more than 150 million lines of code committed to project repos over the past several years by GitClear found that generative AI dev tools are resulting in more mistaken code being pushed to codebases. Elsewhere, security researchers have warned that such tools can amplify existing bugs and security issues in software projects; over half of the answers OpenAI’s ChatGPT gives to programming questions are wrong, according to a study from Purdue.

That won’t stop companies like Mistral and others from attempting to monetize (and gain mindshare with) their models. This morning, Mistral launched a hosted version of Codestral on its Le Chat conversational AI platform as well as its paid API. Mistral says it’s also worked to build Codestral into app frameworks and development environments like LlamaIndex, LangChain, Continue.dev and Tabnine.



Source link

Lisa Holden
Lisa Holden
Lisa Holden is a news writer for LinkDaddy News. She writes health, sport, tech, and more. Some of her favorite topics include the latest trends in fitness and wellness, the best ways to use technology to improve your life, and the latest developments in medical research.

Recent posts

Related articles

Spain’s exposure to climate change helps Madrid-based VC, Seaya, close €300M climate-tech fund

According to a recent Dealroom report on the Spanish tech ecosystem, the combined enterprise value of Spanish...

Forestay, Europe’s newest $220M growth-stage VC fund, will focus on AI

Forestay, an emerging VC based out of Geneva, Switzerland has been busy. This week it closed its...

A year later, what Threads could learn from other social networks

Threads, Meta’s alternative to Twitter, just celebrated its first birthday. After launching on July 5 last year,...

J2 Ventures, focused on military healthcare, grabs $150M for its second fund

J2 Ventures, a firm led mostly by the U.S. military veterans, announced on Thursday that it has...

HealthEquity says data breach is an ‘isolated incident’

On Tuesday, health tech services provider HealthEquity disclosed in a filing with federal regulators that it had...

Roll20, an online tabletop role-playing game platform, discloses data breach

The popular online tabletop and role-playing game platform Roll20 announced on Wednesday that it had suffered a...

Fizz, the anonymous Gen Z social app, adds a marketplace for college students

Teddy Solomon just moved to a new house in Palo Alto, so he turned to the Stanford...

Deep tech VC Sidney Scott explains why he’s closing his firm as this area booms

Sidney Scott decided to take himself out of the venture capital rat race and is now jokingly...