Mistral releases new AI models optimized for edge devices


French AI startup Mistral has released its first generative AI models designed to be run on edge devices, like laptops and phones.

The new family of models, which Mistral is calling “Les Ministraux,” can be used or tuned for a variety of applications, from text generation to working in conjunction with more capable models to complete tasks.

There are two Les Ministraux models available, Ministral 3B and Ministral 8B, both of which have a context window of 128,000 tokens, meaning they can ingest roughly the length of a 50-page book.

“Our most innovative customers and partners have increasingly been asking for local, privacy-first inference for critical applications such as on-device translation, internet-less smart assistants, local analytics, and autonomous robotics,” Mistral writes in a blog post. “Les Ministraux were built to provide a compute-efficient and low-latency solution for these scenarios.”

The Les Ministraux models are available for download, but only for research purposes. Mistral is requiring developers and companies interested in self-deployment to contact it for a commercial license.

Otherwise, developers can use Ministral 3B and Ministral 8B on Mistral’s cloud platform. Ministral 8B costs 10 cents per million input or output tokens (a million tokens is roughly 750,000 words), while Ministral 3B costs 4 cents per million input or output tokens.
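To make those prices concrete, here is a rough back-of-the-envelope sketch of how a bill would scale with usage. It assumes the per-million-token rates quoted above apply equally to input and output tokens; the model keys and the `estimate_cost` helper are hypothetical names chosen for illustration, not part of Mistral’s API.

```python
# Prices in US dollars per million tokens, as quoted in the article.
# The dictionary keys are illustrative labels, not official model IDs.
PRICE_PER_MILLION_TOKENS = {
    "ministral-8b": 0.10,
    "ministral-3b": 0.04,
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in dollars, assuming one flat rate for input and output."""
    total_tokens = input_tokens + output_tokens
    return PRICE_PER_MILLION_TOKENS[model] * total_tokens / 1_000_000


if __name__ == "__main__":
    # Example: filling the full 128,000-token context and getting 2,000 tokens back.
    for model in PRICE_PER_MILLION_TOKENS:
        cost = estimate_cost(model, input_tokens=128_000, output_tokens=2_000)
        print(f"{model}: ${cost:.4f}")
```

Under these assumptions, a single request that fills the entire context window costs on the order of a cent on either model.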




Lisa Holden
Lisa Holden is a news writer for LinkDaddy News. She writes about health, sport, tech, and more. Some of her favorite topics include the latest trends in fitness and wellness, the best ways to use technology to improve your life, and the latest developments in medical research.
