Chinese AI company MiniMax releases new models it claims are competitive with the industry’s best

Date:

Share post:


Chinese firms continue to release AI models that rival the capabilities of systems developed by OpenAI and other U.S.-based AI companies.

This week, MiniMax, an Alibaba- and Tencent-backed startup that has raised around $850 million in venture capital and is valued at more than $2.5 billion, debuted three new models: MiniMax-Text-01, MiniMax-VL-01, and T2A-01-HD. MiniMax-Text-01 is a text-only model, while MiniMax-VL-01 can understand both images and text. T2A-01-HD, meanwhile, generates audio — specifically speech.

MiniMax claims that MiniMax-Text-01, which is 456 billion parameters in size, performs better than models such as Google’s recently unveiled Gemini 2.0 Flash on benchmarks like MATH and SimpleQA, which measure the ability of a model to answer math problems and fact-based questions. Parameters roughly correspond to a model’s problem-solving skills, and models with more parameters generally perform better than those with fewer parameters.

As for MiniMax-VL-01, MiniMax says that it rivals Anthropic’s Claude 3.5 Sonnet on evaluations that require multimodal understanding, like ChartQA, which tasks models with answering graph- and diagram-related queries (e.g. “What is the peak value of the orange line in this graph?”). Granted, MiniMax-VL-01 doesn’t quite best Gemini 2.0 Flash on many of these tests. OpenAI’s GPT-4o and Meta’s Llama 3.1 beat it on several, as well.

Of note, MiniMax-Text-01 has an extremely large context window. A model’s context, or context window, refers to input (for example, text) that a model considers before generating output (additional text). With a context window of 4 million tokens, MiniMax-Text-01 can analyze around 3 million words in one go — or just over five copies of “War and Peace.”

For context (no pun intended), MiniMax-Text-01’s context window is roughly 31 times the size of GPT-4o’s and Llama 3.1’s.

The last of MiniMax’s models released this week, T2A-01-HD, is an audio generator optimized for speech. T2A-01-HD can generate a synthetic voice with adjustable cadence, tone, and tenor in around 17 different languages, including English and Chinese, and clone a voice from just 10 seconds of an audio recording.

MiniMax didn’t publish benchmark results comparing T2A-01-HD to other audio-generating models. But to this reporter’s ear, T2A-01-HD’s outputs sound on par with audio models from Meta and startups like PlayAI.

With the exception of T2A-01-HD, which is exclusively available through MiniMax’s API and Hailuo AI platform, MiniMax’s new models can be downloaded from GitHub and the AI dev platform Hugging Face.

Just because the models are “openly” available doesn’t mean they aren’t locked down in certain aspects, however. MiniMax-Text-01 and MiniMax-VL-01 aren’t truly open source in the sense that MiniMax hasn’t released the components (e.g. training data) needed to recreate them from scratch. Moreover, they’re under MiniMax’s restrictive license, which prohibits developers from using the models to improve rival AI models, and requires that platforms with more than 100 million monthly active users request a special license from MiniMax.

MiniMax was founded in 2021 by former employees of SenseTime, one of China’s largest AI firms. The company’s projects include apps like Talkie, an AI-powered role-playing platform along the lines of Character AI, and text-to-video models that MiniMax has released in Hailuo.

Some of MiniMax’s products have become the subject of minor controversy.

Talkie, which was pulled from Apple’s App Store in December for unspecified “technical” reasons, features AI avatars of public figures including Donald Trump, Taylor Swift, Elon Musk, and LeBron James, none of whom appear to have consented to being featured in the app.

In December, Broadcast magazine reported that MiniMax’s video generators can reproduce the logos of British television channels, suggesting that MiniMax’s models were trained on content from those channels. And MiniMax is reportedly being sued by iQIYI, a Chinese video streaming service that alleges MiniMax illicitly trained on iQIYI’s copyrighted recordings.

MiniMax’s new models arrive days after the outgoing Biden Administration proposed harsher export rules and restrictions on AI technologies for Chinese ventures. Companies in China were already prevented from buying advanced AI chips, but if the new rules go into effect as written, companies will be faced with stricter caps on both the semiconductor tech and models needed to bootstrap sophisticated AI systems.

On Wednesday, the Biden Administration announced additional measures focused on keeping sophisticated chips out of China. Chip foundries and packaging companies that want to export certain chips will be subjected to broader license requirements unless they exercise greater scrutiny and due diligence to prevent their products from reaching Chinese clients.



Source link

Lisa Holden
Lisa Holden
Lisa Holden is a news writer for LinkDaddy News. She writes health, sport, tech, and more. Some of her favorite topics include the latest trends in fitness and wellness, the best ways to use technology to improve your life, and the latest developments in medical research.

Recent posts

Related articles

Solar crushed 2024, but emissions were up as industry used more natural gas

The U.S. invested a record-breaking $338 billion in the energy transition last year, according to a new...

Rivian inches closer to profitability but warns ‘changes to government policies’ could hurt

Rivian’s cost–cutting measures have gotten it a lot closer to profitability, but the company is warning that...

Your Android phone could have stalkerware. Here’s how to remove it

Consumer-grade spyware apps that covertly and continually monitor your private messages, photos, phone calls and real-time location...

Beta Technologies’ bet on electric flight and Hyundai’s new Tesla charging port comes up short

Welcome back to TechCrunch Mobility — your central hub for news and insights on the future of...

Amazon shuts down Chime, its Zoom alternative

Amazon Chime, the tech giant’s underwhelming alternative to Zoom and Google Meet, is shutting down for good....

Cherryrock Capital raises new $172M fund from all-star investors to back diverse founders

Cherryrock Capital, founded by ex-TaskRabbit CEO Stacy Brown-Philpot, announced Wednesday the closing of its $172 million Fund...

Instagram’s new ad format lets creators get paid for testimonials in comments

Instagram is introducing a new way for creators to work with brands to make money by recommending...

Twitch caps streamers’ storage at 100 hours of highlights and uploads

Twitch on Wednesday announced it will begin limiting streamers to 100 hours of highlights and uploads, and...