SuperAnnotate helps companies manage their AI data sets


High-quality data may be the key to high-quality AI. With studies finding that data set curation, rather than size, is what really affects an AI model’s performance, it’s unsurprising that there’s a growing emphasis on data set management practices. According to some surveys, AI researchers today spend much of their time on data prep and organization tasks.

Brothers Vahan Petrosyan and Tigran Petrosyan felt the pain of having to manage lots of data while training algorithms in college. Vahan went so far as to create a data management tool during his Ph.D. research on image segmentation.

A few years later, Vahan realized that developers — and even corporations — would happily pay for similar tooling. So the brothers founded a company, SuperAnnotate, to build it.

“During the explosion of innovation in 2023 surrounding models and multimodal AI, the need for high-quality datasets became more stringent, with each organization having multiple use cases requiring specialized data,” Vahan said in a statement. “We saw an opportunity to build an easy-to-use, low-code platform, like a Swiss Army Knife for modern AI training data.”

SuperAnnotate, whose clients include Databricks and Canva, helps users create and keep track of large AI training data sets. The startup initially focused on labeling software, but now provides tools for fine-tuning, iterating and evaluating data sets.


With SuperAnnotate’s platform, users can connect data from local sources and the cloud to create data projects on which they can collaborate with teammates. From a dashboard, users can compare the performance of models by the data that was used to train them, and then deploy those models to various environments once they’re ready.

SuperAnnotate also provides companies access to a marketplace of crowd-sourced workers for data annotation tasks. Annotations are usually text labels that describe the meaning, or the individual parts, of the data that models train on. They serve as guideposts, “teaching” models to distinguish things, places and ideas.
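To make the idea of an annotation concrete, here is a minimal sketch in Python of a text label attached to a span of a sentence. It is illustrative only: the `Annotation` class, the `annotate` and `labeled_spans` helpers, and the label names are all hypothetical and do not reflect SuperAnnotate's actual data format or API.

```python
from dataclasses import dataclass


@dataclass
class Annotation:
    """A hypothetical label attached to a character span of some text."""
    label: str
    start: int  # index of the first character of the span
    end: int    # index one past the last character of the span


def annotate(text: str, phrase: str, label: str) -> Annotation:
    """Label the first occurrence of `phrase` in `text`."""
    start = text.index(phrase)  # raises ValueError if the phrase is absent
    return Annotation(label, start, start + len(phrase))


def labeled_spans(text: str, annotations: list[Annotation]) -> list[tuple[str, str]]:
    """Pair each annotated piece of text with its label."""
    return [(text[a.start:a.end], a.label) for a in annotations]


text = "SuperAnnotate is based in San Francisco."
annotations = [
    annotate(text, "SuperAnnotate", "ORGANIZATION"),
    annotate(text, "San Francisco", "PLACE"),
]

for span, label in labeled_spans(text, annotations):
    print(f"{label}: {span}")
```

A model trained on many examples like this sees both the raw text and the labels, which is how the labels act as guideposts.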

To be frank, there are several Reddit threads about SuperAnnotate’s treatment of the data annotators it uses, and they aren’t flattering. Annotators complain about communication issues, unclear expectations, and low pay.

For its part, SuperAnnotate claims it pays fair market rates and that its demands on annotators aren’t outside the norm for the industry. We’ve asked the company to provide more detailed information about its practices and will update this piece if we hear back.

There are several competitors in the AI data management space, including startups like Scale AI, Weka and Dataloop. San Francisco-based SuperAnnotate has managed to hold its own, however, recently raising $36 million in a Series B round led by Socium Ventures, with participation from Nvidia, Databricks Ventures, Play Time Ventures and Defy.vc.

The fresh capital, which brings SuperAnnotate’s total raised to just over $53 million, will be used to expand its team of around 100, fund product R&D, and grow SuperAnnotate’s customer base of roughly 100 companies.

“We aim to build a platform capable of fully adapting to enterprises’ evolving needs and offering extensive customization in data fine-tuning,” Vahan said.




Lisa Holden
Lisa Holden is a news writer for LinkDaddy News. She writes about health, sports, tech, and more. Some of her favorite topics include the latest trends in fitness and wellness, the best ways to use technology to improve your life, and the latest developments in medical research.
