Bluesky’s open API means anyone can scrape your data for AI training

Date:

Share post:


Bluesky might not be training AI systems on user content as other social networks are doing, but there’s little stopping third-parties from doing so.

Per a report by 404 Media, a machine learning librarian at AI firm Hugging Face pulled 1 million public posts from Bluesky via its Firehose API for machine learning research, pushing the dataset to a public repository. Daniel van Strien later removed the data due to the controversy that ensued, however it serves as a timely reminder that everything you post publicly to Bluesky is, well, public.

Bluesky said that it’s looking at ways to enable users to communicate their consent preferences externally, though it’s up to those parties whether they respect those preferences.

The company posted: “Bluesky won’t be able to enforce this consent outside of our systems. It will be up to outside developers to respect these settings. We’re having ongoing conversations with engineers & lawyers and we hope to have more updates to share on this shortly!”

What’s clear here is that while Bluesky is surging in popularity, its rapid rise to the forefront of the global consciousness will mean it’s subject to the same levels of scrutiny as other major social platforms.



Source link

Lisa Holden
Lisa Holden
Lisa Holden is a news writer for LinkDaddy News. She writes health, sport, tech, and more. Some of her favorite topics include the latest trends in fitness and wellness, the best ways to use technology to improve your life, and the latest developments in medical research.

Recent posts

Related articles

TechCrunch Sessions: AI speaker applications close March 7

On June 5, TechCrunch Sessions: AI will kick off — and you can be a part of the...

Podcasting platform Podcastle launches a text-to-speech model with more than 450 AI voices

Podcast recording and editing platform Podcastle is now joining other companies in the AI-powered, text-to-speech race by...

Google upgrades Colab with an AI agent tool

Google Colab, Google’s cloud-based notebook tool for coding, data science, and AI, is gaining a new “AI...

Anthropic raises $3.5B to fuel its AI ambitions

AI startup Anthropic today announced that it raised $3.5 billion at a $61.5 billion post-money valuation, led...

US said to halt offensive cyber operations against Russia 

The United States has suspended its offensive cyber operations against Russia, according to reports, amid efforts by...

Chinese buyers are getting Nvidia Blackwell chips despite U.S. export controls

Upholding export controls on semiconductor chips made in the U.S. made chips may be harder than Washington...

As Skype shuts down, its legacy is end-to-end encryption for the masses

In the early evening of March 5, 2012, in Cairo, Egyptian revolutionaries stormed the headquarters of the...

Opera announces a new agentic feature for its browser

Norway-based browser company Opera announced a new agent feature called Browser Operator as a feature preview. The...