this post was submitted on 09 Jun 2023
274 points (100.0% liked)
Mildly Infuriating
Home to all things "Mildly Infuriating". Not infuriating, not enraging. Mildly Infuriating. All posts should reflect that.
I want my day mildly ruined, not completely ruined. Please remember to refrain from reposting old content. If you post a post from reddit it is good practice to include a link and credit the OP. I'm not about stealing content!
It's just good to have something on this website for casual viewing whilst refreshing original content is added over time.
True, users do retain copyright in anything they write, but they also grant reddit a license to use it however it wants, including sub-licensing it to others. That means the corps absolutely DO NOT need users' permission to train their AI. They just buy the rights to use the data from reddit.
This includes images and videos that are uploaded to the reddit servers directly.
Reddit has the right to use the data and sell that data to others. Also, some data you can scrape, but there's additional data that is available only through the API. Web scraping is not reliable, especially if reddit actively flags your spider and blocks it. They are not the idiots we want to believe they are. No mega corp is going to risk not having competitive access to data to feed their AIs when the cost for them to just pay is insignificant.
This shit was definitely not in the user agreement when I signed up in 2007.