this post was submitted on 22 Jul 2023
Asklemmy
It wasn’t archive. It was a site that specifically stored Reddit threads in a queryable form. Unfortunately, I didn’t explore that site enough. Ideally, I’d like to be able to pull an arbitrary month’s worth of /r/politics and keep only posts with more than 30 replies or above some upvote/downvote threshold.
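For what it's worth, here is a rough sketch of that kind of filter, assuming the threads are already available as a list of dicts (for example, loaded from a JSON dump). The field names `subreddit`, `created_utc`, `num_comments`, and `score` are assumptions about the dump format, and `select_posts` is a made-up helper, not something from any particular site or tool.

```python
from datetime import datetime, timezone


def in_month(created_utc, year, month):
    """Return True if a Unix timestamp (seconds, UTC) falls in the given month."""
    dt = datetime.fromtimestamp(created_utc, tz=timezone.utc)
    return dt.year == year and dt.month == month


def select_posts(posts, subreddit, year, month, min_comments=30, min_score=None):
    """Keep posts from one subreddit and month that are busy or popular.

    A post is kept if it has MORE than `min_comments` replies, or, when
    `min_score` is given, if its score is at least `min_score`.
    Field names are assumed, not taken from a specific archive.
    """
    selected = []
    for post in posts:
        if post["subreddit"].lower() != subreddit.lower():
            continue
        if not in_month(post["created_utc"], year, month):
            continue
        busy = post["num_comments"] > min_comments
        popular = min_score is not None and post["score"] >= min_score
        if busy or popular:
            selected.append(post)
    return selected
```

Something like `select_posts(posts, "politics", 2023, 6, min_score=100)` would then give June 2023 posts with more than 30 replies or a score of at least 100.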
As an aside, was it confirmed that they were restoring content? When I was doing it, I was using a Python script that just grabbed all posts and comments for a given username, overwrote the data with some lorem ipsum text, and then deleted it. Sometimes, even when I’d finally get the post count down to zero, I’d find I still had some posts the following day.
It turned out that the API wasn’t showing posts from subreddits that had gone dark. So if a sub you posted in was dark when you were running your script, the script couldn’t see or delete those posts until the sub opened back up. It made it look like there was something sketchy going on, but at least in my case it was just two types of protests clashing.
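The overwrite-then-delete loop is roughly the following. This is a minimal sketch, not the actual script: `scrub` is a made-up name, and it only assumes each item has `edit(text)` and `delete()` methods. With a client library such as PRAW, the items would presumably come from a user's comment and submission listings, but that part is left out here.

```python
def scrub(items, filler="Lorem ipsum dolor sit amet."):
    """Overwrite each item with filler text, then delete it.

    `items` is any iterable of objects exposing edit(text) and delete();
    that interface is an assumption for this sketch. Returns the number
    of items processed.
    """
    count = 0
    for item in items:
        item.edit(filler)   # replace the original text first
        item.delete()       # then remove the item itself
        count += 1
    return count
```

Since anything in a dark sub won't show up in the listing, a single pass reaching zero doesn't mean everything is gone, so it makes sense to run it again once those subs reopen.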
I haven't read an official study about it, but I've seen random posts and comments of mine popping back up here and there, with no apparent pattern in time or subreddit.
Let me know if you find what you're looking for, I'm interested :)
There were a few efforts in datahoarders to archive stuff before July 1st. I believe the correct search term to find them would be "redarc".
Here's one project: https://github.com/Yakabuff/redarc