Here’s the bottom line…
- Reddit exists to serve you ads, farm and sell your data.
- Reddit doesn’t like or support you data hoarding.
- Reddit only cares if you’re making them money.
- Reddit says one thing and does another.
- Reddit will strip and ban mods that aren’t willing to bend over.
We could go on, but you get the point… You have no say here, you lick the boots or fuck you.
So the API is about to be shafted, many apps/bots will die, other things will change, you know what’s up. But the more important thing directly related to the DataHoarding community is that Reddit has now very effectively killed Pushshift from a data hoarding perspective which was the only place you could get the most complete up-to-date Reddit data in bulk.
Reddit has now taken control of Pushshift, had them delete bulk data downloads, prevents them releasing new dumps and limits PS API access to only mods Reddit approves of.
/r/DataHoarder moving forward…
We will continue to exist and operate as we have for as long as Reddit allows us to. We will promote alternatives for those of you who wish leave finding DataHoarder communities elsewhere. We will promote every project, tool and download that seeks to keep Reddit data available to both DataHoarders and researchers. We will continue to hoard. We will not hit any fucking delete buttons.
New rule.
-
- r/techsupport exists.
We see a lot of basic vaguely dh related tech support questions here, we’re going to be more actively removing these posts. Many of these also clearly break rule 1 as they’re asked every other week.
Sidebar updates.
- Historic Reddit Archives & Download Tools, Etc.
- #datahoarders @ The-Eye Discord (tag a helper, #support exists also)
- c/datahoarder @ lemmy.ml (experimental, we we’re invited there)
- r/DataHorader 2013-2023 Searchable Archives
Happy Hoarding.
They really should point people here In the “new post” directions/rules