In case you didn’t know, you can’t train an AI on content generated by another AI because it causes distortion that reduces the quality of the output. It is also very difficult to filter out AI text from human text in a database. This phenomenon is known as AI collapse.
So if you were to start using AI to generate comments and posts on Reddit, their database would be less useful for training AI and therefore the company wouldn’t be able to sell it for that purpose.
They’re paying Reddit to not sue them.
Regardless, the content that’s available through PS is the content that people are talking about overwriting or deleting. They can’t edit or delete stuff that PushShift couldn’t see in the first place.
Given how many defences Google would have against that ant called Reddit suing it, ranging from actual fair points to “ackshyually”, I find it unlikely.
Emphasis mine. Can you back up this claim?
I’m asking this because the content from PS is up to March/2023, it’s literally a year old. There was a lot of activity in Reddit in the meantime, and it’s from my impression that people talking about this are the ones who already erased their content in the APIcalypse, but kept using Reddit because there’s some subject “stuck” there that they’d like to use.