@fne8w2ah@lemmy.world
link
fedilink
English
38M

That’s why spez the hurensohn “refreshed” the T&Cs very recently.

@xantoxis@lemmy.world
link
fedilink
English
158M

Damn. I keep meaning to use one of those things that deletes all your reddit data. I doubt it’ll actually do anything (reddit has no ethical framework so they won’t think twice about indexing “deleted” data) but I still need to do that.

@Alpha71@lemmy.world
link
fedilink
English
118M

Yeah, I deleted a banned account only to still find the posts I made still up. So I went in and manually deleted EVEY. SINGLE. ONE.

Guess what. They still show up.

@ipkpjersi@lemmy.ml
link
fedilink
English
158M

I’d bet a year of my salary that it only deletes it from public view so people can no longer get helped from Reddit’s Google search results, but a copy (or more than one copy) is still retained on their internal servers.

Maybe I’m miss remembering but weren’t they restoring stuff users deleted during the API protest?

@ipkpjersi@lemmy.ml
link
fedilink
English
18M

They absolutely were, yeah.

So nothing realy new after alls half reddit is repost bot .

@Kbobabob@lemmy.world
link
fedilink
English
-1
edit-2
8M

Lol, what do you think Lemmy is? There’s a lot of posts on here directly scraped from Reddit by bots.

I am willing to bet the most active subreddits that are not too bot infested are the NSFW ones. Reddit AI is going to be creepy and horny.

@db2@lemmy.world
link
fedilink
English
88M

Greedy little pigboy Steve couldn’t resist. Every day they seem to do something that reaffirms leaving was the best plan.

Fake4000
creator
link
fedilink
English
1098M

Shit move from Reddit. Glad I jumped ship to lemmy.

Honestly, lemmy has less users compared to Reddit, yet you still get more engagement.

@AtariDump@lemmy.world
link
fedilink
English
128M

@Quadhammer@lemmy.world
link
fedilink
English
78M

If gollum and Steve Buscemi had a secret baby

Boozilla
link
fedilink
English
898M

I don’t miss the dipshits, pun spammers, and smug power mods of reddit at all. I do miss their niche subs and smarter users. Like it or not, they do have some brainy folks peppered among the shit posters.

We have some good folks here, too. Just need more of them.

It’s a shame reddit has been dialing up the shit faucet slowly enough that most of their users don’t notice how awful it is now. They’ve grown accustomed to the poor quality of the content and weaponized greed of the owners.

Ready! Player 31
link
fedilink
English
28M

Going back to /r/all on reddit now just pure trash. It’s unbelievable how badly it’s declined, very recently.

Boozilla
link
fedilink
English
28M

I wonder how much of it is just bots and karma farmers pretending to talk to each other. It’s really awful.

Fake4000
creator
link
fedilink
English
398M

In all honesty, when I joined Reddit right after digg went to shit. It was amazing. Reddit was great, 3rd party apps were welcome, their interface was straightforward, and they had none of those NFT gold shit.

It just went downhill.

I left Reddit. Had over 600k Karma after a few years answering all kinds of questions from Veteran help to complex engineering.

Fuck Reddit. Will never go back. It’s a shell of what it was only a few years ago.

Boozilla
link
fedilink
English
28M

Glad you’re here with us!

deweydecibel
link
fedilink
English
12
edit-2
8M

smug power mods of reddit at all.

Oh they’re here too. They’re not causing too much drama because there’s not enough going on, but they’re here. Some of them are admins of certain instances.

The ones that aren’t here yet will eventually find their way here when Lemmy continues to grow. And the most concerning thing about that is how many more tools Lemmy is providing them to fuck with users.

At least on Reddit, mods couldn’t see votes. Lemmy actually just made it easier for them.

The next move is to use AI to generate posts and comments

Fake4000
creator
link
fedilink
English
48M

I honestly think that has been happening with all these publications websites.

ME5SENGER_24
link
fedilink
English
88M

FUCK REDDIT! FUCK U/SPEZ! The Red-exit shall endure, VIVA LA LEMMY!!

@pixxelkick@lemmy.world
link
fedilink
English
39
edit-2
8M
  1. Called this awhile back, this is why Reddit has such a high evaluation.

  2. Poisoning your data won’t do anything but give them more data, do you seriously think reddit servers don’t track every edit you make to posts? You’d literally just be providing training data of original human vs poisoned. They’d still have your original post, and they have a copy of everytime you edit it.

  3. Whoever buys reddit will have sole access to one of the larger (I don’t think largest though) pools of text training Data on the internet, with full licensed usage of it. I expect someone like Google, FB, MS, OpenAI, etc would pay big $$$ for that.

“But can’t people already scrape it?”

  1. Well yes, but it’s at best legally dubious in some places

  2. Scraping Data off reddit only gets you current versions of posts (which means you can get poisoned dara, and cant see deleted content), and is extremely slow… if you own the server you have first class access to all posts in a database, including g the originals and diffs of everytime soneone edited a post, and all the deleted posts too.

Think about if you perhaps wanted to train an AI to detect posts that require flagging for moderation, if you scrape reddit data, you can’t find deleted posts that got moderated…

But, if you have the raw original data, you 100% would have a list of every post that got deleted by mods and even the mod message on why it was deleted

You surely can see the value of such data, that only owners of reddit are currently privy to atm…

request your reddit data and they deliver you every comment you ever made

They’ve also got vote counts and breakdowns of who is making those votes. This data will be worth more for AI training than any similar volume of data other than maybe the contents of Wikipedia. Assuming they didn’t have it set up to delete the vote breakdowns when they archived threads.

Why are those breakdowns worth so much? Because they can be used to build profiles on each voter (including those who only had lurker accounts to vote with), so they can build AIs that know how to speak with the MAGA cult, Republicans who aren’t MAGA, liberals, moderates, centrists, socialists, communists, anarchists. Not only that, they’ll be able to look at how sentiments about various things changed over time with each of these groups, watch people move from one to another as their opinions evolved, see how someone pretends to be a member of whatever group (assuming they voted honestly and posted under their fake persona).

Oh and also, all of that data is available through the fediverse but it’s free to train on to anyone who sets up a server. Which makes me question whether the fediverse is a good thing because even changing federation to opt-in instead of opt-out just covers whether your server accepts data from another. It’s always shared.

Open and private are on opposite sides of a spectrum. You can’t have both, best you can do is settle for something in the middle.

@pixxelkick@lemmy.world
link
fedilink
English
48M

Which makes me question whether the fediverse is a good thing

I’d argue it’s good, because it means open source AI has a fighting chance with FOSS data to train on without needing to fork over a morbillion dollars to Reddits owners.

Whatever use cases the reddit data can train on, FOSS researchers can repeat it on Lemmy data and release free models that average joes can use on their own without having to subscribe to shit like Microsoft Copilot and friends to stay relevant.

@Breezy@lemmy.world
link
fedilink
English
28M

What if reddit also kept all deleted comments and post, im sure there are shit loads of things people type out just to delete, thinking all the while it’ll never see the light of day.

I’d be surprised if they don’t keep all of that. There were a number of sites for looking at deleted posts. They’d just go and grab everything and compare what was still there with what wasn’t and highlight the stuff that wasn’t there anymore.

Which is also possible here, though the mod log reduces the need for it. But if someone is looking for posts people change their mind about wanting anyone to see, deleting it highlights it instead of hides it for anyone who is watching for that.

@Breezy@lemmy.world
link
fedilink
English
3
edit-2
8M

I think that site was unddit, but yes those were posted then later deleted. Im talking about just typing out a post or comment and never posting just simply backing out of the page or hitting cancel. Im not just if any of that is stored on the site or just locally.

Oh, yeah, I’ve wondered the same myself. Hell, that might have been a motivation for removing the API access.

@pixxelkick@lemmy.world
link
fedilink
English
38M

They definitely do, it’s common for such systems to never actually delete anything because storage is cheap. It likely just is flagged deleted=true and the searches just return WHERE [post].Deleted = False on queries on the backend.

So it looks deleted to the consumer, but it’s all saved and squirreled away on the backend.

It’s good to keep all this shit for both legal reasons (if someone posts illegal stuff then deletes it, you still can give it to the feds), as well as auditing (mods can’t just delete stuff to cover it up, the original still exists and admins can see it)

Sounds like something a bunch of governments would be interested in. As you pointed out you get to see why human mods made certain decisions. Could you an edge in manipulation.

@vynlwombat@lemmy.world
link
fedilink
English
-2
edit-2
8M

You’re not wrong. But on point #1, you’re just an asshole

@Falcon@lemmy.world
link
fedilink
English
18M

With respect to 2, it would stop others scrapping the content to train more open models on. This would essentially give Reddit exclusive access to the training data.

Slightly unrelated question, but is there an easy way to delete all my Reddit posts and comments? I used the Nuke add-on in the past, but it doesn’t work anymore.

I wanna delete my Reddit account, but I’d prefer to erase my history before doing that.

@gnate@lemmy.world
link
fedilink
English
58M

This userscript worked for me (in the last 24hrs): https://greasyfork.org/en/scripts/23605-reddit-history-sanitizer

I used Redact. It seemed to work.

With their API changes I’m not sure.

This is what I used and was recommended during the great purge.

https://github.com/j0be/PowerDeleteSuite

@axo@lemmy.world
link
fedilink
English
338M

I barely post on reddit, just lurk but this made me finally sign up for an account here.

Fake4000
creator
link
fedilink
English
258M

Welcome to lemmy.

If they build an AI based on reddit content it will be the devil incarnate.

@valkyre09@lemmy.world
link
fedilink
English
4
edit-2
8M

Can’t wait to hear the fan fiction the AI bot generates

@Pinecone@lemmy.world
link
fedilink
English
138M

If you thought gpt4 was confidently incorrect wait until you see this next ai.

XIIIesq
link
fedilink
English
108M

If you’re not paying for the product, you are the product.

@WhatAmLemmy@lemmy.world
link
fedilink
English
6
edit-2
8M

And even when you pay for the product, you are the product, because capitalism requires infinite growth from a finite system.

@bbkpr@lemmy.world
link
fedilink
English
108M

Good, so let’s train crappy AI on posts by crappier AI, which was trained by posts from even crappier AI before it.

Create a post

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


  • 1 user online
  • 182 users / day
  • 580 users / week
  • 1.37K users / month
  • 4.49K users / 6 months
  • 1 subscriber
  • 7.41K Posts
  • 84.7K Comments
  • Modlog