Since half or more of reddit is now bots and shills, I don’t imagine the training data is going to be great. That’s fine, Gemini already sucks, so it’ll be hard to make it worse.
The data being generated now, sure, but there are still years of actually useful data there.
Then add on the remaining half of comments that are from sensible users and it’s a decent, and still fairly unique, dataset.
There are many, many, many things posted as fact over the years on reddit that are not only untrue, but dangerous or even deadly in the case of some of the most idiotic advice given. I wish good luck telling them all apart to the poor 3rd world contractors the big commercial AI companies ~~exploit~~ use to “train” their stochastic parrots.