How can the Fediverse protect against AI slop?

The Fediverse is a great system for preventing bad actors from disrupting "real" human-to-human conversations, because the mods, developers and admins are all working out of a desire to connect people (as opposed to "trust and safety" teams that are more concerned about user retention).

Right now it seems that the Fediverse's main protection is that it just isn't a juicy enough target for wide-scale spam and bad-faith agenda pushers.

But assuming the Fediverse does grow to a significant scale, what (current or future) mechanisms are/could be in place to fend off a flood of AI slop that is hard to distinguish from human-written content? Even the most committed instance admins can only do so much.

For example, I have a feeling all "good" instances will eventually have to turn on registration applications and only federate with other instances that do the same. But it's not crazy to imagine that GPT could soon outmaneuver most registration questions, which means registration applications will only slow the growth of the problem, not manage it long-term.

Any thoughts on this topic?

52
52 comments
  • Reminds me of this one:

    img

    - source

    46
    • What's the incentive to operate an LLM on the fediverse that is truly helpful and not just trying to secretly sell something/push an agenda?

      12
      • Well, I am not saying that the scenario is a perfect match, just that it reminded me of that:-).

        Though to answer your question, if Reddit were all AI slop whereas we were not, then they would be foolish to not exploit (for moar profitz) the source of legitimately true info that could be useful to answer people's questions, e.g. on topics such as whether and how to use Arch Linux btw. :-P

        9
      • To train it to mimic genuine human behaviour for applications elsewhere.

        1
    • The trouble with this is that I think bots and bad-faith trolls can split the difference, passing some minimum threshold of constructiveness and marrying it to their usual trolling behaviors.

      2
      • Agreed. Though it is not just that one isolated user - the admins of Lemmy.ml are quite well-known themselves for administering their server in bad faith as well. The side-bar just says "A community of privacy and FOSS enthusiasts, run by Lemmy’s developers" (and then a link to "What is Lemmy.ml" that returns an error when I try to click it - btw for you with an account, does it go anywhere? maybe a community that is only visible to those locally with an account? for me it says "There was an error on the server. Try refreshing your browser. If that doesn't work, come back at a later time. If the problem persists, you can seek help in the Lemmy support community or Lemmy Matrix room." - but what about when you click it?). And while people on that instance constantly criticize the USA's support for Israel's genocide in Gaza, if you so much as whisper a criticism of the likes of Russia, China, or North Korea, you will be banned even from communities that you have never once visited. That is simply how they do things over there. (For further reading, see the so, so very many examples in !yepowertrippinbastards@lemmy.dbzer0.com or !fediverselore@lemmy.ca or !meanwhileongrad@sh.itjust.works etc.)

        Sadly, I am not anywhere close to joking or exaggerating. Also, while they ban people for mentioning that e.g. people died in the Tiananmen Square massacre, they also protect mods who act horribly towards their fellow human beings. Here's an interesting example that you can read for yourself, e.g. at https://hexbear.net/post/3706906/5518427, where the mod told the poster (over a misunderstanding of an in-game event) that he wanted to kill them. Even in the unremoved comments the mod doubles down with “nono I don’t want to shoot for pointing that it’s a game, I want to shoot you because…”, and then later triples down still further, e.g. stating “I hope you die soon.” To be clear, this post shows up on hexbear.net (for some reason, despite the original having been removed entirely), but the incident occurred on lemmy.ml and the mod is from lemmy.ml - those instances are often intertwined, along with lemmygrad.ml.

        So you may want to consider switching instances. A further thought: I am having to reply to you from a different instance than my original comment since I have blocked all users from lemmy.ml (although PieFed's Notifications system is newly implemented and still not fully functional yet, causing me to have to hunt down why I received a Notification for a comment that I could not see:-). You will often face similar prejudice when speaking from that account on that instance - e.g. the apps Sync and Connect can also do such user-blocking of instances, and several instances such as lemmy.cafe and quokk.au and dubvee.org have outright defederated from lemmy.ml entirely. Thus you may sometimes feel like you are speaking into a void and wondering why nobody will respond to you - I am explaining that this may well be a reason why.

        I hope that you don't feel that I am picking on you personally, just trying to share that thought that could help you understand the contentious situation between the "tankie" vs. "liberal" instances on the Fediverse:-). If you wanted an instance that is specifically leftist, slrpnk.net seems awesome? In contrast, lemmy.ml merely pretends to be leftist, while actually advocating solely for formerly communist powers, despite them being currently capitalistic, and definitely authoritarian - e.g. you will see people praising the virtues of North Korea there, but nowhere else on the Fediverse that I have yet seen! Although for me, it's not even what those users believe, so much as their improper argumentation form about it, e.g. here's an example from the bad-faith user you mentioned, posted just prior to the USA election, which seems to be an attempt to encourage the BuT bOtH sIdEs EqUaL ThO rhetoric:

        img

        And I see this kind of thing so often from users on lemmy.ml, that I just blocked the entire instance - again, I hope you personally don't feel attacked by this, just sharing my reasoning in case that may be helpful for you.:-)

        2
  • There are two groups here, bots, and bad actors. We've found that these measures have mostly stopped them both.

    Bots

    • Registration applications. It's been extremely easy to differentiate bots from real people by asking a series of simple questions and only letting the real people in.
    • Reports: so that mods / admins can see them quickly.
    • Blocking open-signup servers that don't require applications, since these usually serve as sources of spam attacks against the whole fediverse.

    Some bots still get through occasionally, but not many compared to before. And some servers have more "lax" application questions, so they let more through.

    Bad actors

    • Registration applications. Most of the trolls are of a temperament where they refuse to do the work of answering questions earnestly. They can't help themselves but give obviously trolling answers, if they do even bother to do that work at all.
    • Reports: same as above.
    • Ban + remove. Mods and admins can ban a person and remove all of their content at the click of a button. So even if the troll did the work of getting past the front door, all their work is nullified by an action that takes less than 5 seconds. They end up wasting much more of their own time than the admins' time, and accomplish nothing lasting.
    24
  • Hi there! Admin of Tucson.social here.

    I think that the only way the fediverse can honestly handle this is through local/regional nodes, not interest-based global nodes.

    Ideally this would manifest as some sort of non-profit entity that would work with municipalities to create community owned spaces that have paid moderation.

    So then comes the problem of folks not agreeing with a local node's moderation staff - but that's also WHY it should be local. It's much easier to petition and organize against someone who exists in your town than some guy across the globe who happens to own a large fediverse node.

    This model just doesn't work (IMO) if nodes can't be accountable to a local community. If you don't like how Mastodon, or lemmy.world are moderated you have zero recourse. For Tucson.social - citizens of Tucson can appeal to me directly, and because they are my fellow citizens I take them FAR more seriously.

    Only then will people be trusting enough to allow for the key element in protecting against AI slop: Human Indemnification Systems. Right now, if you wanted to ask the community of lemmy.world to provide proof they are human, you'd wind up with an exodus. There's just no trust for something like that, and it would be hard to acquire enough of it.

    With a local node, that conversation is still difficult, but we can do things that just don't scale with global nodes. Things like validating a person by meeting them to mark them as "indemnified" on a platform, or utilizing local political parties to validate if a given person is "real" or not using voter rolls.

    But yeah, this is a bit rambly, but I'll conclude that this is a problem that exists at the intersection between trust and scale and that I believe that local nodes are the only real solution that can handle both.

    17
    • lemmy.world are moderated you have zero recourse

      !yepowertrippinbastards@lemmy.dbzer0.com

      6
      • ???

        I don't particularly have any issues with them.

        But if a user did, they don't have much recourse. I'm talking about that as a structural aspect. Not a moral one.

        But sure if you just want to claim this puts me in the !yepowertrippinbastards@lemmy.dbzer0.com community by ripping it out from any relevant context, go ahead I guess?

        4
      • "Power tripping mods" definitionally cannot exist on the fediverse where anyone can create an instance or community. Even on Reddit, 99% of the time someone said a mod was "power tripping" it was just a right winger upset that the mod removed their disruptive nonsense.

        The purpose of communities like the one you linked to is to shame mods into employing a passive, generic bare-minimum style of moderation, when we should be encouraging the opposite if we want diversity in the fediverse.

        1
    • Thanks for the thoughtful response. I too think that regional instances would be ideal for a "backbone" of the social web. But at the same time, I feel that interest-based connection is a truly unique strength of the internet and it would be a sad thing to lose to the slop.

      Ultimately, I think that more, smaller instances is likely the best "ultimate" defense against slop since there is no incentive for them to scale beyond their needs. But every instance admin is technically responsible for the content on all federated instances. Which can get overwhelming!

      5
      • I mean, regional instances don't have to stop folks from engaging primarily with interest based communities.

        Some regions will dominate certain interests, for example - here in Tucson we're considered one of the Amateur Astronomy capitals of the world. If mander.xyz were to disappear tomorrow, Tucson would make a good home for all of the fediverse's astronomy needs even though it's a region-based instance.

        Further, there's nothing that states an interest-based instance needs any registration. One could imagine a world where local instances have all the users and identities, and the interest based instances simply provide communities to the larger fediverse with no users of their own.

        But yeah, it's definitely a paradigm shift that makes interest based communities a bit more difficult to find.

        4
  • Instead of trying to detect and block it, just disincentivize it.

    Most AI spam on social media tries to exploit various systems intended to predict “good” content on the basis of a user’s past activity, by tracking reputation/karma/etc. Bots build up karma by posting a massive amount of innocuous (but usually insipid) content, then leverage that karma to increase the visibility of malicious content. Both halves of this process result in worse content than if the karma system didn’t exist in the first place.

    8
  • Maybe it was silently assumed, but nobody so far has mentioned the endless stream of scrapers that go through my probably juicy but private instance. I'm banning a new bot every week, and by now they have switched to distributed actions. I get over 400 requests per hour from a couple of IPs for the same stuff, with changing user agents because I wrote automated detection mechanisms. I might just make my instance login-only.
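
    To illustrate the kind of detection described here, below is a minimal sketch, not the actual mechanism running on that instance. It assumes access-log entries have already been parsed into (timestamp, ip, user_agent) tuples; the function name and both thresholds are hypothetical choices for illustration.

```python
from collections import defaultdict
from datetime import datetime, timedelta


def flag_suspicious_ips(log_entries, max_requests_per_hour=200, max_user_agents=3):
    """Return the set of IPs that look like scrapers.

    log_entries: iterable of (timestamp, ip, user_agent) tuples (assumed format).
    An IP is flagged if, within any single clock hour, it sends more than
    max_requests_per_hour requests or uses more than max_user_agents
    distinct user agents. Both thresholds are illustrative assumptions.
    """
    # Group requests by (ip, clock hour) and track count plus distinct agents.
    buckets = defaultdict(lambda: {"count": 0, "agents": set()})
    for timestamp, ip, user_agent in log_entries:
        hour = timestamp.replace(minute=0, second=0, microsecond=0)
        bucket = buckets[(ip, hour)]
        bucket["count"] += 1
        bucket["agents"].add(user_agent)

    flagged = set()
    for (ip, _hour), bucket in buckets.items():
        too_many_requests = bucket["count"] > max_requests_per_hour
        rotating_agents = len(bucket["agents"]) > max_user_agents
        if too_many_requests or rotating_agents:
            flagged.add(ip)
    return flagged


# Hypothetical sample data: one IP hammering the server with rotating
# user agents, and one ordinary visitor.
base = datetime(2024, 1, 1, 12, 0, 0)
sample = [
    (base + timedelta(seconds=10 * i), "203.0.113.5", f"agent-{i % 5}")
    for i in range(250)
]
sample += [
    (base + timedelta(minutes=i), "198.51.100.7", "Mozilla/5.0")
    for i in range(10)
]
print(flag_suspicious_ips(sample))  # {'203.0.113.5'}
```

    Counting distinct user agents per IP is what catches the rotation trick: a real browser rarely changes its user agent within an hour. It does nothing against traffic spread thinly across many IPs, which is the harder "distributed" case mentioned above.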

    8
  • I don't think there is any way to have a genuine "open forum" amongst complete strangers. There have always been human troll farms pushing narratives using sock puppet accounts; AI is just enabling it to reach new scales.

    I actually am for echo chambers when it comes to social media, but the kind in which you only follow people you know or trust and ignore complete strangers. You should also make sure you get news and critical information from OUTSIDE social media, again from institutions you trust.

    6
    • Yes, strong moderation by members of the community is sufficient to recognize and remove bad (human) actors. The question is one of volume and overwhelming those human mods. GPT can create hundreds of bad-faith accounts.

      1
  • The fediverse architecture was built from the beginning to allow instance-by-instance exercise of discretion to mute any systemic effects that could take over the network as a whole.

    This was I think oriented toward limiting swarming behavior from trolls, but I think it also applies to AI bots.

    Right now it seems that the Fediverses main protection is that it just isn’t a juicy enough target for wide scale spam and bad faith agenda pushers.

    If you ask me, they are already here right now, but I think it's not the architecture of the fediverse but the judgment of individual mods that has let us down in this case.

    5
  • I think that being human scale is largely the appeal of the Fediverse. Each instance isn't meant to grow to the size of a centralized platform, but to be a relatively small community of people with some shared interests. I look at it similarly to the way IRC channels worked back in the day. You tend to have a group of people whom you interact with frequently and that's how you know they're human. If some bot enters the community then it becomes obvious very quickly.

    5
  • I have had similar thoughts. I think the answer ultimately lies in active mods who can really get to know a community and its users, and identify when users are pushing a narrative even if they can't confirm whether they are a bot or not.

    Also, as @dessalines@lemmy.ml pointed out, user registrations. On startrek.website we have a question that is easy for a Star Trek fan to answer but not easy for a bot (although, getting back to your concern, ChatGPT probably would have no problem).

    4
  • "The fediverse" really can't. That's just the reality of a decentralized system. It's going to be up to individual instances to sort it out.

    But that's a good thing, because what it means is that different instances can and will try different approaches, and between them, they'll sooner or later hit on the one(s) that will be most effective.

    3
  • What can be done? Smarter people can probably list plenty of things. But in the end, it's a constant race to outcompete the other side. And with LLMs/AI, you can literally train a model on the system you want it to overcome, with that express purpose, and let it work out the "how", and you're back to square one again.

    I think it can best be put in song

    Or put another way: how do you make a bear proof trashcan that can defeat a bear but not the dumbest of humans?

    1
  • As you said, a platform with 44k monthly active users is probably not worth the time investment for spammers and agenda pushers.

    If at some point we make it, we'll see. It seems like we are still quite far off.

    -1