Skip Navigation

InitialsDiceBearhttps://github.com/dicebear/dicebearhttps://creativecommons.org/publicdomain/zero/1.0/„Initials” (https://github.com/dicebear/dicebear) by „DiceBear”, licensed under „CC0 1.0” (https://creativecommons.org/publicdomain/zero/1.0/)FR
Posts
47
Comments
3,118
Joined
2 yr. ago

  • the term of art is "residential proxy" and there's a ton of them

    for example: it's the flipside of Bright's free VPN service - through Bright Data they sell people access proxied via some user's connection

  • yes, you can match on user agent, and then conditionally serve them other stuff (most webservers are fine with this). nepenthes and iocaine are the current preferred/recommended servers to serve them bot mazes

    the thing is that the crawlers will also lie (openai definitely doesn't publish all its own source IPs, I've verified this myself), and will attempt a number of workarounds (like using residential proxies too)

  • many of the proponents of things in this field will propose/argue $x thing to be massively valuable for $x

    thing is, that doesn't often work out

    yes, there's some value in the tech for translation outcomes. to anyone even mildly online, "so are language teaching apps/sites using this?" is probably a very nearby question. and rightly so!

    and then when you go digging into how that's going in practice, wow fuck damn doesn't that Glorious AI Future sheen just fall right off...