Skip Navigation

Posts
0
Comments
7
Joined
2 days ago

  • This seems like a dumb benchmark.

    ClockBench evaluates whether models can read analog clocks - a task that is trivial for humans, but current frontier models struggle with.

    What do you mean trivial? Most humans I know can't read the most basic white-background-big-black-numbers clocks.

    Someone rigged the jury to get 90% on this:

  • There's a difference between

    "A pedophile committed a crime in my house (but I had nothing to do with it)."

    and

    "Gee, the pedophiles seem to think my house is a great place to do crime, because they keep doing it, but that's none of my business."

  • I imagine it's because Second Life was never popular with children.

    As bad as mostly-adult spaces can be, the worst kinds of humans seem to skitter around children's spaces.

  • Once upon a time, I set up my phone so I didn't need to look at it: it was basically e-ink and audiobooks.

    Then I started adding games and learning apps back (I don't remember why), and now I feel like I'm not going back until e-ink reaches parity with smartphones (refresh rate, cell coverage, near-current OS).

  • It was originally an ad (I think) for a claymore plushie.

  • To help meet their quota, ICE has started wearing these glasses:

    Edit: misspelled quota