SnoringEarthworm

joined 1 day ago
[–] SnoringEarthworm@sh.itjust.works 4 points 2 hours ago* (last edited 2 hours ago) (1 children)

This seems like a dumb benchmark.

ClockBench evaluates whether models can read analog clocks - a task that is trivial for humans, but current frontier models struggle with.

What do you mean trivial? Most humans I know can't read the most basic white-background-big-black-numbers clocks.

Someone rigged the jury to get 90% on this:

There's a difference between

"A pedophile committed a crime in my house (but I had nothing to do with it)."

and

"Gee, the pedophiles seem to think my house is a great place to do crime, because they keep doing it, but that's none of my business."

I imagine it's because Second Life was never popular with children.

As bad as mostly-adult spaces can be, the worst kinds of humans seem to skitter around children's spaces.

Once upon a time, I set up my phone so I didn't need to look at it: it was basically e-ink and audiobooks.

Then I started adding games and learning apps back (I don't remember why), and now I feel like I'm not going back until e-ink reaches parity with smartphones (refresh rate, cell coverage, near-current OS).

[–] SnoringEarthworm@sh.itjust.works 37 points 23 hours ago* (last edited 3 hours ago)

To help meet their quota, ICE has started wearing these glasses:

Edit: misspelled quota