Mate in None: Lessons from Large Language…

Aug 9

A tournament showdown and a tense battle against the brand new ChatGPT 5

7 Comments

It's interesting that while standard chess engines have essentially perfect "sight" of the board in both its present and future state, the AI models play like a human playing blindfolded for the first time. A lot of their moves make some kind of sense, but the mistakes are really glaring.

Jennifer Shahade

Aug 10

The fact that it can find ...Rxe3!! is amazing. Though it also played ...Qxh2?? and ...Rxe2?? so maybe there's some pattern recognition around capturing things that is causing both brilliancies and blunders.

Andy Lee

Aug 10

Broken clocks and all that, I suspect.

Jim Henderson

Aug 10

"Yes, Qxh2 --- I get to play mate in one with an extra tempo!"

Jennifer Shahade

Aug 10

now that's a way to look at it!

Metaphysical Man

Aug 9

I can't believe there was actual money and resources spent on a tournament held between entities that can't even be counted on to play legal moves.

Jennifer Shahade

Aug 10 (edited)

I think that it's similar to my rationale. Assessing their chess can help reveal weaknesses that may apply in other areas. Those weaknesses can be more obvious in a perfect information game. This is important since so many people are relying on them more and more, sometimes even for tasks they're ill-suited for.

Games and The Grid
