Discussion about this post

Jason Benn

Thank you for the fascinating writeup!

Vlad Goodenough

This ARC “solution” isn’t really intelligence — it’s just leaning on how good LLMs already are at spitting out Python. The fancy loop around it is basically trial-and-error: throw a bunch of functions at the puzzle, keep the ones that fit, and ask the model to tweak them. That’s clever engineering, but it doesn’t get us closer to machines that can actually reason. If someone came up with a single killer prompt tomorrow, this whole setup would be pointless. It’s more of a hack than a step toward real understanding.

