The unreasonable effectiveness of stochastic parrots

The year is 2040 and the ARC Prize make the following exciting announcement: "We are pleased to announce ARC-AGI 16, the next evolution of the ARC Prize. Current frontier models score 1.2% on this benchmark where humans easily score 100%"....

Triviality tests and the sycophancy problem

Maybe you're familiar with the concept of a triviality test in political debate. It's really simple - it involves asking yourself the question: Would anyone disagree with this? Think about a mayor claiming that "I want to make this city great" -...

Software 2.5, Software 3.0 and Software 3.5

A couple of weeks ago Andrej Karpathy generated a ton of buzz with a talk where he presented the idea of Software 3.0 - describing LLMs as the next revolution of computers and programming itself - the transition from programming language-driven...

Thinking? What thinking?

Do machines think? The Large Language Model (LLM) revolution in AI has shifted the debate on whether machines can even think from what it used to be. 10 years ago the difference in capability between human beings and AI were extremely visible -...