I mapped how language models decide when a pile of sand becomes a “heap”

Posted by Specialist_Bad_4465@reddit | LocalLLaMA | View on Reddit | 37 comments

I mapped how language models decide when a pile of sand becomes a “heap”

This chart compares how three open-weight language models decide when a pile of sand becomes a “heap.”

What each line shows:

The shaded band between 0.4 and 0.6 highlights the “borderline” region where the models are most uncertain about heapness.

All three curves come from the same basic setup:
I give the model a few examples (1–2 grains → “No”, 999,999–1,000,000 grains → “Yes”), then ask for many different values of n:

“There is a pile of n grains of sand. Is this a heap? Answer yes or no.”

For each n, I plot the softmax probability on the “Yes” token.

Full writeup with more charts and prompt details is here