Mapping True Coding Efficiency (Coding Index vs. Compute Proxy)

Posted by NewtMurky@reddit | LocalLLaMA | View on Reddit | 10 comments

TPS (Tokens Per Second) is a misleading metric for speed. A model can be "fast" but use 5x more reasoning tokens to solve a bug, making it slower to reach a final answer.

I mapped ArtificialAnalysis.ai data to find the "Efficiency Frontier"—models that deliver the highest coding intelligence for the least "Compute Proxy" (Active Params × Tokens).

The Data:

Key Takeaways: