AI of the Week · Analysis

Computing power is becoming the new currency in the AI race

We are in a phase where technical nuances are beginning to blur in favor of raw power. As the giants move toward massive infrastructure and inference-time computation, the rules of the game for how we build future systems are changing.

By Daniel Merthen · Halland · Published 2026-06-12 · 2 min read

AI Infrastructure Tech Analysis

Where architecture meets raw power

There is a certain relief in seeing the big players start to speak the same language, even if that language is about massive amounts of hardware. When Google steps in and guarantees payments for Anthropic's massive chip deal, it isn't just a nice technical symbiosis we are witnessing. It is an acknowledgment that the current race requires an infrastructure so extensive that no single actor can bear the weight alone. We see a consolidation where the boundaries between competitors blur in the pursuit of the raw power required to drive next-generation models.

At the same time, we see launches like Claude Fable and Gemini 3.5 Live Translate, which indicate a maturity in use cases. But behind these impressive features lies a more fundamentally changed logic for how models are optimized. We are moving away from simply hoping that better architecture will solve the problems, toward systematically injecting more computation into the final stage of the process.

Performance is now a function of inference-time computation.
Daniel Merthen

Inference-time computation as the new truth

This is the most critical insight for those of us who actually build systems today. When we see that GPT-5.5 shows only marginal improvements compared to its predecessor at max-compute calculations, we understand that we have reached a point where intelligent design meets its physical limit. To achieve the next step in intelligence, more time and more power are now required during inference or fine-tuning. It is no longer just a matter of smarter code; it's about daring to spend computing power where it provides the greatest benefit for the end user.

For those of us building solutions, this represents a paradigm shift. We can no longer just optimize for efficiency in an isolated bubble. We must start thinking about how we allocate computing resources strategically across the entire chain. If performance is a function of inference time, then our ability to manage and prioritize these resources becomes the deciding factor in the future.

We have left the early experimental phase behind us. Now it's about mastering the economics of computation to actually scale real intelligence.

Mon · Wed · Fri

Polaris in your inbox —
Monday, Wednesday, Friday — always free.

The most important things in AI, filtered for Swedish entrepreneurs. No noise. Just what actually matters.

Free forever · cancel anytime

Where architecture meets raw power

Inference-time computation as the new truth

Polaris in your inbox — Monday, Wednesday, Friday — always free.

Polaris in your inbox —
Monday, Wednesday, Friday — always free.