Updated April 20, 2026
8 entries
Field reports, market commentary, and the occasional announcement from the team building the aggregation layer for AI compute. We write when we have something honest to say.
Why we started GPU.ai, what we believe about the compute layer, and the world we're trying to build toward — by the founders.
A strategic partnership giving GPU.ai customers first-look access to NovaCore's Hyderabad Blackwell cluster — and giving NovaCore tenants a single API into U.S. East Coast capacity. One compute layer. Two continents.
Teams that optimize their GPU clusters for training often find them poorly suited for inference — and vice versa. Here's why the hardware requirements diverge, and how an aggregated supply model lets you spec each workload correctly without buying twice.
Price per GPU-hour is table stakes. The seven things that actually determine whether a provider will work for your workload — and why we built our pricing engine to expose every one of them.
H100 lead times have collapsed from 52 weeks to under 8. The shortage narrative is outdated — what's replacing it is a real, liquid, price-discovered market for AI compute. That's a much bigger deal.
DeepSeek V3 trained for $5.6M. R1 added another $294K. When frontier-quality models cost single-digit millions and ship under open weights, the real moat moves from the model to the inference fleet — and that fleet has to be elastic.
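To make the shift concrete, run the arithmetic. A minimal sketch in Python: the training figures are the reported ones above, while the fleet size and blended hourly rate are our own illustrative assumptions, not anyone's actual bill.

```python
# Back-of-envelope: once a frontier-quality model costs single-digit
# millions to train, the serving fleet dominates total spend.
# Fleet size and hourly rate below are illustrative assumptions.

TRAIN_COST_V3 = 5.6e6      # DeepSeek V3 reported training cost (USD)
TRAIN_COST_R1 = 0.294e6    # R1 reported additional cost (USD)

FLEET_GPUS = 1_000         # assumed serving fleet size
PRICE_PER_GPU_HOUR = 2.00  # assumed blended $/GPU-hour
HOURS_PER_YEAR = 8_760

training_total = TRAIN_COST_V3 + TRAIN_COST_R1
serving_annual = FLEET_GPUS * PRICE_PER_GPU_HOUR * HOURS_PER_YEAR

print(f"one-time training: ${training_total/1e6:.1f}M")
print(f"annual serving:    ${serving_annual/1e6:.1f}M")
print(f"serving/training:  {serving_annual/training_total:.1f}x per year")
```

Under these assumptions the serving fleet burns roughly three times the entire training budget every year, which is why elasticity on the inference side matters more than shaving the training bill.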
1.5x the FP4 compute of B200. 288GB HBM3e per GPU. 8TB/s memory bandwidth. The headline specs are real — here's what they mean for your training run, your serving fleet, and your next quarter's GPU bill.
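One way to read the bandwidth number: for memory-bound decode, tokens per second per stream can't exceed HBM bandwidth divided by the bytes streamed per token. A rough roofline sketch, assuming a hypothetical 70B-parameter model served at FP8; it ignores KV-cache traffic and kernel efficiency, so treat it as a ceiling, not a forecast.

```python
# Roofline upper bound for single-stream decode on one GPU:
# each generated token must stream every weight byte through HBM,
# so tokens/s <= bandwidth / model_bytes. Ignores KV-cache traffic
# and kernel efficiency; a ceiling, not a forecast.
# Model size and precision are illustrative assumptions.

HBM_BANDWIDTH_TBS = 8.0   # 8 TB/s per GPU (headline spec)
HBM_CAPACITY_GB = 288     # 288 GB HBM3e per GPU (headline spec)

PARAMS_B = 70             # assumed 70B-parameter model
BYTES_PER_PARAM = 1       # assumed FP8 weights

model_gb = PARAMS_B * BYTES_PER_PARAM
assert model_gb <= HBM_CAPACITY_GB, "model must fit in one GPU's HBM"

ceiling_tok_s = (HBM_BANDWIDTH_TBS * 1000) / model_gb
print(f"model weights:  {model_gb} GB (fits in {HBM_CAPACITY_GB} GB)")
print(f"decode ceiling: ~{ceiling_tok_s:.0f} tokens/s per stream")
```

The capacity number matters for the same reason: 288GB means models that used to demand tensor parallelism across two or four GPUs can now live on one, and every interconnect hop you remove pushes real throughput closer to that ceiling.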
Virtualization overhead costs you 10–15% of your GPU compute. At supercluster scale, that's millions in wasted spend. We surface bare metal and virtualized capacity side-by-side so you can pick the right one for each workload.
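The dollar figure follows directly from the overhead percentage. A quick sketch with an assumed fleet size and blended rate; plug in your own numbers.

```python
# What a 10-15% virtualization tax costs at supercluster scale.
# Fleet size and hourly rate are illustrative assumptions.

FLEET_GPUS = 10_000
PRICE_PER_GPU_HOUR = 2.00  # assumed blended $/GPU-hour
HOURS_PER_YEAR = 8_760

annual_spend = FLEET_GPUS * PRICE_PER_GPU_HOUR * HOURS_PER_YEAR
for overhead in (0.10, 0.15):
    wasted = annual_spend * overhead
    print(f"{overhead:.0%} overhead on a ${annual_spend/1e6:.0f}M/yr "
          f"fleet = ${wasted/1e6:.1f}M wasted")
```

At these assumed rates the tax lands between $17.5M and $26.3M a year, which is the gap bare metal closes when the workload doesn't need a hypervisor.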
What's next
Per-second billing. Sixty seconds from CLI to SSH.