OpenAI unveils Jalapeño, its first inference ASIC built with Broadcom

In detail

Chip name: Jalapeño; purpose-built ASIC for inference workloads
Developed in partnership with Broadcom; testing underway
Early results claim substantially better performance‑per‑watt than current state‑of‑the‑art
Targeted at real‑time models (ChatGPT/Codex); heavy pre‑training likely to remain on Nvidia GPUs

Why it matters

A purpose‑built inference chip can reduce OpenAI’s reliance on Nvidia and lower per‑request operating costs, affecting pricing and scalability for real‑time AI services.

For you Assess whether your cloud/AI vendor mix will change when OpenAI deploys Jalapeño; update TCO models and vendor contracts for potential shifts in inference pricing.

Sources

TechCrunch
The Verge