In detail
- Chip name: Jalapeño; purpose-built ASIC for inference workloads
- Developed in partnership with Broadcom; testing underway
- Early results claim substantially better performance‑per‑watt than current state‑of‑the‑art
- Targeted at real‑time models (ChatGPT/Codex); heavy pre‑training likely to remain on Nvidia GPUs
Why it matters
A purpose‑built inference chip can reduce OpenAI’s reliance on Nvidia and lower per‑request operating costs, affecting pricing and scalability for real‑time AI services.
For you Assess whether your cloud/AI vendor mix will change when OpenAI deploys Jalapeño; update TCO models and vendor contracts for potential shifts in inference pricing.