HardwareBusinessModels

OpenAI and Broadcom unveil Jalapeño—custom chip for LLM inference

OpenAI and Broadcom have developed a new ASIC called Jalapeño, designed specifically to optimize large language model inference in data centers.

In detail

  • The chip was built from scratch for LLM inference based on insights from OpenAI's roadmap; development took nine months.
  • OpenAI claims substantially better performance per watt than current state-of-the-art systems; detailed technical report coming in the coming months.
  • Both companies plan to deploy Jalapeño chips in data centers by end of 2026—part of OpenAI's strategy to control the full stack and reduce Nvidia dependence.

Why it matters

This signals that specialized chips for AI inference are becoming standard—for enterprises with large inference workloads, this could lower costs and latency, but also create new dependencies.

For you Monitor Jalapeño's availability and performance; if your inference costs are a pain point, specialized chips may soon offer an alternative to GPU clusters.

← All news

Summaries are generated automatically and link to the original source.