Google builds the TPU (2015)
In May 2016, Google revealed it had quietly built its own AI chip, the Tensor Processing Unit, to run its machine-learning workloads. It had been running the chips in its data centers for more than a year before saying so. The goal was the same as OpenAI's: serve models faster and cheaper than off-the-shelf hardware.
Google cut its cost of running AI services and reduced its dependence on outside chip suppliers.
The TPU became a core part of Google Cloud and proved a big AI company could design competitive silicon in-house.
Jalapeño follows the same playbook a decade later. The TPU's history suggests that custom inference chips can pay off, and that the hard part is volume manufacturing, not the first design.
