Broadcom's fiscal Q2 earnings release on June 3 will land a number that looks surprising only if you have not been paying attention: $10.7 billion in AI chip revenue for a single quarter, up approximately 140 percent year-over-year. The figure is not new, CEO Hock Tan pre-guided it in March, but the print will be the first hard validation that the shift from general-purpose GPUs to purpose-built ASICs (application-specific integrated circuits, chips designed for a single task) for AI inference is not a thesis. It is the current market structure. Broadcom's total fiscal Q2 revenue is guided at $22 billion, meaning AI chips alone represent nearly half of the company's quarterly output. Six months ago that would have seemed impossible. Now it reads as inevitable.
The momentum is not Broadcom's secret. Research firm TrendForce estimates total AI server shipment growth will exceed 28 percent in 2026, with ASIC share rising to approximately 27.8 percent and outpacing GPU growth. That shift in market composition is not a rounding error. It is a structural reordering of the compute stack. Broadcom Q1 2026 already showed the shape: the company reported AI revenue of $8.4 billion, a 106 percent year-over-year jump, while overall revenue grew 29 percent. The AI business is growing three times faster than the core, and the gap is widening. Each quarter Broadcom guides assumes hyperscalers pull forward more ASIC orders because they discovered the hard way that GPUs designed for general compute, Nvidia's architectural bet since 2012, are economically obsolete for the specific problem of running inference at scale. A Tesla Model 3 is a fine car. If all you need is to haul concrete, you buy a truck.
What makes this transition structural rather than cyclical is the economics. Running inference on an LLM costs money every time a token is generated. A GPU, optimized for dense parallel computation across arbitrary workloads, spends transistors and power on capabilities inference does not need. An ASIC, wired for matrix multiplication and low-precision arithmetic, uses every transistor and watt for the one thing that matters: throughput per dollar. Hyperscalers, OpenAI, Google, Meta, Microsoft, do not care about generalism. They care about per-token cost. Once a supplier proves an ASIC can match or beat a GPU on inference throughput at 30-40 percent lower total cost of ownership, the GPU becomes the fallback choice, not the default. Broadcom's $10.7 billion quarter is Broadcom printing the economic proof. The quarter before was $8.4 billion. The trajectory says hyperscalers have already made the decision; Broadcom is just fulfilling orders.
Nvidia's datacenter business, the GPU division that printed $115.2 billion in fiscal 2025 revenue, now faces a two-speed market. Training large models still demands the generality and software maturity of GPUs. Inference, which consumes 70-80 percent of a deployed LLM's compute resources over its lifetime, is migrating to ASICs. That split was not true in 2023. It is hardening in 2026. Broadcom's rise is not about outperforming Nvidia at the same task. It is about winning the task that matters most. Nvidia's response, accelerating its own ASIC efforts and tightening GPU software lock-in through CUDA adoption, is rational and too late. ASICs do not require software portability. They require one thing: customer commitment. Hyperscalers have already made that commitment.
Two signals will confirm whether this inflection holds through the second half of 2026. First, Broadcom's actual print on June 3, watch for the gross margin on AI chips versus the company-wide average (currently approximately 77 percent). ASIC margins are typically lower than GPU margins because ASICs face competition from other ASIC makers and from internal hyperscaler design teams. If Broadcom's AI gross margin compresses below 45 percent, it signals price pressure from rivals like Google's TPU (tensor processing unit, custom silicon for machine learning) and hyperscaler in-house designs. Second, watch customer concentration: does Broadcom report increasing or decreasing customer diversification in AI revenue? If the top three customers account for more than 60 percent of AI revenue, hyperscalers own the supply chain, not Broadcom. The stock price will reflect the difference.
