NVIDIA - Risks Overview

Hyperscaler Custom ASIC Threat to GPU Dominance

NVIDIA's dominant position in AI infrastructure faces a structural long-term threat from hyperscalers developing custom ASICs for internal workloads:

Google (TPU), Amazon (Trainium/Inferentia), Microsoft (Maia), and Meta are all investing heavily in internally designed chips optimized for specific AI workloads, particularly inference.
Broadcom and Marvell supply ASIC design capabilities to these hyperscalers, and Broadcom has signaled strong growth expectations for its AI ASIC business.
Hyperscalers collectively represent slightly over 50% of NVIDIA's data center revenue. Even partial substitution of NVIDIA GPUs with in-house ASICs on stable, predictable inference workloads would materially affect NVIDIA's revenue.
NVIDIA's counterargument is that its full-stack platform handles every phase of AI — training, post-training, and inference — on a unified architecture, while ASICs are optimized for narrow tasks and become less useful as model architectures evolve rapidly. NVLink Fusion, which allows custom ASICs to connect to NVIDIA's platform, can be read as both a sign of confidence and a hedge.
The key risk is not that hyperscalers abandon NVIDIA wholesale, but that they gradually shift an increasing portion of inference workloads to ASICs, reducing NVIDIA's share of total AI compute spend even as that total spend grows.

Sustainability of Demand at Current Investment Levels

The pace of AI infrastructure investment raises questions about whether build-out is outrunning near-term monetization:

Analyst forecasts cited by NVIDIA suggest hyperscaler CapEx will approach $1T in 2027, nearly doubling from current levels. NVIDIA has visibility to $1T in Blackwell and Rubin revenue through the end of calendar 2027.
The bear case is that tokens are currently being subsidized — deployed at below-cost prices to drive adoption — and that the cadence of CapEx may pause or slow if model monetization and enterprise AI adoption do not keep pace with infrastructure investment.
NVIDIA's response is that compute equals revenue in the AI era, and that agentic AI inference demand has inflected. As evidence, NVIDIA notes that H100 and A100 cloud rental pricing has risen rather than fallen (H100 pricing up 20% YTD in Q1 FY27), suggesting demand genuinely exceeds supply rather than reflecting speculative overbuilding.
However, the historical precedent of cloud spending optimization cycles is real — AWS grew 29% in FY22 and slowed to 13% in FY23 as enterprises cut IT budgets. A similar, even temporary, pause in AI infrastructure CapEx would have a disproportionate impact on NVIDIA given the concentration of its revenue base.

China Market Foreclosure and Competitive Ecosystem Risk

The permanent closure of the China market is more than a revenue loss — it risks creating a parallel AI ecosystem that challenges NVIDIA globally:

China represents a market NVIDIA estimates at nearly $50B annually for data center compute. Prior to the H20 ban, H20 revenue was running at $7–8B per quarter.
The Q1 FY26 H20 export ban triggered a $4.5B inventory and purchase obligation charge, and NVIDIA estimates it lost $8B of H20 revenue in Q2 FY26.
Huawei's Ascend chips, bolstered by recent IPOs and domestic scale, are filling the void and building a Chinese developer ecosystem. NVIDIA explicitly acknowledges that the export controls "helped our competitors build larger developer and customer ecosystems to challenge us worldwide."
China has approximately 50% of the world's AI researchers. If a generation of Chinese AI developers builds on Huawei's stack rather than CUDA, the resulting models, frameworks, and ecosystem tools may be natively incompatible with NVIDIA infrastructure — weakening CUDA's claim to universality over time.
A new complication: China's antitrust authority issued a preliminary finding on September 15, 2025, that NVIDIA's compliance with U.S. export controls "discriminated unfairly against customers in China," potentially leading to financial penalties or operational restrictions.
H200 licenses have been approved but no revenue has been recognized, and NVIDIA cannot determine whether any imports will be allowed. There is no near-term replacement product.

Gross Margin Pressure from Escalating Input Costs and Architecture Complexity

NVIDIA has guided to sustain mid-70s gross margins long-term, but structural pressures are building:

Non-GAAP gross margins fell from 75.0% in FY25 to 71.1% in FY26, primarily due to the $4.5B H20 charge and Blackwell ramp complexity. Excluding the H20 charge, Q4 FY26 recovered to 75.2%.
Each new architecture generation — particularly the rack-scale NVLink 72 and NVLink 144 systems — involves dramatically more components, manufacturing complexity, and supply chain coordination. Each GB200 NVL72 rack has 1.2 million components across 350 manufacturing sites.
Management acknowledged in Q3 FY26 that "input costs are on the rise" and that the company is "working to hold gross margins in the mid-seventies." This language signals ongoing cost pressure that requires active mitigation.
Memory costs, yield management on leading-edge silicon, and supply chain premiums for expedited manufacturing all weigh on cost of goods. NVIDIA's fabless model passes TSMC cost inflation directly through to its margins.
The annual product cadence that is central to NVIDIA's competitive moat also means the company is perpetually in a new architecture ramp, a period when margins are structurally lower due to manufacturing complexity and expediting costs.

Return on Ecosystem Investments

NVIDIA is deploying $17.5B in FY26 alone in equity investments in AI model companies and infrastructure funds, creating new financial and strategic risks:

Investments include stakes in OpenAI, Anthropic, xAI, and Mistral, plus $3.5B in land, power, and shell guarantees to early-stage companies.
A new risk disclosure notes that NVIDIA has been asked to provide customer financing for data center build-outs — arrangements that could introduce counterparty and credit risk at significant scale.
The rationale is ecosystem expansion: every model that runs on NVIDIA ensures developer lock-in to CUDA. But the investments also represent a concentration in early-stage AI companies with uncertain long-term economics.
The Groq IP licensing deal required "significant, nonrefundable payments" with an uncertain return, illustrating the execution risk in technology bets that may not pan out.
As AI model companies like OpenAI and Anthropic scale, they are becoming NVIDIA's largest customers while also being NVIDIA portfolio companies — creating a conflict-of-interest dynamic if favorable commercial terms are offered to investees.

Customer Concentration Risk

Revenue concentration has increased to levels that amplify any customer-specific disruption:

In FY26, one direct customer accounted for 22% of total revenue and another represented 14%, compared to FY25 when the top three customers each represented 11–12%.
This concentration reflects the dominance of hyperscalers, but it means that a shift in buying behavior at even one of NVIDIA's top customers — whether due to ASIC substitution, spending pauses, or competitive dynamics — would have an outsized impact on results.
Total supply commitments reached $145B in Q1 FY27, secured against customer demand forecasts. If large customers revise their build plans downward, NVIDIA could face inventory charges similar to the $4.5B H20 write-down in Q1 FY26 or the $2.17B inventory provisions in FY23.

Gaming Segment Vulnerability

Gaming, at roughly 7–8% of FY26 revenue, faces structural headwinds that limit its ability to contribute meaningfully to growth:

Management guided in Q4 FY26 for supply constraints to be a headwind to Gaming in Q1 FY27 and beyond, driven by competition for manufacturing capacity between gaming GPUs and the higher-priority data center business.
The RTX 50 Series drove 41% revenue growth in FY26 to $16B, but the launch tailwind is now largely in the base. The next major upgrade cycle is not imminent.
Tariffs on imported goods and higher memory and system prices have already weighed on consumer demand, causing a modest sequential decline in Q1 FY27 edge computing revenue.
The gaming GPU market is cyclical and showed a 27% revenue decline in FY23 when channel inventory built up. As the installed base of Blackwell-era RTX 50 Series cards grows, upgrade incentives for the next generation will diminish unless NVIDIA can offer another step-change in performance.