This is a sample of CapRelay's research. CapRelay provides similar research and more for every US-listed company.
Start free trialNVIDIA designs and sells AI computing infrastructure. Its core product is the GPU (graphics processing unit), a processor purpose-built for parallel computation that is now the foundational building block of AI training and inference workloads. NVIDIA's GPUs power the training of large AI models (like GPT and Gemini), the deployment of those models at scale (inference), and a growing array of reasoning and agentic AI applications. Beyond the GPU itself, NVIDIA sells full-stack AI infrastructure: complete rack-scale systems that combine GPUs, CPUs, networking interconnects, and software into an integrated AI computing platform.
NVIDIA's customers include:
NVIDIA sells to large data center customers directly and through OEMs, ODMs, and system integrators who build and deploy NVIDIA-based infrastructure.
NVIDIA reports two segments:
Compute & Networking (~90%+ of revenue) includes:
Graphics includes:
NVIDIA makes money primarily by selling AI computing hardware at a premium driven by performance leadership. The key dynamics:
Performance-per-watt as the core value proposition: Data centers are constrained by power. Since AI factories directly monetize token generation (the output of AI inference), a data center's revenue is a direct function of tokens produced per watt of electricity consumed. NVIDIA argues that each generation of its architecture delivers substantially higher performance per watt than its predecessors — and that this advantage translates directly into higher revenue for customers operating power-limited facilities. This is the primary reason customers pay NVIDIA's prices rather than seeking alternatives.
Full-stack pricing: NVIDIA does not just sell chips. A single GB200 NVLink 72 rack contains 1.2 million components, weighs ~two tons, and integrates GPUs, CPUs, networking switches, and NVIDIA's software stack. The rack-scale system architecture means NVIDIA captures value across compute, networking, and software within a single customer deployment. NVIDIA estimates it captures roughly $30B+ per gigawatt of AI data center capacity in the current Blackwell generation.
Annual product cadence: NVIDIA ships a new GPU architecture roughly annually (Hopper → Blackwell → Blackwell Ultra → Rubin). Each generation delivers significantly higher performance per watt, which improves customer economics and drives upgrade cycles. Customers plan multi-year CapEx cycles around NVIDIA's roadmap. The pace of innovation is itself a competitive moat: the rapid cadence makes it very difficult for competitors to close the performance gap before the next generation arrives.
Software ecosystem as a retention mechanism: NVIDIA's CUDA programming platform, first introduced in 2006, has over 7.5 million developers and supports more than 6,000 applications. Software written for CUDA runs on all NVIDIA GPUs, and NVIDIA continually improves software performance for older hardware. Management notes that A100 GPUs shipped six years ago are still running at full utilization today due to software improvements — a TCO argument that competitors without a comparable ecosystem cannot match.
Gross margins: NVIDIA targets non-GAAP gross margins in the mid-70% range as Blackwell ramps. Margins are temporarily pressured during new architecture ramps due to manufacturing complexity and cost of expediting. NVIDIA's fabless model (relying on TSMC for wafer production) means its cost structure is largely variable.
Capital allocation: NVIDIA maintains large inventory and purchase commitments to secure supply chain capacity, using its balance sheet to guarantee offtake to suppliers. It also makes strategic equity investments in key AI model companies (OpenAI, Anthropic, xAI, Mistral) to deepen technical partnerships and expand the CUDA ecosystem.
The AI accelerator market is currently a de facto oligopoly with NVIDIA holding a dominant position.
Key competitors:
Why customers choose NVIDIA:
Barriers to entry are high but not insurmountable at the chip level. The harder challenge for competitors is matching the CUDA software ecosystem, which represents 20+ years of developer investment. Jensen Huang's argument — that NVIDIA runs every AI model, is available in every cloud, and can handle every phase of AI workload — is the clearest articulation of the platform's structural advantage. That said, hyperscalers' custom ASIC programs are a genuine long-term competitive threat, particularly for inference workloads where model architectures are more stable.
China market: U.S. export controls have effectively closed NVIDIA's data center compute business in China, which NVIDIA estimates as a ~$50B annual market. Huawei's Ascend chips are the primary beneficiary. NVIDIA estimates the lost China opportunity is a material and ongoing competitive harm, as Chinese customers building on Huawei's platform contribute to an alternative ecosystem that could challenge NVIDIA globally.
NVIDIA articulates its growth opportunity around three simultaneous platform transitions:
CPU to GPU accelerated computing: Much of the world's existing cloud computing still runs on CPUs. As Moore's Law slows, NVIDIA argues that transitioning these CPU workloads (data analytics, simulation, classical ML) to GPU accelerated computing is both inevitable and ongoing — and these workloads happen to run on NVIDIA's existing infrastructure.
Classical ML to generative AI: Hyperscalers' core revenue-generating workloads — ad recommendation systems, search ranking, content moderation — are migrating from classical ML to generative AI. This transition is still in progress and requires substantially more compute per workload.
Generative AI to agentic and physical AI: Agentic AI (AI systems that reason, plan, and take multi-step actions) requires orders of magnitude more inference compute than one-shot generative AI. Physical AI (robotics, autonomous vehicles, industrial automation) requires training in simulated environments (Omniverse/Cosmos) and dedicated in-device computing (Jetson/DRIVE platforms). NVIDIA sees these as multi-trillion-dollar long-term demand drivers.
Management estimates total AI infrastructure investment will reach $3–4T by the end of the decade, up from ~$600B in annualized CapEx among just the top four hyperscalers today.
Sovereign AI is an emerging growth vector — countries building national AI infrastructure — with NVIDIA tracking over $20B in sovereign AI revenue in FY2026, more than double the prior year.
Product roadmap: NVIDIA is on an annual cadence — Blackwell (FY2025–2026), Blackwell Ultra (FY2026 H2), Rubin (FY2027). Each generation targets a roughly 10x reduction in cost per token relative to the prior generation, which continuously expands the economically viable market for AI inference.