Based on my time at Meta, I think custom chips are the way to go for making Magic Hour's video tools cheaper for creators. We've hit growth walls because of GPU shortages before. These new chips could let us lower prices, but our software is what will actually make us special long-term. So watch the silicon trends, but invest just as much in the creative tools that nobody else has.
In our health-tech work, we run live diagnostics for patients, and speed is everything. We're watching big tech companies build their own chips because they handle these real-time models faster than traditional GPUs. Nvidia should partner with us to design chips specifically for this kind of inference work, or they risk losing the precision health market to custom solutions.
There is plenty of growth left for Nvidia through innovation and a robust software ecosystem. Its CUDA platform and AI tools are a hard act for competitors to follow. Nvidia's GPUs are optimized for both training and inference, and demand for AI inference is only going to increase. The company is branching into new markets like automotive, health care, and edge computing, which should shrink its dependence on hyperscalers. To remain competitive, the company should continue building better technology, developing strong partnerships, and responding to customer needs. These moves can help it keep growing as the competition heats up.
Nvidia's valuation and performance continue to grow strongly, but this growth story could prove hard to maintain as hyperscalers build their own custom silicon and inference workloads eclipse training workloads in total AI compute. Nvidia's dominance in training has rested largely on three pillars: a ubiquitous CUDA ecosystem, the software moat that surrounds it, and simply having the best-performing GPUs for AI workloads. Cloud-native chips cannot easily emulate the first two, and hyperscalers are increasingly building in-house chips to bridge the performance gap. As inference accounts for a greater proportion of AI compute overall, customers will likely demand more cost-effective, energy-optimised hardware suited to persistent deployment rather than big model training. Nvidia will need to adjust its product mix to reflect these changes, with inference-optimised GPUs and edge AI products, as well as an integrated software stack that increases customer switching costs. But that's not the whole story. Nvidia's long-term durability will come from control of the platform rather than the raw performance of the silicon itself. The company's software stack, from TensorRT to DGX Cloud, provides a recurring revenue stream and a level of interoperability that a custom chip can't offer. It also helps that Nvidia is opening up new verticals like automotive AI, robotics, and digital twins, which let it sidestep some of the hyperscaler cycle while entering industries that are less likely to build custom silicon. Over time, the company will have to pivot from the "enabler" of AI to the "infrastructure backbone" of intelligent systems. Nvidia's combined differentiation across compute, software, and ecosystem stickiness should compound its existing dominance, even as the product/market balance shifts toward inference.
Nvidia has long held a clear lead in the gaming industry, without a doubt one of its major strongholds. But as data centers and hyperscalers invest more and more in custom silicon for HPC and AI workloads, demand for traditional GPUs may dwindle. As a result, Nvidia has been building its presence in the inference market, offering both hardware and software solutions that allow large data sets to be processed in real time. This diversifies its sources of revenue and positions it well for when inference becomes more widespread across industries.
The bigger Nvidia becomes, the more headwinds it will face from the growing share of custom silicon that hyperscalers are designing and from the shift from training to inference. To lessen their dependency on Nvidia, and citing cost and control advantages, companies including Google, Amazon, and Microsoft are investing in proprietary chips (TPUs, Inferentia, Maia AI Accelerators, etc.). These custom solutions are tuned for specific workloads, particularly inference, which is more cost-sensitive and benefits more from energy-efficient ASICs. Nvidia's response is to build out its ecosystem (CUDA, NVLink networking, etc.) and to develop semi-custom AI systems that keep leading designers on as partners. Meanwhile, AMD and Broadcom have been leveraging low-cost alternatives and custom solutions to try to take market share away from Nvidia.
We only have to look at Nvidia's 20% slide on Wall Street in January to understand how vulnerable the stock is to new competitors. While NVDA's blip at the beginning of the year can be attributed to the arrival of China's DeepSeek large language model, new pressures are likely to emerge closer to home in the months and years ahead. It's reasonable to expect Nvidia's market share in chips to erode over time, but the company is already actively preparing for this scenario, creating integrated hardware and software ecosystems to cultivate a long-serving network of clients. Major cloud providers like Google, Amazon, and Microsoft are investing billions of dollars in developing their own custom chips, or application-specific integrated circuits (ASICs), for their workloads. These are set to compete directly with Nvidia's general-purpose GPUs, since their ability to target specific workloads makes them well suited to inference. The key issue for Nvidia's longevity is that once AI models are trained on the firm's GPUs, they can be shifted to inference rather than requiring more training. This potentially creates a new market where custom ASICs offer a low-cost alternative to Nvidia's products.
Whether Nvidia can continue to grow as hyperscalers create custom silicon and inference pulls ahead of training will be a test of its innovative mettle. Although hyperscalers such as Google and Amazon are creating custom chips to reduce their reliance on third-party GPUs, Nvidia remains competitive in AI thanks to its hardware, its software ecosystem, including CUDA, and its end-to-end approach, as stated by Neumann. With inference workloads increasing, Nvidia has been betting on power-efficient GPUs and AI-tuned hardware such as the H100 to capture this demand. Keeping that lead means continually innovating, keeping prices competitive as hardware commoditizes, and meeting the challenge of the move away from general-purpose chips toward custom silicon.
NVIDIA's future depends on whether it can adapt its hardware to emerging materials and shifting compute demands. As hyperscalers ramp up custom silicon and inference begins to outpace training, the company's moat will depend less on CUDA dominance and more on efficiency, thermals, and cost per watt. This is where next-generation materials like graphene could redefine the playing field. Companies such as Avadain are developing graphene-based shielding and conductive layers that can dramatically improve heat dissipation and electrical efficiency—both critical bottlenecks for data centers and GPUs. If NVIDIA embraces these future technologies and integrates advanced materials into its architecture, it can sustain and even accelerate growth. But if it clings to legacy designs while others innovate at the substrate level, it risks losing the very edge that made it dominant. The next frontier isn't just AI—it's the physics that enable AI to scale sustainably. —Pouyan Golshani, MD | Interventional Radiologist & Founder, GigHz and Guide.MD | https://gighz.com
While hyperscalers chase efficiency through custom silicon, Nvidia's real power lies in how fast it adapts its hardware to new workloads. The company treats chip architecture like a living organism, evolving it every few quarters to meet new AI demands. That speed, paired with its massive developer base, gives it an edge that custom silicon rarely achieves. As inference scales, Nvidia can use its flexible architecture and software to stay embedded in every stage of the AI pipeline — from data prep to edge deployment. Growth will depend less on exclusivity and more on ubiquity. Nvidia's play is not to outbuild the hyperscalers, but to make itself indispensable to them.
Nvidia's growth may soon depend less on silicon and more on the knowledge its chips have helped create. Every model trained on Nvidia hardware feeds a global feedback loop of optimization data — insight into architectures, energy efficiency, and performance scaling. That accumulated intelligence gives Nvidia a head start in designing chips that already know what tomorrow's workloads will look like. Hyperscalers can copy hardware designs, but they can't replicate years of learning from millions of AI experiments. Nvidia can turn that knowledge into predictive engineering, where each new generation of GPU anticipates where computing is headed before the market fully shifts. That foresight could be its most valuable asset.