|

Your Cloud Desktop Is Running on Yesterday’s Silicon

8 min read

A LinkedIn post last week sparked a question I have been thinking about for a while: are hyperscaler price increases really imminent, and what does that mean for virtual desktops and cloud-delivered EUC? The short answer is yes, there will be a substantial impact, and the reason is baked into the economics of server hardware depreciation. To understand why, you need to know exactly which CPUs run your cloud desktops, when those chips were made, and what the lifecycle math looks like.

Let me pull back the curtain.

The Top EUC Instances Across the Big Three

For Azure, AWS, and GCP, there are three instance families each that dominate real-world EUC deployments covering everything from Windows 365 and AVD knowledge workers through to GPU-accelerated creative workstations. Here is what is actually running under the hood.

Cloud instance comparison table showing EUC instance families across Microsoft Azure, Amazon AWS and Google Cloud with CPU generations and age badges

Notice anything? With one or two exceptions, the CPUs powering the most common EUC instances across all three hyperscalers were designed and manufactured between 2019 and 2021. Some of the most widely used GPU instances, AWS G4dn and G5, Azure NVv4, are running on the same Cascade Lake and EPYC Rome chips that arrived when on-premises VDI was still the default conversation.

How Old Is Old? A CPU Timeline

To understand why this matters, let me be precise about when these processor generations came to market, when they landed in cloud instances, and where they sit in the depreciation cycle today.

Timeline showing CPU generations from Intel Cascade Lake and AMD EPYC Rome in 2019 through price increases in 2025-2026, with colored badges per cloud provider

The Depreciation Math

Here is where the hyperscaler business model becomes legible. Servers are typically depreciated over five to seven years on a straight-line basis. A rack of Cascade Lake servers purchased in 2019 for say $30,000 per unit would be fully written off by 2024–2026. After that point, the hardware runs at near-zero book value. Every dollar of compute revenue from that rack is almost pure margin.

CPU depreciation table showing Intel and AMD generations from 2019-2022 with depreciation status and margin phase

The pattern is stark. Cascade Lake and Rome, the CPUs used in the most common EUC instances, are at or beyond full depreciation. Hyperscalers have been running those boxes as pure profit machines for one to two years already. And they have not dropped the price to reflect the declining book value of the underlying hardware.

Analysis from 2022 to 2024 shows that cloud prices rose faster than inflation, while what customers receive for their money remained unchanged. That is not an accident. It is the model.

What happens next is the squeeze. When hyperscalers are finally forced to refresh because older silicon starts failing, power draw becomes untenable, or customers demand newer hardware, they will be deploying far more expensive AMD Genoa/Turin or Intel Sapphire Rapids/Emerald Rapids servers. The hardware cost resets upward. And those costs get passed to you.

The Performance Reality

There is a second dimension here that does not get discussed enough: not all cloud CPUs are equal, and the newest-generation silicon rarely lands in EUC-optimized instance families first. It goes to AI, HPC, and high-margin compute workloads.

GO-EUC benchmarking has documented this clearly for Azure. The D-series v6 (Intel Sapphire Rapids/AMD EPYC Genoa) delivers 26% better EUC performance than v5, but v5 remains the default recommendation for AVD and Windows 365 deployments due to familiarity and pricing. Most organizations are running their users on v5 or even v4 instances without realizing the performance gap they are leaving on the table.

For GPU-accelerated desktops, the situation is even more pronounced. The NVIDIA A10 found in Azure NVadsA10 v5 and AWS G5 is a 2021-era GPU. Consumer GPU cards today are considerably faster. Yet you will pay a significant EUC premium for what, in silicon terms, is a four-year-old graphics card.

Bar chart showing OBUX EUC performance scores from Broadwell 2016 through Sapphire Rapids 2023
Pull quote: The business model is simple — hyperscalers make their real money in years five and six of a server life

The Hidden Cost: What Slow Silicon Does to Knowledge Workers

A fair challenge to everything above is: Does CPU generation actually affect knowledge workers? Unless you are rendering video or running finite element analysis, does a 2019 Cascade Lake versus a 2023 Sapphire Rapids chip really change your day?

The honest answer is: not in the way most people assume, but yes, in a way that is harder to measure and arguably more damaging.

The common mental model is linear: slow computer equals wasted minutes, wasted minutes equal lost salary dollars. That model is too simple. Knowledge workers do not bank saved seconds into extra deliverables. A 2-second faster app launch does not produce two extra seconds of output. 

But that is not the real cost. The real cost is cognitive interruption. Research by Gloria Mark at UC Irvine found it takes an average of around 23 minutes to return to deep focus after a significant interruption. A sluggish VDI session on older silicon does not usually cause a single big interruption; it creates a persistent, low-grade friction: a slight lag when switching applications, a stutter when scrolling through a large spreadsheet, a visible rendering delay during a Teams screen share. None of these individually breaks your flow. Cumulatively, across a workday, they keep pulling your attention back to the machine and away from the problem you were solving. The cost is not minutes lost; it is the depth of thinking degraded. GO-EUC’s Azure CPU Showdown (June 2025) measured application response time improvements of 26% from v5 to v6 instances, and approximately 41% from Cascade Lake to Sapphire Rapids, using the OBUX methodology, which specifically captures the time between user input and application response, not boot times or synthetic benchmarks, but the interactive feel of the session. That gap does not vanish because users are knowledge workers. It shows up as the difference between a session that feels native and one that feels like a remote desktop.

If you want a ceiling-case number: if a conservative 15% of an 8-hour knowledge worker day involves moments of active CPU-bound wait application renders, file opens, browser tab loads, spreadsheet recalculations, a 40% processing penalty on that fraction costs roughly 27 minutes per day per user, or around 99 hours per year. At an average US knowledge worker salary of approximately $53/hour, that is a theoretical upper bound of $5,247 per user per year. For a 1,000-seat deployment that is over $5 million, against a compute cost of roughly $338 per user per year on a standard D4S v5. The 15% CPU-bound assumption is exactly that, an assumption, and the translation of wait time to lost output is not one-to-one. But even at one-tenth of that figure, the productivity drag exceeds the compute bill. The real cost of old silicon is not the price increase you will see on your cloud invoice in 2027. It is the one you are already absorbing, invisibly, in your workforce today.

Sources and maths table tracing all 11 claims in the productivity cost section to primary sources or derived calculations

When Will You See Price Increases?

Based on OEM cost increases, hardware lifecycle analysis, and the depreciation model above, here is my read on the timing across the big three.

Three-column card layout showing price increase predictions for Microsoft Azure Amazon AWS and Google Cloud EUC services

The common driver across all three: Dell announced 15–20% price hikes on servers in December 2025. Lenovo followed in January 2026. Cloud providers typically lag OEM cost changes by three to six months before passing them through. That window closes in mid-2026.

What This Means for EUC Strategy

A few practical positions are worth taking now.

Audit your current instance vintage. If you are on Azure NVv4, AWS G4dn, or GCP N2 (Cascade Lake), you are on hardware designed in 2019. Demand gen-refresh timelines from your cloud CSP and model what migration to newer instances looks like for both performance and cost. The answer may surprise you.

Lock in reserved pricing now. One- and three-year reservations or committed use discounts taken before mid-2026 will insulate you from the first wave of price increases. This is especially important for Windows 365 and WorkSpaces flat-rate bundles, which are harder to renegotiate once pricing changes.

Reopen the on-premises conversation. When hyperscalers are charging cloud premium pricing for 2019-era silicon, the economics of modern on-premises or co-location hardware start to look very different. A Nutanix cluster on EPYC Genoa or Intel Sapphire Rapids delivers significantly newer silicon than the average EUC cloud instance, and you control the depreciation cycle yourself.

Push for CPU generation transparency. Unlike physical servers, cloud instance documentation rarely leads with CPU stepping, errata revision, or microarchitecture generation. That is intentional. Demand it from your vendors, particularly for GPU-accelerated workloads where the GPU and CPU generations can diverge significantly. The GCP G2 is a perfect example: a 2023 NVIDIA L4 GPU sitting on a 2019 Cascade Lake CPU.

The Bottom Line

The hyperscaler model, plainly stated

Cloud providers build their margins into the lifecycle of the hardware, not just the headline per-vCPU rate. They buy Cascade Lake and Rome servers in 2019–2020, depreciate them over five to seven years, and collect full rack-rate pricing throughout, including years when the hardware’s book value is zero.

Your cloud desktop is, quite literally, running on the cheapest possible silicon the market will accept.

The coming price increases are not about greed alone. Real hardware cost inflation is arriving through the supply chain. But they will land on customers who have been quietly cross-subsidizing hyperscaler margins for the past two years through fully amortized infrastructure.

High-performing CPUs in public cloud are rare and significantly more costly. The business model of modern EUC in the public cloud is built on you not noticing the vintage of the silicon underneath. Now you do.


Have you audited the CPU generation of your current EUC instances? Is the on-premises conversation reopening in your organization? Drop a comment below or find me on X at @kbaggerman.

The following two tabs change content below.

Kees Baggerman

Kees Baggerman is Senior Technical Director — Performance & Solutions Engineering R&D at Nutanix, where he leads a global team responsible for defining how enterprise applications are delivered on the Nutanix platform. A former Citrix Technology Professional and NVIDIA Enterprise Platform Advisor, he has spent 15+ years driving EUC strategy and technical direction across architecture, product, and customer success. He has been writing here since 2011 — sharing what he learns at the intersection of platform engineering and enterprise IT.
Kees Baggerman

Kees Baggerman

Senior Technical Director at Nutanix - Former Citrix CTP - NVIDIA Enterprise Platform Advisor - 15+ years in EUC

Kees Baggerman is Senior Technical Director — Performance & Solutions Engineering R&D at Nutanix, where he leads a global team responsible for defining how enterprise applications are delivered on the Nutanix platform. A former Citrix Technology Professional and NVIDIA Enterprise Platform Advisor, he has spent 15+ years driving EUC strategy and technical direction across architecture, product, and customer success. He has been writing here since 2011 — sharing what he learns at the intersection of platform engineering and enterprise IT.

Similar Posts

2 Comments

  1. I loved this article and it highlights even more plainly how much more expensive the cloud is than an on-premises solution. While this focused on the CPU which is obviously the main focus, I also shudder to think of the price increases that will be seen soon as cloud providers start to refresh their hardware and have elevated memory pricing. I feel fortunate that we’re primarily using m7i which is Sapphire Rapids, but the base clock of 2.4GHz still leaves a bit to be desired for overall ‘feel’ of the VM. Interestingly, the published pricing for M8i.xlarge (Intel 6 procs) is only 1 penny an hour more than m7i.xlarge, but it doesn’t appear the M8i series is available for workspaces or workspaces core yet. If that gets made available in Workspaces, it’s definitely the go-to choice.

    1. Your point about the hardware refresh memory tax is sharp. Cloud providers have to absorb or pass down the costs of high-speed DDR5, and that rarely swings in the customer’s favor over a multi-year cycle, and I fear we will likely be looking at a major TCO shock once those refresh cycles hit the billing console.

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.