The Cloud Desktop CPU Lottery: What Are You Actually Running On?

17 min read

Kees Baggerman·Senior Technical Director at Nutanix – Former Citrix CTP – NVIDIA Enterprise Platform Advisor – 15+ years in EUC

All posts →

There are two ways to run a cloud desktop on Microsoft Azure. In the first, Microsoft manages the infrastructure entirely: Windows 365 delivers Cloud PCs sized in vCPUs and RAM, and the VM type, CPU generation, and physical host are Microsoft’s decisions, not yours. In the second, you provision Azure Virtual Desktop yourself: you pick the VM SKU, configure the host pool, and control the lifecycle. Both end up in the same place: you do not know which physical CPU (silicon) your session is actually running on, and neither product surfaces that information by default (a conscious choice).

The way you lose that visibility differs in each case, and so do the options for responding. This post covers both. The root cause is the same across both products: Azure VM SKUs can span multiple CPU generations, and the one your session lands on affects performance in ways that matter for EUC workloads.

The practical effect is that two of your users on identical subscriptions, sitting next to each other in the same office, can experience performance that is measurably different throughout the working day. One can render a PowerPoint in 3 seconds on modern silicon (2023). The other waits seven seconds, using older silicon (2014). Neither knows why, neither can do anything about it, and your help desk cannot reproduce it on demand.

The Multi-Generation SKU Problem

The Dv3 series is a useful worked example because its documentation is explicit and its hardware range is wide. It is largely phased out as the primary recommendation for new EUC deployments, but it remains in service in many environments built before v5 became the standard recommendation. Here is what Microsoft documents as the CPU options for a Dv3 VM (Dv3 and Dsv3-series documentation):

Intel Xeon E5-2673 v3 (Haswell-EP), 2014
Intel Xeon E5-2673 v4 (Broadwell-EP), 2016
Intel Xeon 8171M (Skylake), 2017
Intel Xeon Platinum 8272CL (Cascade Lake), 2019
Intel Xeon Platinum 8370C (Ice Lake), 2021
Intel Xeon Platinum 8573C (Emerald Rapids), 2024

Six CPU generations, nine years of silicon, one SKU identifier, one price.

When you request a Standard_D4s_v3, Azure assigns you to a physical host from whatever is available in the cluster at that moment. You do not choose. You cannot request a specific generation. You get one.

This is documented policy. A Microsoft engineer confirmed in the Azure Q&A forum that there is no deterministic way to choose a desired CPU SKU for standard multi-tenant instances, and that the platform selects from whatever is available at the time of provisioning.

Real-world testing by the research team at GO-EUC confirmed this. Across repeated tests of the same VM SKU, they observed different CPU models being assigned to identical Dv3-class machines. Deallocating and starting a VM is enough to land it on a different physical host with a different processor generation, with no notification to the customer that anything has changed.

Dv3 is not a unique case. The same pattern runs through most older D-series generations. Dv2 and Dsv2 carry the same five-generation list. Ev3 and Esv3 list four generations. Even the Dsv5, which is the current recommended SKU for most new AVD deployments, officially runs on three CPU generations: Ice Lake, Sapphire Rapids, and Emerald Rapids. The lottery persists, just over a narrower range.

E-series and F-series instances are also used for EUC workloads with different memory or compute profiles, and any SKU family that spans more than one CPU generation has this same property. Dv3 is an extreme case, not the only case.

Windows 365: The Lottery You Cannot Influence

Windows 365 Cloud PCs are hosted by Microsoft on behalf of customers. The underlying VM series is selected by Microsoft and is not exposed to the customer. Public Windows 365 documentation describes Cloud PC sizes only in terms of vCPU and RAM, not by VM series or CPU generation. You cannot request a specific SKU family. You cannot request a specific CPU generation. The lottery happens once, at provisioning time, and the result is what you get.

From that point, a Windows 365 user reconnects to the same underlying Cloud PC on every subsequent session. The CPU generation assigned at provisioning is the one they are on for the life of that Cloud PC, stable under normal operations, until an administrator resizes or reprovisions the machine. Azure can move VMs between physical hosts during planned maintenance events. Still, in practice, live migration preserves CPU features compatible with the source host, so a generation change is unusual but not explicitly guaranteed. There is no redraw on reconnect. The lottery was a one-time event.

What happens on deallocation is documented in core Azure VM behavior. When an Azure VM is stopped and deallocated, it loses its physical host allocation. When it starts again, Azure places it on whatever hardware is available at that moment, which may be a different physical host and potentially a different CPU generation. A VM that stops at the OS level without being deallocated remains on the same host.

The distinction matters because the two Windows 365 products behave differently in this regard. In Windows 365 Enterprise, an admin restart or a user-initiated shutdown typically does not deallocate the underlying VM, so the one-draw model holds under normal operations. For Windows 365 Frontline, the situation is fundamentally different. Frontline Cloud PCs are deallocated between sessions by design: the shared licensing model requires it, since multiple users take turns on a smaller pool of Cloud PCs. A Frontline user re-enters the CPU lottery on every session connection. The lottery mechanics for Windows 365 Frontline are functionally identical to AVD pooled non-persistent.

That being said, Microsoft now ships Frontline in two distinct modes:

Frontline Shared mode: Cloud PCs reset to a clean state after each sign-out; the description in the post fits this mode. See Snapshot-based reset for Frontline shared mode.
Frontline Dedicated mode: each user gets a persistent, personalized Cloud PC. The license is pooled (up to 3 users per license, one active at a time), but the Cloud PC itself is not wiped between sessions. From the official Frontline page: “Up to 3 Cloud PCs per license, one active at a time”; from session time limits docs: “Windows 365 Frontline Cloud PCs in dedicated mode remain powered on for two hours after the user session becomes inactive.”

So “Frontline users re-enter the CPU lottery on every session connection” is true in Shared mode, but not as much in Dedicated mode. Frontline Dedicated behaves much more like Enterprise from a CPU-lottery perspective.

Azure Virtual Desktop: The Lottery You Are Managing

AVD is a different situation because the SKU choice is yours. When you build a host pool, you choose the VM size. That choice determines which CPU generation range the pool draws from. D-series instances cover general-purpose pooled desktops. E-series suits memory-intensive workloads. F-series suits task worker and cost-optimized single-session scenarios. The choice matters for more than raw performance: it determines the span of CPU generations your users are exposed to.

Within that choice, multi-generational variance still applies. Every SKU family that spans more than one CPU generation means VMs in your host pool could be on different underlying hardware, even when they are the same VM size. Pooled host pools compound this because every full sign-out re-enters the draw.

The Physical Host Behind Your Instance

Microsoft publishes exact host specifications in its Azure Dedicated Host documentation. Dedicated Host is the product that lets enterprises buy exclusive access to a physical server, and to price it, Microsoft has to disclose the underlying hardware. That documentation is the clearest public view of what the shared tenant fleet actually looks like under the hood.

Note that Dedicated Host offerings cover only the more recent generations; the actual shared tenant fleet also includes older silicon, including Haswell and Skylake, which are not available through this product.

For the D-series Intel line, the physical host specs per generation look like this:

Table showing Azure Dedicated Host physical specs per CPU generation, Broadwell through Emerald Rapids, with physical cores, vCPUs, and max D4s VMs per host

Two things stand out. First, the number of VMs per physical host has tripled from a Broadwell box to an Emerald Rapids one. That is relevant to the probability calculation below. Second, Hyper-Threading means you get roughly 1.5-2 vCPUs per physical core, depending on the generation and host configuration. A Standard_D4s VM uses 4 vCPUs, which corresponds to 2 physical cores. On a Broadwell host, that 40-core machine can support 16 D4s VMs. On an Emerald Rapids host, the same VM size fits 48. The newer the host, the more sessions it can carry, which affects how likely you are to land on one.

The Probability Maths

Exact fleet composition per region is not disclosed by any hyperscaler. What you can do is model the probability based on reasonable deployment assumptions.

A typical Azure region will have been built out in waves corresponding to hardware procurement cycles. Cascade Lake was the dominant server platform from roughly 2019 to 2022. Ice Lake from 2022 to 2024. Sapphire Rapids and beyond from 2024 onwards. Older generations like Broadwell and Skylake are still in service, but are no longer being added to. The Dv3 label spans all of them because Microsoft has never imposed a hard version boundary on that SKU family.

Model a realistic EUC-region scenario. Assume a Dv3-capable cluster in a mature Azure region has the following host composition:

Table showing probability of landing on each CPU generation in a mature Azure region, weighted by available D4s session slots per host type

The key insight is that probability is not uniform across host types. Newer hosts pack more VMs, so even if Cascade Lake accounts for 46% of physical hosts by count, it still represents a disproportionately large share of total available session slots. Conversely, Broadwell hosts are underweight in the session lottery because there are fewer slots per machine.

Under this scenario, a user connecting to a Standard_D4s_v3 in a mature region has roughly a 68% chance of landing on a CPU from 2019 or earlier. They have approximately a 7% chance of landing on Broadwell hardware designed over a decade ago. Because the session assignment is stateless and load-balanced, the same user connecting the next day gets a fresh draw, provided they fully sign out. More on that below.

Run a more favorable scenario: a newer region where Microsoft has been actively deploying Ice Lake and Sapphire Rapids hardware and retiring older stock. Flip the distribution so 60% of hosts are Ice Lake or newer. The odds of landing on something from 2019 or earlier drop to around 30%. Better, but still not zero, and still completely invisible to the user.

The Performance Gap Between Those Draws

The CPU generations within a single SKU differ by only a few percentage points in EUC performance. They span years of microarchitecture development.

GO-EUC’s v6 SKU upgrade research provides the clearest published reference for EUC-specific CPU performance across generations using the OBUX scoring model. Working from their data and publicly available CPU benchmarks, here is a representative performance range across the D-series CPU generations. Note that Sapphire Rapids and Emerald Rapids do not appear in the Dv3 SKU; they are included here as reference points for the Dsv5 and Dsv6 ranges, respectively:

Table showing OBUX EUC performance scores for Intel CPU generations from Haswell (2013) to Emerald Rapids (2023) with bar indicators and delta vs Cascade Lake baseline

Within Dv3, the performance spread between the worst and best hardware draws is enormous: roughly 75–100%. Broadwell-to–Ice Lake variance is about 75%, while Haswell-to–Ice Lake reaches nearly 100%. Although Haswell nodes are gradually aging out of the fleet, they still appear in production clusters today.

In practical terms, the most common ‘bad draw’ currently delivers only ~60–65% of the performance of a ‘good draw’ — at exactly the same hourly price, and with no visibility or control over which hardware a customer receives. Customers can end up paying identical rates for infrastructure that delivers nearly half the performance, with no transparency into the underlying hardware assignment.

How Often Do You Get a New Draw

Whether a user re-enters the lottery depends on how they disconnect and what deployment model they are on.

In pooled or random non-persistent deployments, users re-enter the CPU lottery on every full sign-out. AVD’s breadth-first load balancing assigns each new session to the host with the fewest active sessions at that moment. The outcome is deterministic given the current pool state, but whichever host has the lowest session count could be any CPU generation. Most users reconnect after a full sign-out and land on a different host. In a mixed-generation fleet, that means unpredictable performance from one day to the next: sometimes better, sometimes worse, always invisible to the user. Citrix random non-persistent desktops have the same behavior: every logon is a fresh draw from the pool.

In dedicated or persistent deployments, there is no lottery on reconnect. The user always goes back to the same VM. Whatever CPU generation was assigned at provisioning is stable under normal operations.

There is a third scenario worth noting. When a user closes their laptop or locks their screen without signing out, which is the most common daily pattern for knowledge workers, AVD and Citrix both hold the disconnected session alive on the same session host. The next connection picks up on the same physical machine. No new draw occurs. The lottery outcome is effectively sticky across a working day regardless of deployment type.

The table below maps this out across the major EUC deployment models, sourced from Azure Virtual Desktop session terminology and Citrix delivery method documentation:

Table showing CPU lottery behaviour across EUC platforms, Windows 365, AVD, Citrix DaaS, and Amazon WorkSpaces, with session model and reconnect CPU switch rate

Read the tables together, and a clear picture forms. Dedicated and persistent users are on one CPU generation for the duration of that VM allocation. The lottery was a one-time event. Pooled and non-persistent users are re-rolling constantly, but in a mixed-generation fleet, that is not an escape route. It is just unpredictability. Neither outcome gives you confidence about what your users are actually running on at any given moment.

This Is Not Just an Azure Problem

AWS follows a similar abstraction model. The Amazon WorkSpaces bundle catalog (Value, Standard, Performance, Power, PowerPro, Graphics.g4dn, GraphicsPro.g4dn) is published in tier names rather than EC2 instance families, and AWS does not document which specific EC2 instance type underpins each bundle. The Graphics.g4dn bundle is the exception, where AWS named the underlying G4dn family directly. For the rest, the customer sees the bundle name and not the silicon underneath, mirroring the Windows 365 abstraction model. As an illustration of how wide an AWS family can range, the T3 family, which AWS markets for virtual desktop workloads, among others, officially runs on either Skylake-SP or Cascade Lake processors. Two generations, no customer choice, no visibility into which one a given instance lands on.

What You Can Do

For Windows 365 Customers

The SKU choice is not yours to make. The available mitigations mostly focus on visibility.

Start with a query. On Windows, (Get-CimInstance CIM_Processor).Name Returns the physical processor model (WMIC was disabled by default in Windows 11 24H2, so the CIM approach works across all current versions). Running this from a logon script across your Cloud PC estate and logging the results shows which hardware generation your users are on. There is no action you can take to change it without an admin-initiated resize, but knowing the distribution is the starting point for any escalation or remediation conversation.

If your organization uses Windows 365 Frontline, treat it as equivalent to AVD pooled non-persistent for planning purposes: every session is a new draw from whatever hardware is available. The diagnostic script above applies equally, but reducing variance in a Frontline environment requires working with Microsoft on the underlying Cloud PC pool configuration rather than individual resizes.

If the CPU generation your users are on is a concern, the available path is an admin-initiated resize to a newer Cloud PC size. Whether that results in a better CPU generation depends on what Microsoft provisions at that point. There is no guarantee, but a resize to a newer Cloud PC tier is more likely to land on recent silicon than to remain on hardware provisioned years ago.

For AVD Customers

The direct fix is SKU migration. The Dsv5 on Azure runs on three CPU generations, all post-2020. The Dsv6 is locked to Emerald Rapids. Moving an AVD deployment from Dv3 or Dv4 to v5 or v6 not only improves average performance but also eliminates the oldest hardware from the draw entirely. The same reasoning applies to E-series and F-series host pools: check the documented CPU generation range for your chosen SKU and decide whether the span is acceptable.

Proximity Placement Groups (PPGs) are a documented intermediate option if a full SKU migration is not immediately feasible. Constraining a host pool to a PPG pins the VMs to a specific physical cluster. Within a cluster, hardware typically reflects a single procurement wave, which reduces the span of CPU generations in the draw without requiring a SKU change. PPGs do not guarantee a single generation, but they meaningfully narrow the range for environments that cannot migrate today.

For pooled deployments, consider host pool segmentation by VM generation. Running separate host pools for v4 and v5 instances, rather than mixing them, makes the performance story predictable and gives you a clean migration path: new users go into the v5 pool, existing users migrate in waves.

If you need to diagnose which CPU generation users are landing on today, deploy a logon script that runs (Get-CimInstance CIM_Processor).Name and logs the result to your monitoring platform. What most organizations find when they first do this is a wider spread than expected, and a clear correlation between the reported CPU model and the help desk tickets about session performance.

Azure Dedicated Host solves the problem completely by binding your VMs to a specific physical host type and, therefore, a specific CPU generation. It costs more, but the price is transparent, and the silicon is guaranteed. For GPU-accelerated workloads in particular, where CPU and GPU generations can diverge significantly, this is worth modeling against the performance variance you are currently absorbing.

On AWS, Dedicated Hosts provide the same guarantee: instances run on a physical server allocated exclusively to your account and assigned to a specific processor family. For WorkSpaces deployments, pairing Dedicated Hosts with a WorkSpaces bundle that maps to the desired instance type eliminates the need for the CPU generation variable.

Google Cloud offers Sole-tenant Nodes for the same purpose. If your GCP-hosted VDI workload is on N1 and the multi-generation spread is a concern, sole-tenancy pins your instances to a node with a specific CPU platform, eliminating the random draw at the cost of dedicated capacity pricing.

The version number on a cloud instance SKU does not guarantee a CPU generation. It is a feature set boundary. The hardware underneath it spans whatever the provider had in the rack when your session started. For most workloads, that variance is an acceptable trade-off. For EUC, where session performance is directly felt by the person sitting at the keyboard, it is worth understanding exactly what lottery you are buying into, and how often that ticket gets redrawn.

The Bottom Line

The lottery is real and invisible.

A SKU version number is a feature-set boundary, not a CPU-generation guarantee. The hardware underneath spans whatever the provider had available in the rack when your session started, and that can mean an eight-year spread on a single instance family.

For Windows 365 Enterprise users, the draw occurs only during provisioning and remains stable for the life of that Cloud PC. For Windows 365 Frontline users, the draw happens on every session connection. For AVD users on pooled host pools, the draw happens on every full sign-out. In a mixed-generation fleet, none of these outcomes gives you confidence about what your users are actually running on.

For EUC, a bad draw means your users are running at half the performance of a good draw, for the same hourly price, with no visibility into which one they received.

The fix starts with visibility. Query the CPU model from within sessions, segment AVD host pools by generation, and consider whether the SKU family you are on still falls within an acceptable range. The lottery will not stop, but you can start reading the ticket.

Have you run CPU queries across your host pool and found a wider spread than expected? I would be interested to hear what the distribution looks like in real environments. Drop a comment below or find me on X at @kbaggerman.

Bio
Latest Posts

Kees Baggerman

Kees Baggerman is Senior Technical Director — Performance & Solutions Engineering R&D at Nutanix, where he leads a global team responsible for defining how enterprise applications are delivered on the Nutanix platform. A former Citrix Technology Professional and NVIDIA Enterprise Platform Advisor, he has spent 15+ years driving EUC strategy and technical direction across architecture, product, and customer success. He has been writing here since 2011 — sharing what he learns at the intersection of platform engineering and enterprise IT.

Latest posts by Kees Baggerman (see all)

Citrix MCSIO on Nutanix AHV: A Solution to a Problem That Doesn’t Exist - June 10, 2026
The Cloud Desktop CPU Lottery: What Are You Actually Running On? - June 2, 2026
Your Cloud Desktop Is Running on Yesterday’s Silicon - May 21, 2026
When the Orchard Ships Production Software: AI-Augmented Development in the Real World - May 17, 2026
Nutanix Documentation Script v5.0: Visual Reports, Brand Templates, and Seven Embedded Diagrams - May 15, 2026

Kees Baggerman

Senior Technical Director at Nutanix - Former Citrix CTP - NVIDIA Enterprise Platform Advisor - 15+ years in EUC

X / Twitter LinkedIn GitHub Email

Kees Baggerman

Latest posts by Kees Baggerman (see all)

Updated: VM Reporting Script for Nutanix AHV with Powershell

How to deploy Citrix NetScaler VPX on Nutanix AHV

Protecting your Citrix desktops on Nutanix using powershell

Updated (again!): VM Reporting Script for Nutanix AHV/vSphere with Powershell

AOS 5.0 released, XenServer TP in full effect!

Leave a Reply Cancel reply

The Multi-Generation SKU Problem

Windows 365: The Lottery You Cannot Influence

Azure Virtual Desktop: The Lottery You Are Managing

The Physical Host Behind Your Instance

The Probability Maths

The Performance Gap Between Those Draws

How Often Do You Get a New Draw

This Is Not Just an Azure Problem

What You Can Do

For Windows 365 Customers

For AVD Customers

The lottery is real and invisible.

Kees Baggerman

Latest posts by Kees Baggerman (see all)

Similar Posts

Leave a Reply Cancel reply