Migration Guide

How to decide whether 800G belongs in your enterprise AI fabric without buying a forklift upgrade in disguise

The pressure to jump to 800G is real, but most of the guidance is written for hyperscalers with a hundred thousand GPUs and a different budget. This guide is for the enterprise architect sizing an AI or GPU fabric in the dozens-to-low-thousands of GPUs, who needs to know when 800G is genuinely justified, when it is premature, and how to choose switch silicon today that does not become a rip-and-replace tomorrow.

⏱ 18 min read Enterprise-focused | Engineering-led | Migration-ready

Key Takeaways

  • Most enterprise AI clusters do not need 800G today - bandwidth per port is rarely the first bottleneck in clusters with dozens to low thousands of GPUs.
  • The critical decision is SerDes generation, not port speed labels - 200G-per-lane switch silicon makes 1.6T an optics upgrade rather than a forklift replacement.
  • 800G optical modules consume 14-20W per port and can dominate total switch power, making thermal modeling essential for deployable density calculations.
  • OSFP offers the longest forward roadmap and thermal headroom; QSFP-DD prioritizes backward compatibility with existing 400G optics investments.

Enterprises feel pressure to adopt 800G for AI workloads, but bandwidth per port is rarely the first thing that breaks in an enterprise cluster

The risk is overbuying for headroom you will not touch, or buying an "800G" platform that quietly forecloses the next upgrade.

Most guidance on 800G migration comes from hyperscaler environments with fundamentally different scale, budget, and operational constraints. When applied to enterprise AI clusters - typically dozens to low thousands of GPUs rather than hundreds of thousands - this guidance leads to systematic overbuying and architectural decisions that create future lock-in.

The four decisions that determine your 800G migration

A sound 800G decision comes down to four linked choices. Get the lane generation right and the rest becomes an optics upgrade rather than a forklift.

These decisions are interdependent and must be made in sequence. The lane generation choice, in particular, determines whether your next upgrade is a simple optics swap or a complete infrastructure replacement.

The four critical choices

Each decision constrains the next, and the lane choice carries the whole investment forward. Understanding this dependency is essential for avoiding costly architectural dead ends.

Decision one: justification

Is 800G earning its premium in your environment? Large collective-heavy training with 800G-capable NICs justifies it; inference-dominant or modest mixed workloads usually do not.

The justification test comes down to three factors: whether the network actually gates Job Completion Time today, whether your endpoints can utilize 800G speeds, and whether your workload profile generates enough east-west traffic to saturate higher-speed links.

Most enterprise AI workloads are inference-dominant or involve training jobs that are not network-bound at current scales. The 800G premium - both in switch silicon and optics - only makes sense when the network is demonstrably the bottleneck, not when it might become one at some theoretical future scale.

Decision two: lane economics

800G today is 8x100G PAM4; the path to 1.6T is 8x200G PAM4. Buying 200G-lane-capable switch silicon now makes the next jump an optics upgrade, not a chassis replacement.

This is the most critical architectural decision in the entire migration. Switch vendors market "800G" platforms built on both 100G-per-lane and 200G-per-lane SerDes technology. The port speed is the same today, but the upgrade path is completely different.

A 100G-per-lane platform reaches its maximum theoretical throughput at 800G. When 1.6T becomes standard, you need new switch silicon. A 200G-per-lane platform can reach 1.6T by changing optics alone - the same chassis, the same power envelope, the same operational model.

Decision three: power and thermal

800G optical modules draw roughly 14 to 20W per port and can dominate total switch power. Deployable density is whatever the rack airflow can actually cool, not the slot count.

Power and thermal modeling is where most 800G designs fail in practice. The switch silicon power consumption is well-documented, but the optics power is often treated as an afterthought until the first fully-populated switch exceeds the rack's cooling capacity.

At 800G speeds, optical transceivers can consume more power than the switch silicon itself. A 32-port 800G switch with high-power optics can draw over 1000W just for the transceivers, before accounting for the switching silicon, fans, and power supply inefficiencies.

Thermal envelope calculations

Calculate total switch power fully populated, not half-populated, and confirm the rack can shed the heat at the density you are planning. This includes transceiver power, switching silicon power, fan power, and power supply losses - all of which contribute to the thermal load the rack must handle.

Decision four: optics selection

OSFP offers thermal headroom and the longest forward roadmap; QSFP-DD prioritizes backward compatibility with existing 400G optics. Confirm transceiver-to-platform qualification rather than assuming it.

The optics form factor decision has long-term implications for sourcing, compatibility, and upgrade paths. This is not just about mechanical fit - different form factors have different thermal envelopes, power budgets, and vendor ecosystems.

OSFP was designed specifically for higher-power, higher-speed applications and has more thermal headroom for future speed increases. QSFP-DD extends the existing QSFP family and typically offers backward compatibility with 400G and lower-speed modules, protecting existing optics investments.

How to work through the decision

Take the four decisions in order. Each one constrains the next, and the lane choice carries the whole investment forward.

The decision process is deliberately sequential because each choice eliminates options for the subsequent decisions. Starting with workload justification prevents overbuying; choosing SerDes generation early protects the upgrade path; modeling power and thermal early prevents deployment surprises.

Profile the workload and the endpoints

Determine whether the network actually gates Job Completion Time today, and confirm where your server NICs top out. There is no benefit to an 800G fabric in front of 400G endpoints. Look at actual traffic patterns, not theoretical maximums, and understand whether your workloads are network-bound, compute-bound, or memory-bound.

Choose silicon by SerDes generation, not port label

Look past the 800G sticker to the underlying lane rate. A 200G-per-lane platform protects the path to 1.6T; a 100G-per-lane platform is an 800G terminal device. Ask vendors specifically about SerDes generation and confirm the maximum theoretical port speed the silicon can support.

Model the per-port power and thermal envelope

Calculate total switch power fully populated, not half-populated, and confirm the rack can shed the heat at the density you are planning. Include transceiver power, switching power, cooling power, and power supply losses in your calculations.

Match optics to reach, form factor, and platform

Select OSFP or QSFP-DD based on greenfield versus brownfield priorities, then qualify each transceiver against the platform rather than assuming compatibility. Develop a sourcing strategy that avoids single-vendor lock-in for transceivers.

OSFP or QSFP-DD for your 800G optics?

The choice between OSFP and QSFP-DD depends on whether you prioritize forward compatibility or backward compatibility, and whether thermal headroom or existing investments matter more to your deployment.

OSFP: Forward-looking choice

Larger module with greater thermal headroom, designed with higher-power, higher-speed optics and the 1.6T transition in mind. Best fit for greenfield AI fabrics where the longest forward roadmap and thermal margin matter most. Tradeoff: does not carry forward existing QSFP-family optics; a deliberate forward-looking choice rather than a compatibility one.

QSFP-DD: Compatibility-focused choice

Prioritizes backward compatibility; cages typically accept earlier QSFP-family modules, protecting existing optics investments. Best fit for brownfield estates extending existing 400G deployments through a mixed-generation transition. Tradeoff: less thermal headroom than OSFP at the top of the range; confirm qualified optics per platform.

What you'll walk away with

This guide provides four practical tools for making sound 800G migration decisions without falling into the common traps that lead to overbuying or architectural dead ends.

Justification test

A short set of criteria - cluster size, training versus inference, growth trajectory, NIC speed - for deciding whether 800G is warranted now or whether 400G with a clean upgrade path is the smarter spend.

Lane-generation checklist

The questions to ask a switch vendor to confirm 200G-lane capability and protect the path to 1.6T, cutting through marketing positioning to understand the actual SerDes architecture.

Power and thermal worksheet

A per-port budgeting approach that turns slot count into deployable, coolable density, accounting for transceivers, switching silicon, cooling, and power supply losses.

Related Resources

FAQs

Frequently Asked Questions

Should I skip 800G and wait for 1.6T?

For most enterprises, no. Deploy what your endpoints need today, often 400G, on switch silicon with 200G-capable lanes. That gives you current-generation economics now and an optics-only path to 800G and 1.6T later, instead of waiting indefinitely for a standard your servers cannot yet use.

Is 800G overkill for an enterprise AI cluster?

It depends entirely on workload. Large, collective-heavy training jobs can justify it; inference-dominant or modest mixed workloads usually do not. Profile the network's role in Job Completion Time before deciding.

OSFP or QSFP-DD, which should I choose?

Choose OSFP for greenfield builds where thermal headroom and the longest roadmap matter. Choose QSFP-DD when backward compatibility with existing 400G optics is a priority. In both cases, confirm transceiver-to-platform qualification rather than assuming compatibility.

How much power should I budget per 800G port?

Plan around roughly 14 to 20W per 800G optical module, and remember that optics can dominate total switch power in dense designs. Model the full per-port thermal envelope before locking in density.

What is the single biggest mistake in an 800G migration?

Buying a 100G-per-lane "800G" switch as a future-proofing move. The port label says 800G, but the SerDes generation makes the next step to 1.6T a forklift. Decide on the lane generation first; everything else follows from it.

Ready to assess your 800G migration path?

IVI provides the engineering depth to evaluate your workload profile, select purpose-built switch silicon, and execute the migration without the forklift. We handle platform selection, optics sourcing, and migration execution - giving you operational ownership of a fabric designed for your actual requirements, not hyperscaler assumptions.

Start a Conversation