Virgo Network
Google positions Virgo Network as the new scale-out accelerator fabric inside a campus-as-a-computer architecture, designed to deliver deterministic low latency, massive scale, and resilient goodput for modern AI workloads.
Why The Network Had To Change
The article argues that AI workloads have broken the assumptions of general-purpose data center networking along four dimensions: scale, bandwidth, burstiness, and latency.
Define the AI networking constraints
Training runs now exceed the power and space limits of a single data center, traffic arrives in synchronized bursts, and latency-sensitive serving demands consistent response times.
Split the architecture into specialized layers
Google separates scale-up, scale-out, and north-south domains so each layer can evolve independently and optimize for its role.
Introduce Virgo as the scale-out fabric
Virgo becomes the east-west fabric for accelerator-to-accelerator communication across pods at non-blocking scale.
Tie performance to reliability and system design
The network is co-designed with accelerators and instrumented for observability, fault isolation, and fast recovery.
The Three-Layer Architecture
The article does not describe Virgo in isolation. It places the fabric inside a unified compute-domain architecture with separate responsibilities for each traffic type.
Scale-up domain
The intra-pod interconnect for tightly coupled accelerator communication.
Scale-out accelerator fabric
The dedicated RDMA east-west network for deterministic latency and horizontal scale across pods.
Jupiter front-end network
The north-south fabric for storage access, general-purpose compute, and scaling training across sites.
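The three-layer split above can be sketched as a simple traffic-class mapping. The domain names follow the article; the classification function, pod names, and traffic labels are illustrative assumptions, not Google's actual routing logic.

```python
# Illustrative sketch of the three-domain split described in the article.
# The routing heuristic and traffic labels are hypothetical.

DOMAINS = {
    "scale_up": "intra-pod interconnect for tightly coupled accelerators",
    "scale_out": "Virgo RDMA east-west fabric across pods",
    "front_end": "Jupiter north-south network for storage and CPU traffic",
}

def pick_domain(src_pod: str, dst_pod: str, traffic: str) -> str:
    """Choose a network domain for a flow (illustrative heuristic)."""
    if traffic == "accelerator" and src_pod == dst_pod:
        return "scale_up"      # tightly coupled, same pod
    if traffic == "accelerator":
        return "scale_out"     # cross-pod accelerator traffic -> Virgo
    return "front_end"         # storage / general-purpose -> Jupiter

assert pick_domain("pod0", "pod0", "accelerator") == "scale_up"
assert pick_domain("pod0", "pod7", "accelerator") == "scale_out"
assert pick_domain("pod0", "pod7", "storage") == "front_end"
```

The point of the separation is that each domain can adopt new hardware and protocols on its own cadence without forcing changes on the other two.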
What Virgo Adds
Virgo’s design centers on lower latency, flatter topology, and better fault containment at extreme AI scale.
Flat two-layer non-blocking topology
High-radix switches reduce the number of network tiers, which the article says cuts latency compared with traditional designs.
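To see why fewer tiers helps, a back-of-the-envelope model: in a non-blocking folded-Clos fabric built from radix-r switches, a two-tier design supports r²/2 endpoints with at most 3 switch hops, while reaching the same scale with lower-radix switches requires a three-tier design and 5 hops. This is standard Clos arithmetic, not figures from the article.

```python
def clos_capacity(radix: int, tiers: int) -> tuple[int, int]:
    """Max endpoints and worst-case switch hops for an ideal non-blocking
    folded-Clos fabric of switches with `radix` ports.
    Textbook arithmetic; the article gives no exact numbers."""
    if tiers == 2:
        # r leaves with r/2 host ports each, fed by r/2 spines
        return (radix * radix) // 2, 3
    if tiers == 3:
        return (radix ** 3) // 4, 5
    raise ValueError("model covers 2- and 3-tier fabrics only")

hosts2, hops2 = clos_capacity(64, 2)  # 2048 endpoints, 3 hops
hosts3, hops3 = clos_capacity(64, 3)  # 65536 endpoints, 5 hops
```

Doubling switch radix quadruples two-tier capacity, which is why high-radix silicon lets a flat two-layer fabric cover scales that previously demanded an extra tier and its added latency.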
Multi-planar design
Independent switching planes and control domains help contain faults and preserve cluster-wide goodput.
Bisection bandwidth
The aggregate bandwidth across the worst-case cut dividing the fabric in half; the key throughput metric used to express Virgo’s non-blocking scale across a single fabric.
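A quick illustrative calculation for an ideal non-blocking fabric, where half the endpoints can all transmit across the cut at full line rate. The host count and link speed below are invented for the example, not taken from the article.

```python
def bisection_bandwidth_tbps(hosts: int, link_gbps: float) -> float:
    """Bisection bandwidth of an ideal non-blocking fabric: half the
    endpoints driving their full link rate across the worst-case cut."""
    return hosts / 2 * link_gbps / 1000  # Gb/s -> Tb/s

# Hypothetical numbers for illustration only.
print(bisection_bandwidth_tbps(hosts=8192, link_gbps=400))  # 1638.4 Tb/s
```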
Goodput
The post focuses on delivered workload throughput, not just raw bandwidth, because synchronized AI systems are vulnerable to stragglers and localized faults.
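Why goodput rather than bandwidth: in a synchronized training step, every worker waits for the slowest one, so a single straggler drags delivered throughput for the entire cluster. A minimal model of that effect (the worker count and timings are invented):

```python
def synchronized_goodput(step_times_s: list[float]) -> float:
    """Fraction of typical throughput delivered when every worker must
    wait for the slowest (max) step time each iteration."""
    typical = sorted(step_times_s)[len(step_times_s) // 2]  # median worker
    return typical / max(step_times_s)

# 1023 healthy workers at ~1.0 s/step, one straggler at 2.0 s:
times = [1.0] * 1023 + [2.0]
print(round(synchronized_goodput(times), 2))  # 0.5
```

One node running at half speed halves the goodput of all 1024, which is why the post treats stragglers and localized faults as fabric-level problems rather than per-node nuisances.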
Reliability At Megascale
The post treats reliability as a first-class architectural objective because a single bad component can throttle a synchronized training cluster.
Fault isolation
Independent switching planes contain localized hardware failures before they spread into cluster-wide performance loss.
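The value of plane-level containment can be quantified with a simple idealized model: if a fault takes down an entire plane, the fabric loses 1/P of its bandwidth rather than the whole fabric. The plane counts below are illustrative; the article does not state how many planes Virgo uses.

```python
def surviving_bandwidth(planes: int, failed_planes: int) -> float:
    """Fraction of fabric bandwidth remaining when faults are contained
    to whole planes in a multi-planar design (idealized model)."""
    if not 0 <= failed_planes <= planes:
        raise ValueError("failed_planes must be between 0 and planes")
    return (planes - failed_planes) / planes

assert surviving_bandwidth(4, 1) == 0.75  # one of four planes down
```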
Deep observability
Sub-millisecond telemetry provides the visibility needed to detect transient congestion, optimize buffers, and isolate root causes.
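A sketch of what sub-millisecond telemetry enables: with queue-depth samples every few hundred microseconds, microbursts that a per-second counter would average away become visible. The sample format, units, and threshold below are assumptions for illustration.

```python
def find_microbursts(queue_depth_samples, threshold, min_run=3):
    """Return (start, end) index ranges where queue depth stayed above
    `threshold` for at least `min_run` consecutive sub-ms samples.
    A coarse per-second average would hide such transients."""
    bursts, start = [], None
    for i, depth in enumerate(queue_depth_samples):
        if depth > threshold and start is None:
            start = i
        elif depth <= threshold and start is not None:
            if i - start >= min_run:
                bursts.append((start, i))
            start = None
    if start is not None and len(queue_depth_samples) - start >= min_run:
        bursts.append((start, len(queue_depth_samples)))
    return bursts

samples = [2, 3, 90, 95, 92, 4, 2, 88, 3]  # hypothetical queue depths (KB)
print(find_microbursts(samples, threshold=80))  # [(2, 5)]
```

The average of these nine samples is well under the threshold, yet three consecutive hot samples reveal a transient congestion event worth correlating with buffer tuning.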
Straggler and hang detection
Automated localization of degraded or unresponsive nodes helps protect mean time between interruptions (MTBI), reduce mean time to repair (MTTR), and keep large jobs moving.
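Straggler localization can be sketched as robust outlier detection on per-node step times: flag nodes whose completion time sits far above the median. The median-plus-MAD scoring rule here is a common heuristic chosen for illustration, not the article's algorithm.

```python
import statistics

def flag_stragglers(step_times, k=5.0):
    """Flag node indices whose step time exceeds median + k * MAD,
    a robust outlier rule (illustrative, not Google's detector)."""
    med = statistics.median(step_times)
    mad = statistics.median(abs(t - med) for t in step_times)
    cutoff = med + k * max(mad, 1e-9)  # guard against zero spread
    return [i for i, t in enumerate(step_times) if t > cutoff]

times = [1.00, 1.02, 0.99, 1.01, 1.85, 1.00]  # node 4 is slow
print(flag_stragglers(times))  # [4]
```

Using the median and MAD rather than mean and standard deviation keeps the cutoff stable even when the straggler itself skews the statistics.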
FAQ From The Knowledge Graph
The generated graph includes linked Question and Answer nodes for the article’s scale claims, architecture, and reliability model.
What is Virgo Network?
It is Google’s scale-out AI data center fabric for east-west accelerator communication across pods.
What are the three layers in the reimagined architecture?
Scale-up domain, scale-out accelerator fabric, and Jupiter front-end network.
What performance improvements are claimed?
Up to 4x bandwidth per accelerator and 40% lower unloaded fabric latency for TPUs.