Skip to content
Morning Briefing · Tuesday, May 19, 2026

MRC Reaches OCP as Coding Agents Cross the Daily-Driver Threshold

networkingautomationai-mldatacenterscience
Listen to the episode
MRC Reaches OCP as Coding Agents Cross the Daily-Driver Threshold
29 min · 99 turns
Plate Ileaf · spine
Schematic leaf-spine fabric — explicit-path traffic flows across the spine plane, pods at the edges.
Top Highlights
№ 01·Top Highlights

Top 3 Highlights

1. NVIDIA MRC Protocol Submitted to OCP — AI Fabric Wars Move to Standards Bodies

TL;DR: NVIDIA, alongside AMD, Broadcom, Intel, Microsoft, and OpenAI, has submitted the Multipath Reliable Connection (MRC) specification to the Open Compute Project, formally transitioning a proprietary Spectrum-X feature into a contested open standard. MRC enables a single RDMA connection to spray traffic across hundreds of paths simultaneously using SRv6 source routing from the endpoint NIC — bypassing the ECMP hash entropy problems that throttle standard RoCEv2 at gigascale.

Key Points:

  • MRC decouples RDMA connections from individual network paths: a single queue pair fans traffic across hundreds of paths simultaneously, with the receiver reassembling in order — eliminating hot spots that force additional switch tiers
  • Routing intelligence moves to the NIC via SRv6 source routing: each packet encodes its own path in the header, turning switches into dumb forwarders and removing switch-level load-balancing state from the critical path
  • Real-time fabric telemetry ingestion enables proactive congestion rerouting at microsecond speed — not the seconds-long reconvergence while GPU collectives stall
  • Selective retransmission sends only lost packets, not full data windows, preserving bandwidth during partial path failures
  • OpenAI deployed MRC on Spectrum-X to train frontier models including ChatGPT and Codex; Microsoft is deploying in GB200-class AI factories today

The OCP submission marks MRC's transition from a Spectrum-X competitive differentiator to a contested open protocol — and the co-author list is the whole story. NVIDIA, AMD, Broadcom, Intel, Microsoft, and OpenAI co-authoring a NIC-level specification is the AI fabric equivalent of a de facto standards vote. Eighteen to twenty-four months from now, when MRC-capable NICs from multiple vendors appear in procurement catalogs, this submission will be the date the AI fabric monoculture ended.

The technical substance behind the spec deserves attention. Standard RoCEv2 binds a connection to a single network path — the switch does load balancing via ECMP hash, which creates flow concentration at scale. When you're running collective operations across tens of thousands of GPUs, a few saturated links stall the entire job. MRC solves this by moving path selection from switches to NICs via SRv6 source routing: each packet carries its own forwarding instructions in the header.

When path selection moves from switches to NICs, the fabric stops making decisions and starts executing them.

The practical implications extend beyond AI fabric design. The YANG models and telemetry paths for SRv6 segment lists are already mature tooling. Engineers who have invested in SRv6 expertise will find themselves directly applicable to AI fabric evaluation in a way that wasn't obvious six months ago. This week alone, SRv6 appeared in the MRC spec, in netlab's BGP-free core topology, and in the SpaceMoE paper's hidden-state routing framing — three independent signals converging on the same protocol investment.

So What? MRC reaching OCP marks the shift from proprietary AI fabric differentiation to contested open protocol — get familiar with the spec now, ask NIC vendors on your approved list for implementation timelines, and treat SRv6 expertise as a cross-domain investment.

SourcesPacket Pushers NB575, NAND Research — NVIDIA MRC at OCP, OCP MRC Specification 1.0


2. Simon Willison at PyCon 2026 — Coding Agents Crossed the Daily-Driver Threshold in November 2025

TL;DR: Simon Willison's PyCon 2026 lightning talk documented a genuine inflection point: RLVR (Reinforcement Learning from Verifiable Rewards) applied to code tasks caused coding agents to cross from "often works" to "mostly works" in November 2025. The same binary-verifiability property that enabled this training breakthrough applies directly to network configuration validation — which has direct implications for where to invest in automation infrastructure now.

Key Points:

  • November 2025 inflection: RLVR on code tasks enabled "daily driver" agents requiring minimal correction — the mechanism is binary reward signals (code runs or it doesn't) rather than human feedback
  • The best model changed hands five times in six months across Anthropic, OpenAI, and Google — a direct argument for model-agnostic abstraction layers in any automation pipeline
  • Qwen3.6-35B-A3B runs at 20.9 GB — meaningfully competitive on coding tasks at laptop scale, closing the on-premises inference capability gap
  • GLM-5.1, a 1.5-trillion-parameter open-weight model from a Chinese lab, is in the frontier mix — open-weight competition is no longer just a Western story
  • A wave of ambitious projects launched in December–January on the back of the November capability jump — Willison characterizes this with characteristic dry humor as "predictable"

The November 2025 inflection is worth taking seriously because Willison is one of the most methodical public trackers of LLM capability. "Daily driver with minimal correction" matches what network automation practitioners have been reporting privately: AI-assisted configuration generation is now useful enough for real pipelines, not just demos. The RLVR mechanism is the key insight: code verification is binary — code runs or it doesn't, and the reward signal is unambiguous. That allowed reinforcement learning to provide much stronger training signal than human feedback alone.

The verification loop that made coding agents good is the same one that will make network automation agents good.

The same binary-verifiability property applies directly to network configuration validation. A config either passes a Batfish check or it doesn't. A BGP session either comes up or it doesn't. An intent-based policy either allows the traffic or it doesn't. Teams building digital-twin validation loops and intent-based testing frameworks right now are creating the evaluation infrastructure that next-generation network automation agents will directly benefit from — not eventually, but on whatever the equivalent timeline is from the network automation community's equivalent of November 2025.

The six-model churn in six months also encodes an important architecture lesson: if the model is the unstable part of the stack, stable engineering investment goes into tool definitions, validation logic, and intent models — not which LLM you're calling. The model is an interchangeable backend. Any automation pipeline that depends on model-specific quirks is built on the wrong foundation.

So What? Build your digital-twin validation loops now — the binary-verifiable reward signal that made coding agents good is precisely what will make network automation agents good, and the teams with clean validation infrastructure will be first to benefit.

SourcesSimon Willison — The last six months in LLMs in five minutes


3. netlab 26.05 — BGP-Free SRv6 Core Is Now Testable in Open-Source Labs

TL;DR: The netlab 26.05 release adds support for global BGP routes with SRv6 next hops on FRR and IOS XR, making BGP-free SRv6 core topology a first-class testable pattern in open-source virtual labs. Combined with new lifecycle script hooks and snapshot-aware restart, the result is genuinely CI/CD-ready lab environments for SRv6 configurations. (This covers the SRv6 and lifecycle features from the 26.05 release notes — distinct coverage from yesterday's Python 3.8 EOL post.)

Key Points:

  • BGP-free SRv6 core: global BGP routes carry SRv6 next hops on FRR and IOS XR; core forwards on SRv6 segment lists without participating in BGP — overlay and underlay cleanly decoupled, generated automatically from topology YAML
  • Junos gains OSPF/IS-IS route redistribution, VRF-scoped IS-IS instances, and per-interface OSPF parameter configuration — previously missing capabilities blocking realistic multi-vendor Juniper lab topologies
  • Lifecycle scripts attach to netlab up and netlab down hooks: device license injection, pre/post-configuration automation, and pre-lab state seeding for CI/CD
  • Juniper cSRX container support added — security policy testing without provisioning full vSRX VMs
  • Configuration reload now validates device compatibility and surfaces warnings for unclean reloads, closing a silent-failure class

BGP-free SRv6 core is a production pattern at major service providers: the IS-IS/OSPF underlay handles reachability, SRv6 handles forwarding via segment lists encoded in packet headers, and BGP runs only at the edges for service routes. This cleanly separates the BGP control plane problem (scales with edge device count) from the underlay forwarding problem (scales with core complexity). netlab 26.05 makes this topology directly testable with configuration generated automatically from a YAML topology definition.

The lifecycle script hooks are the underrated feature in this release. The ability to attach shell scripts to netlab up and netlab down enables pre-lab state injection — seeding route tables, importing route policies, pre-populating YANG datastores — and post-lab state capture for diff analysis. Combined with the rewritten snapshot-aware netlab restart, lab environments are now idempotent and reproducible across working directory changes. This means a lab run can be treated like a test suite run in a CI/CD pipeline. The implications for integration testing of SRv6 configurations before production deployment are immediate.

So What? BGP-free SRv6 core is a production pattern at major operators; netlab 26.05 removes the last excuse for not having a validation lab, and the lifecycle hooks make CI/CD integration of lab-based testing tractable in a way it wasn't before.

SourcesipSpace.net — netlab 26.05 release notes


Networking
№ 02·Networking

Networking

Plate IInetworking
Schematic leaf-spine fabric — explicit-path traffic flows across the spine plane, pods at the edges.

MRC at OCP — SRv6 Source Routing Moves to the NIC Layer

See Top 3 Highlight #1 for full coverage. The networking-specific implication: when path selection moves to NICs via SRv6, switch configuration complexity drops significantly — switches become forward-only fabric elements. YANG models and telemetry paths for SRv6 segment lists are mature; the operational tooling story is not starting from zero when MRC-capable NICs reach your procurement list.

Cisco Workforce Reduction While Revenue Grows: The same Packet Pushers NB575 episode covering MRC/OCP notes Cisco is shrinking headcount while income grows — the "more revenue, less headcount" cycle that historically precedes either aggressive internal AI tooling deployment or a major platform bet. Worth watching which direction Cisco's internal automation investment breaks over the next two quarters.

SourcesPacket Pushers NB575


Automation
Plate IIIautomation
Source-of-truth pipeline — intent → diff → apply → verify, idempotent on every revolution.

netlab 26.05 Lifecycle Scripts Enable CI/CD-Ready Lab Testing

See Top 3 Highlight #3 for full coverage. The lifecycle script hooks — executing scripts at netlab up and netlab down — enable pre-lab state injection and post-lab state capture for diff analysis. Paired with snapshot-aware restart, lab environments are idempotent and reproducible. Treating lab runs like test suite runs is now tractable.

SourcesipSpace.net — netlab 26.05

Composable AI Agents Are the Structurally Correct Architecture for Network Operations

A new arXiv paper argues that applying LLM-style foundation models to network operations is structurally flawed: wireless/network data is configuration-dependent (CSI tensors, device telemetry) and lacks the self-contained portable semantics that make LLM training work. The same structural argument applies to enterprise network operations — device telemetry is inseparable from vendor schemas and platform context. The proposed alternative: composable, task-specific models orchestrated by an agentic coordination layer. This mirrors what the automation community has converged on independently through practice. When evaluating AI-assisted NetOps tools, ask whether it's composable specialized tools with defined interfaces, or a monolithic model trained on pooled network data. The former is verifiable; the latter is not.

SourcesarXiv — Against the Monolithic Wireless World Model


AI / ML
№ 04·AI / ML

AI & Machine Learning

Plate IVai / ml
Embedding space — clusters carry related concepts; the highlighted query vector pulls its nearest neighbors.

Open Agent Leaderboard — Architecture Matters More Than Model Choice

IBM Research and Hugging Face launched the Open Agent Leaderboard evaluating full agent systems — tools, planning, memory, error recovery — not models in isolation. Across six benchmarks (SWE-Bench Verified, BrowseComp+, AppWorld, tau2-Bench), the top five configurations use the same underlying model but differ significantly in performance and cost due purely to agent architecture. General-purpose agents now match or beat specialized ones across most benchmarks. Critically, failed runs cost 20–54% more than successful ones — failure mode economics matter as much as success rate for deployment budgeting. Open-weight models (DeepSeek V3.2, Kimi K2.5) lag closed-source by 18–29 percentage points specifically on agentic multi-step tasks.

For network automation evaluators: if agent architecture explains most of the performance variance, evaluating AI-assisted NetOps tools by asking them questions in a chat window is optimizing the wrong variable. The planning loop, tool definitions, and error recovery logic are where the real leverage is. Ask what happens on the second and third failed action attempt.

SourcesHugging Face / IBM Research — Open Agent Leaderboard

Google I/O 2026 Live — Gemini Intelligence Becomes the Android 17 OS Layer

Google's May 19 keynote positioned Gemini Intelligence as an OS-level agentic layer in Android 17 — cross-app persistent agents, proactive workflow automation, and human-in-the-loop approval for transactions confirmed as the design pillars. Android XR smart glasses shipping with Gemini 2.5 Pro represent the first consumer wearable with persistent vision context tied to a frontier model. The AppFunctions developer surface is the watch item: a documented API for agents to interact with Android apps has non-trivial implications for network management tooling on mobile platforms. (Specific Gemini model version claims remain unverified at publication time — the platform/OS architecture is confirmed; wait for official engineering posts for model version specifics.)

Google's advantage here is distribution, not model benchmarks. Shipping an agentic layer to every Android device via an OS update is something OpenAI and Anthropic cannot do. That distribution moat is more durable than any benchmark lead.

SourcesAndroid Central — Google I/O 2026 Live Blog

AI Lab Margins Are Structurally Squeezed From Both Ends

The Register's analysis argues AI labs are heading for margin compression simultaneously from both directions: frontier model training costs measured in hundreds of millions of dollars per run, while inference pricing races toward commodity levels. Cloud providers and infrastructure vendors — including the networking fabric layer — are structurally better positioned for durable margin than the model labs themselves. For enterprise buyers, this affects vendor stability calculations: labs subsidizing inference pricing with venture capital may not be around in five years in their current form. The picks-and-shovels thesis continues to be the right call.

SourcesThe Register — AI company margins


Datacenter
№ 05·Datacenter

Datacenter & Infrastructure

Plate Vdatacenter
Datacenter row — per-rack utilization at a glance. Cool colors are slack; warmer fills are pressure.

Dell Tech World 2026 — PowerRack and the Sovereign AI Bet

Dell's opening keynote focused entirely on sovereign and on-premises AI, anchored by PowerRack — a turnkey rack-scale compute, storage, and networking solution. The networking configuration ships eight Dell PowerSwitch SN6600-LD (NVIDIA Spectrum-based) switches delivering over 800 Tb/s of east-west switching capacity, with a deployment target of under six and a half hours from delivery to operational. The PowerCool CDU C7000 handles up to 220 kW of heat intake and is explicitly rated for the NVIDIA Vera Rubin NVL72 rack-scale platform (warm-water inlet up to 40°C). Mistral AI is running LLM training on liquid-cooled PowerRack with GB200 NVL72 systems; Google and Dell are bringing Gemini 3 Flash into fully on-premises confidential computing via Google Distributed Cloud.

The sovereign AI framing leans on partnership logos in places, but the underlying rack-scale fabric architecture is real. The six-and-a-half-hour deployment target is a managed-services hook as much as an engineering claim — Dell is selling a factory-floor-to-running-cluster experience. For regulated industries and government contexts where public cloud is unavailable for AI workloads, this is a credible and now clearly provisioned option.

SourcesServeTheHome — Dell Tech World 2026, DataCenter Dynamics — Dell PowerRack launch

67.7 GW and Rising — The 6.25% Political Threshold Is Starting to Bite

IDCA's 2026 report puts global datacenter capacity at 67.7 GW, up 36% year-over-year, with five countries controlling nearly 70% of global total. IDCA identifies 6.25% of national grid consumption as the inflection point where political friction accelerates from slow to damaging — not a regulatory limit, but an empirically consistent threshold across markets.

Countries already past it: Netherlands at 9.7%, Germany at 6.1%, Singapore at 19.5% — all experiencing regulatory tightening. In the US, approximately 12 GW of planned 2026 capacity is under active construction; the remainder is stalled by grid bottlenecks, transformer/switchgear shortages, and tariff impacts. The most valuable datacenter asset right now is not a building or hardware — it's a signed grid interconnection agreement.

SourcesComputer Weekly / IDCA, DataCenter Dynamics

Argentum AI's $2.5B European Deal — Independent GPU Compute at Scale

Argentum AI signed a $2.5 billion, 300 MW datacenter framework agreement with cloud gaming firm Boosteroid and real estate developer DL Invest Group, targeting hyperscaler-independent enterprise AI workloads across Europe. Boosteroid's operational credibility is real — 29 datacenters and roughly 8 million users from GPU-intensive cloud gaming workloads is not a paper credential. DL Invest has up to 12 sites planned. The independent-compute positioning is genuine demand from enterprises navigating AI Act compliance and data residency requirements. That said, three hundred megawatts is an ambitious new-entrant footprint; pressure-test operational depth before any procurement commitment.

SourcesPR Newswire — Argentum AI, The Fast Mode


Science
Plate VIscience
Field schematic — three-body stability under quasi-equal masses, drawn from the day's central result.

SpaceMoE — Running Distributed LLM Inference Across Satellite Constellations

(arXiv preprint, not peer-reviewed) A May 2026 paper proposes a framework for deploying Mixture-of-Experts LLM inference distributed across satellite nodes, exploiting sparse expert activation to work within satellite memory, energy, and thermal constraints. The design addresses three problems specific to orbital deployment: expert placement across the constellation, expert selection given dynamic orbital geometry, and hidden-state transmission between satellites as they move. The inter-satellite hidden-state routing problem is structurally identical to segment routing with strict path constraints — a framing immediately legible to anyone who's worked with SR-MPLS or SRv6.

The deeper resonance: SpaceMoE is the extreme case of the architectural principle appearing everywhere this week — distribute intelligence to the endpoint closest to the problem, simplify the thing in the middle. Whether the endpoint is a NIC in an AI rack (MRC), a satellite in low-Earth orbit (SpaceMoE), or a composable specialist model in a NetOps pipeline, the same engineering response to scale constraints keeps surfacing independently. Production timelines for orbital inference are speculative; the architectural pattern is not.

SourcesarXiv — SpaceMoE

Gödel's Incompleteness Revisited — Limits as an Invitation to Go Further

A Quanta Magazine piece challenges the standard pessimistic reading of Gödel's 1931 theorems. Rather than a permanent ceiling on what formal systems can prove, the reframing treats incompleteness as a cartographic finding: it reveals mathematical territory that cannot be captured by any fixed axiom set, prompting richer, more extensible formal systems rather than abandonment of formal methods. The computer science connection is direct — the halting problem is a corollary, and the same boundary applies to every formal verification tool in a network automation stack. Batfish and Forward Networks work within specific formal systems with defined guarantees; they do not claim to verify everything about every network. The optimistic read — build richer, extensible verification frameworks rather than abandoning formal methods — is the right engineering response.

SourcesQuanta Magazine


Quick Takes
№ 07·Quick Takes

Quick Takes

  • VMware Quietly Debuts ARM Hypervisor Tech Preview: Broadcom showed ESX running on ARM. Small fraction of enterprise deployments today, but the signal is clear: Broadcom is not ceding ARM hypervisor space to cloud-native alternatives by default.

  • China Brain Implants Moving From Trials Toward Real-World Use: Nature News reports Chinese startups are transitioning AI-assisted BCIs toward commercial deployment for motor and speech restoration. The AI inference stack here (transformer-based models decoding neural spike trains) is converging with robotics and autonomous systems software. [Lightly sourced — Nature article paywalled.]

  • Dell–Samsung AI Chipmaking Infrastructure Deal: Dell announced support for Samsung with infrastructure for AI-driven chipmaking — another datacenter-semiconductor integration story worth tracking as chipmaking increasingly requires the same power-dense compute environment as AI training.

SourcesThe Register — VMware ARM, Nature News — China BCIs, DataCenter Dynamics — Dell Samsung


Watch Today
№ 08·Watch Today

Watch Today

  • Google I/O Day 1 continued: AppFunctions developer surface details, official Gemini model version confirmation, and agentic governance model specifics
  • Dell Tech World: Live PowerRack networking specifications — full NOS and fabric tooling details
  • OCP MRC adoption signals: AMD, Broadcom, and Intel NIC implementation timeline commitments
  • netlab 26.05: If you're planning SRv6 work, today is a good day to set up the BGP-free core topology test

Automation
№ 09·Automation

Pipeline Stats

Plate VIIautomation
Source-of-truth pipeline — intent → diff → apply → verify, idempotent on every revolution.
  • Articles processed: 80 (RSS digest) + targeted web searches across 5 domains
  • Topics researched: 15 across 5 domains (networking, automation, ai-ml, datacenter, science)
  • Quality score average: 4.5/5
  • Cooldown rejections enforced: 2 (Google I/O Android Authority preview URL from 2026-05-18; netlab 26.05 Python EOL post from 2026-05-18 — both replaced with different URLs covering new developments)
  • Security: No significant architecture updates today
  • Dominant domain: Automation/Networking (4 items with deep coverage)
Subscribe

Get the briefing in your inbox.

One email per weekday morning. Same writing, same sources — no audio required.