SEAN HOWELL/WRITING ← back

essay

On the Microfauna of Megaprojects

Why the biggest machines of the next decade will be inhabited by dense ecologies of small machine intelligences, and why the organizations that matter should start cultivating them now.

March 2026

I increasingly think the control-plane metaphor is too neat for what is coming. It still imagines a clean hierarchy: machine below, intelligence above, human at the top. The next decade's serious systems will look stranger than that. A utility-scale quantum computer, an autonomous lab, an advanced chip fab, or a large robot fleet will be inhabited by something more like microfauna: dense populations of small, specialized machine intelligences living inside the project and feeding on its telemetry, schedules, faults, work orders, and simulations. Some will watch drift. Some will route jobs. Some will reconcile sensors. Some will triage repairs. Some will negotiate between throughput and risk. Most of them will be boring. All of them will matter.

My own work on scientific agents is part of why I have grown impatient with the word copilot. A copilot sits beside a human and waits to be asked. These systems need resident operators woven into the substrate. Quantum already tells the story. IBM's current fault-tolerant plan is explicitly modular and adaptive, with real-time decoded measurements that can change subsequent instructions, a compact decoder architecture aimed at FPGA or ASIC implementation, and a 2029 target of 200 logical qubits running 100 million gates.1 Google reports the same basic truth from another direction: on Willow, error correction now works well enough that decoder latency itself becomes a first-order systems problem, with 50–100 microseconds of delay already material to performance.2 PsiQuantum, meanwhile, is now talking in the language of tier-1 foundries, thousands of wafers, cryogenic cabinets supporting hundreds of chips, and million-qubit machines.3 These have become operational ecologies.

The small agents arrive first

The same pattern is now showing up in other domains with surprisingly little disguise. Applied Materials describes chipmaking as a continuous chain from atomic-scale materials simulation to fab-scale digital twins, with millions of measurements, thousands of process variables, and virtual studies that now collapse weeks into same-day work.4 The U.S. Commerce Department has backed a semiconductor digital-twin institute with explicit goals to cut development and manufacturing costs by more than 40 percent and cycle times by 35 percent.5 BMW is scaling digital twins across more than 30 production sites, expects up to 30 percent lower planning costs, and says the system is being expanded with generative and agentic AI assistants.6 Amazon has deployed its millionth robot and says its DeepFleet model should improve travel efficiency across the fleet by 10 percent.7 NVIDIA is now publishing blueprints for robot-fleet twins and AI-factory operations that explicitly connect power, cooling, networking, and operations agents.8 Siemens, working with NVIDIA, is talking openly about an industrial AI operating system and fully AI-driven adaptive manufacturing blueprints starting in 2026, while PepsiCo is already using its new twin stack to catch potential facility issues before physical changes are made.9

In science, the boundary is dissolving just as fast. Nature papers now describe autonomous mobile robots for exploratory chemistry, LLM-driven systems that can design, plan, and execute experiments through tool use, and growing interest in self-driving labs as shared network infrastructure rather than one-off curiosities.1011 The practical point is that autonomy will arrive as graded resident capability, spreading through subsystems one role at a time. The important architectural fact is that these roles are local, narrow, and continuous. A quantum computer needs agents that watch syndrome statistics, prioritize recalibration, detect creeping thermal or RF pathologies, choose when to reroute workloads, and translate application demand into live resource commitments. A fab needs agents that sit between chamber physics, metrology, recipes, and throughput. A robot fleet needs traffic organisms, map maintainers, collision forecasters, charge schedulers, and exception triage. A self-driving lab needs experiment proposers, instrument translators, literature scavengers, and safety clerks. These systems want many smaller models with limited authority, shared state, and hard boundaries, each handling the decisions inside its own scope.

Megaprojects grow into autonomy the way living systems do, by cultivating a biosphere around themselves.

The project becomes a managed ecosystem

This is the part I think people still underweight. The important agents will mostly look like janitors, dispatchers, inspectors, bookkeepers, and mechanics. One will notice that a particular syndrome pattern in a quantum module is the early smell of a calibration slip. Another will delay a process step because a metrology instrument is starting to drift. Another will rewrite robot traffic after a small layout change. Another will refuse an experiment plan because the solvent cabinet, glovebox schedule, and waste stream are already near constraint. The hard problem is maintaining local order in a system that never stops moving.

That is why I expect the winning pattern to be a society of narrower models with limited authority, shared state, and explicit law. Central planners are useful, but the combinatorics arrive faster than any single loop can absorb them. Think of a megaproject as a habitat with an immune system, a circulatory system, and a bookkeeping layer. If the data model is clean and the rules are tight, this ecology becomes extraordinarily powerful. If the data model is dirty and the rules are fuzzy, it goes feral fast.

This also changes the shape of the firm. Project management stops being mostly a matter of meetings and starts becoming a live inference system. Schedules become live beliefs about what the plant can really do this week. Change control moves from PDF procedure to machine-readable law. The org chart matters less than the permission graph: who, or what, is allowed to sense, decide, and act.

My aggressive bet is that by the early 2030s the decisive moat in many of these systems will sit less in the headline hardware than in the resident ecology around it. Two teams may have comparable robot cells, comparable process tools, comparable qubits, even comparable foundation models. One will have years of replayable telemetry, shadow agents, simulator traces, policy history, and incident memory. The other will have dashboards, tickets, heroics, and tribal knowledge. From the outside they will look similar. Inside, one will compound and the other will stall.

Predictions, sooner than people think

By 2027, every credible fault-tolerant quantum effort will have agentic operations in shadow mode across calibration prioritization, decoder tuning, workload admission, and failure diagnosis. By 2028, at least one major fab, robot-heavy factory, or autonomous-lab network will treat the live digital twin as a commissioning requirement rather than a planning aid. By 2029, large industrial systems will begin to budget inference, simulation, and policy infrastructure the way they already budget power, cooling, and cleanroom utilities. That same year, I would expect the first utility-scale quantum programs to discover that the real bottleneck is the coordination burden of keeping the whole stack inside its useful envelope.

By 2030, the phrase operator headcount will need an asterisk. The median consequential decision inside a frontier lab, fab, or robot fleet will already have been made, filtered, or scheduled by machine intelligence before a human ever sees it. Humans will still set objectives, own risk, sign off on escalations, and intervene in novel situations. But they will sit above the runtime as the constitutional layer.

By 2032, procurement, finance, and regulation will start to absorb this. Black-box equipment that cannot emit structured state, accept policy-mediated commands, or participate in replayable digital twins will begin to look obsolete. Insurers and auditors will want incident lineage, action traces, uncertainty thresholds, and rollback proofs. In other words, the microfauna will become part of the official operating model.

What to do now

The immediate work is much less glamorous than the vision. First, instrument the system as if it is going to host machine operators. High-frequency telemetry, calibration histories, maintenance records, operator notes, work orders, simulator outputs, and control actions need to land in a shared event model with stable identifiers and real timestamps. You cannot grow useful microfauna in a swamp of screenshots, spreadsheets, and PDF procedures.

Second, stop treating digital twins as presentationware. Build them for replay, counterfactuals, and shadow operation. Every meaningful intervention should be testable against a twin or at least a surrogate before it touches live hardware. Every incident should become training material. Every near miss should be reproducible enough that an agent can learn from it without rerunning the failure in the plant.

Third, write a machine constitution before you write a machine personality. Most of the engineering here is permissions, not prompt craft: what an agent may observe, propose, execute, defer, cancel, or escalate; what uncertainty bars apply; what hard invariants may never be crossed; what rollback path exists if the model is wrong; which human signatures are required when the stakes change. A megaproject without this will still grow microfauna. It will just grow them accidentally.

Fourth, seed the ecology with narrow residents. Give one agent ownership of calibration prioritization. Give another maintenance triage. Give a third responsibility for routing robot traffic, ranking experiments, or reconciling metrology against throughput. Run them in shadow mode. Compare them against human baselines. Keep their action surfaces small until they earn more. The goal is to discover which organisms are actually useful and under what constraints.

Finally, staff this like core infrastructure. The right team is a joint operating group of controls people, software people, domain scientists, reliability engineers, and whoever carries operational accountability. If the system is strategic, the ecology around it is strategic.

I think the next decade ends with something stranger and more practical than one majestic industrial mind quietly running everything: megaprojects dense with constrained machine operators, each unremarkable on its own, together forming the metabolism of the system. Such microfauna already exist in early forms. The open question is whether we cultivate their habitat deliberately enough to keep the project coherent or allow them to grow feral.

sources

  1. IBM Quantum Blog, "How IBM will build the world's first large-scale, fault-tolerant quantum computer", June 10, 2025; and IBM, "Quantum computing hardware and roadmap", accessed March 30, 2026.
  2. Google Research, "Making quantum error correction work", December 9, 2024; and Michael Newman et al., "Quantum error correction below the surface code threshold", Nature, 2025.
  3. PsiQuantum, "PsiQuantum Announces Omega, a Manufacturable Chipset for Photonic Quantum Computing", February 26, 2025; and PsiQuantum, "About", accessed March 30, 2026.
  4. Applied Materials, "AIx", accessed March 30, 2026; and Applied Materials, "Applied Materials Collaborates With NVIDIA to Accelerate End-to-End Chip Manufacturing", February 18, 2026.
  5. U.S. Department of Commerce, "Biden-Harris Administration Awards Semiconductor Research Corporation Manufacturing Consortium Corporation $285M for New CHIPS Manufacturing USA Institute for Digital Twins", January 3, 2025.
  6. BMW Group, "BMW Group scales Virtual Factory", June 11, 2025; and NVIDIA, "BMW Group Develop Custom Application on NVIDIA Omniverse", accessed March 30, 2026.
  7. Amazon, "Amazon launches a new AI foundation model to power its robotic fleet and deploys its 1 millionth robot", June 30, 2025.
  8. NVIDIA, "NVIDIA Unveils 'Mega' Omniverse Blueprint for Building Industrial Robot Fleet Digital Twins", January 6, 2025; NVIDIA, "NVIDIA Releases Vera Rubin DSX AI Factory Reference Design and Omniverse DSX Digital Twin Blueprint With Broad Industry Support", March 16, 2026; and NVIDIA, "NVIDIA Omniverse", accessed March 30, 2026.
  9. NVIDIA Newsroom, "Siemens and NVIDIA Expand Partnership to Build the Industrial AI Operating System", January 6, 2026; and Siemens, "Siemens unveils technologies to accelerate the industrial AI revolution at CES 2026", January 6, 2026.
  10. Tianyi Dai et al., "Autonomous mobile robots for exploratory synthetic chemistry", Nature, 2024; and Daniel A. Boiko et al., "Autonomous chemical research with large language models", Nature, 2023.
  11. R. B. Canty et al., "Science acceleration and accessibility with self-driving labs", Nature Communications, 2025; and L. Hung et al., "Autonomous laboratories for accelerated materials discovery: a community survey and practical insights", Digital Discovery, 2024.
← back to writing