Topic dashboard

Agent Architecture & Personal Infrastructure

Last refreshed July 10, 2026 · 25 concepts

Agent Architecture & Personal Infrastructure

The harness, not the model, is becoming the moat.

My take

Start from two assumptions I treat as fixed: models will keep getting better, and they will stay non-deterministic. At any given moment one model will be ahead of another, but leadership rotates on a quarterly cadence, and non-determinism never goes away. Both facts point at the same answer: betting on a specific model is a depreciating asset, and even the best model is unsafe to ship without scaffolding around it.

What compounds, then, is not the model. It is the harness, the evals, and the memory - the layer that governs context, scopes tools, persists state across sessions, and lets you measure whether a swap actually improved anything. The ability to switch models confidently, backed by your own evaluation suite, is the capability most teams underinvest in. It is also the one that decides who has pricing power against the labs.

The mistake I see most often is treating “agent” as a model-capability question. It is an infrastructure question. Whoever owns the harness owns the workflow, the data exhaust, and the switching cost, regardless of which model is plugged in underneath. That is why open-source harnesses wrapping proprietary CLIs are a real threat to subscription economics, not a curiosity.

Over the next twelve months I expect enterprise buyers to start asking harness-shaped questions - memory, observability, permission boundaries, audit, evals - before they ask model-shaped ones. The vendors who treat the harness as a thin wrapper will lose to the ones treating it as the operating system.

Everything above the divider is mine. Everything below is auto-assembled daily from my knowledge base — individual links and summaries may be stale or off-target. Last refreshed: 2026-07-10.

What’s shifted recently

Agent Economy Infrastructure (updated 2026-07-10)
Agent economy infrastructure refers to the foundational primitives — identity, communications, payments, compute, memory, and orchestration — being built specifically for AI agent… — source · source · source
Agent Loop Engineering (updated 2026-07-10)
Agent loop engineering is the design discipline of building self-sustaining agent execution loops — control structures by which agents find work, assign tasks, verify outputs, rem… — source · source · source
Harness Engineering (updated 2026-07-10)
Harness engineering is the practice of designing the OS-layer around AI coding agents — the context governance, tool architecture, eval loops, memory management, and permission mo… — source · source · source
Agent Memory Systems Stack (updated 2026-07-09)
The agent memory systems stack is the layered taxonomy of memory types, storage backends, retrieval strategies, and lifecycle operations that collectively give AI agents continuit… — source · source · source
Context Engineering (updated 2026-07-09)
Context engineering is the discipline of designing what information enters an LLM’s context window — system prompts, retrieved documents, tool results, memory, conversation histor… — source · source · source
Parallel Coding Agent Orchestration (updated 2026-07-09)
Parallel coding agent orchestration is the practice of running multiple AI coding agents concurrently against isolated copies of a codebase, coordinated by a human or a machine co… — source · source · source
Enterprise Agent Integration Layer (updated 2026-07-07)
The enterprise agent integration layer refers to the set of CLI tools, APIs, protocol adapters, and platform extensions that established enterprise software vendors are building t… — source · source · source
Agent Memory Architecture (updated 2026-07-06)
Agent memory architecture refers to the set of mechanisms by which AI coding agents and AI coworkers maintain context that persists beyond a single session, enabling continuity ac… — source · source · source
Agent Eval Frameworks 2026 (updated 2026-06-30)
Agent evaluation frameworks in mid-2026 measure whether agents—not just base models—produce correct decisions, safe outputs, and usable code. — source · source · source
Agent Subagent Decomposition Production Pattern (updated 2026-06-29)
Agent-subagent decomposition is the architectural pattern of splitting a production AI workflow into a parent orchestrator and one or more specialized child agents, each scoped to… — source · source · source
Agent Memory Product Launches (updated 2026-06-27)
Agent memory product launches are a wave of purpose-built systems that package persistent agent memory as a product, API, or architectural layer rather than treating recall as an… — source · source · source

The ideas I keep coming back to

Currently active (last 30 days):

Agent Economy Infrastructure — Agent economy infrastructure refers to the foundational primitives — identity, communications, payments, compute, memory, and orchestration — being built specifically for AI agent…
Agent Loop Engineering — Agent loop engineering is the design discipline of building self-sustaining agent execution loops — control structures by which agents find work, assign tasks, verify outputs, rem…
Harness Engineering — Harness engineering is the practice of designing the OS-layer around AI coding agents — the context governance, tool architecture, eval loops, memory management, and permission mo…
Agent Memory Systems Stack — The agent memory systems stack is the layered taxonomy of memory types, storage backends, retrieval strategies, and lifecycle operations that collectively give AI agents continuit…
Context Engineering — Context engineering is the discipline of designing what information enters an LLM’s context window — system prompts, retrieved documents, tool results, memory, conversation histor…
Parallel Coding Agent Orchestration — Parallel coding agent orchestration is the practice of running multiple AI coding agents concurrently against isolated copies of a codebase, coordinated by a human or a machine co…
Enterprise Agent Integration Layer — The enterprise agent integration layer refers to the set of CLI tools, APIs, protocol adapters, and platform extensions that established enterprise software vendors are building t…
Agent Memory Architecture — Agent memory architecture refers to the set of mechanisms by which AI coding agents and AI coworkers maintain context that persists beyond a single session, enabling continuity ac…
Agent Eval Frameworks 2026 — Agent evaluation frameworks in mid-2026 measure whether agents—not just base models—produce correct decisions, safe outputs, and usable code.
Agent Subagent Decomposition Production Pattern — Agent-subagent decomposition is the architectural pattern of splitting a production AI workflow into a parent orchestrator and one or more specialized child agents, each scoped to…
Agent Memory Product Launches — Agent memory product launches are a wave of purpose-built systems that package persistent agent memory as a product, API, or architectural layer rather than treating recall as an…
Extended Reasoning Inference Tradeoffs — Extended-reasoning inference refers to AI model computation modes where the model allocates additional compute budget at runtime to reason explicitly through a problem before gene…
Agent Eval Observability Tooling — Agent eval and observability tooling is the category of commercial and open-source platforms that combine runtime monitoring with structured quality evaluation for deployed AI age…
Anthropic Managed Agents Platform — Claude Managed Agents is Anthropic’s hosted agent-runtime platform, providing infrastructure primitives — memory, quality grading, multiagent orchestration, and webhooks — directl…
Cross Agent Persistent Memory MCP — Cross-agent persistent memory via MCP is a local-first infrastructure pattern in which a single SQLite-backed store — exposed as an MCP server — gives every coding agent on a mach…
Agent Cost Governance — Agent cost governance is the set of practices, architectural patterns, and platform controls used to bound token spend and compute costs for autonomous AI agents running in produc…
Hermes Agent Skill Composition Framework — Hermes Agent is an open-source CLI-first agent framework built by NousResearch that structures autonomous workflows around three composable primitives: skills (discrete capability…
Model Self Improvement Research Automation — Model self-improvement research automation is the trajectory by which AI systems transition from assisting human researchers to running closed scientific discovery loops autonomou…

Established:

Local Computer Use Agents 2026 — Local computer-use agents are AI systems that control desktop GUIs, browsers, and on-device interfaces through direct hardware access without sending data to cloud inference endpo…
Agent Harness As Trained Behavior — Agent harness behavior is not determined solely by system prompts or runtime instructions—it is burned into model weights during training.

Who I’m watching

Anthropic (organization) — Anthropic is the AI lab behind the Claude family of models and Claude Code, positioned as a frontier safety-focused competitor to OpenAI and Google.
Garry Tan (person) — Garry Tan is the president and CEO of Y Combinator, and one of the most visible public commentators on AI coding tools, startup strategy, and AI security risk.
LangChain (organization) — LangChain is a framework and tooling company for building production LLM applications, with the LangChain orchestration library, the LangSmith observability platform, and the Deep…
DeepSeek (organization) — DeepSeek is a Chinese AI lab whose open-weight model releases anchor the lower end of the cost-capability frontier and contribute directly to the frontier-model-compression dynami…
Jensen Huang (person) — Jensen Huang is co-founder and CEO of NVIDIA, which under his leadership became the world’s most valuable company by capitalizing on the AI infrastructure buildout.
Moonshot AI / Kimi (organization) — Moonshot AI (月之暗面) is the Chinese lab behind the Kimi model family, including the open-weight Kimi K2.5 release that powers Cursor Composer 2.
NVIDIA (organization) — NVIDIA is the dominant supplier of GPU compute for AI training and inference, and as of 2026 the world’s most valuable public company.
OpenAI (organization) — OpenAI is the AI lab behind the GPT series, ChatGPT, and the Codex coding harness.
Peter Steinberger (person) — Peter Steinberger (X: @steipete) is the creator of OpenClaw, the open-source personal AI agent platform that reached over 160,000 GitHub stars within weeks of launch.
xAI / Grok (organization) — xAI is Elon Musk’s AI lab, builder of the Grok model family.

Sources I’ve been drawing on

x.com — cited in Agent Economy Infrastructure
venturebeat.com — cited in Agent Economy Infrastructure
x.com — cited in Agent Economy Infrastructure
x.com — cited in Agent Economy Infrastructure
www.reddit.com — cited in Agent Economy Infrastructure
www.reddit.com — cited in Agent Economy Infrastructure
architect.salesforce.com — cited in Agent Economy Infrastructure
cyberbee.dev — cited in Agent Economy Infrastructure
x.com — cited in Agent Economy Infrastructure
fourweekmba.com — cited in Agent Economy Infrastructure
tech.yahoo.com — cited in Agent Economy Infrastructure
byteiota.com — cited in Agent Economy Infrastructure