Brutalist archive interior with translucent markdown-inscribed panels suspended in warm amber light - representing structured text as the architectural medium of human-agent communication
Back to Observatory

The Observatory · Issue 038 · February 2026

Agentic Markdown: Agent-Readable Content | AXD

The Lingua Franca of Human-Agent Communication

By Tony Wood·30 min read


I. The Dual Audience Problem

For three decades, the web had one audience. Every page, every stylesheet, every interaction was designed for a human sitting in front of a screen. HTML was the language of that audience - a markup language built to describe how content should appear to human eyes. CSS governed how it should look. JavaScript governed how it should behave. The entire architecture of the web was an architecture of human attention.

That era is ending. One in every thirty-one visits to a website is now from a non-human agent. AI crawlers, shopping agents, research assistants, and autonomous systems traverse the web not to see content but to understand it. They do not need navigation bars, hero images, or CSS animations. They need meaning - structured, semantic, efficiently parseable meaning. And the language they prefer for that meaning is not HTML. It is Markdown.

This creates what we might call the dual audience problem - the foundational design challenge of the agentic web. Every website, every piece of digital content, every online presence now serves two fundamentally different audiences with fundamentally different needs. Humans need visual hierarchy, aesthetic appeal, and interactive affordances. Agents need semantic clarity, structural consistency, and token efficiency. Designing for one audience while ignoring the other is no longer viable. The organisations that thrive in the agentic age will be those that design for both.

This is not merely a technical challenge. It is a design challenge - and specifically, an Agentic Experience Design challenge. The dual audience problem sits at the intersection of Signal Clarity, Trust Architecture, and Absent-State Design. How an organisation presents itself to agents determines how those agents represent it to humans. The Markdown layer is not a technical afterthought. It is a design surface - and increasingly, the most consequential one.


II. What Agentic Markdown Is

Markdown itself is not new. Created by John Gruber in 2004 as a lightweight way to write HTML more comfortably, it has been the lingua franca of developer documentation for two decades. Every README.md file on GitHub, every technical blog post, every wiki page - Markdown has been quietly ubiquitous in the developer ecosystem since its inception.

What is new is its role. Agentic Markdown is the emerging practice of using structured Markdown as the primary communication layer between humans and autonomous AI agents. It encompasses three distinct but interconnected functions: how agents discover content, how agents consume content, and how agents receive instructions from humans. In each function, Markdown serves as the medium - not because it was designed for this purpose, but because its properties make it uniquely suited to the task.

Those properties are worth enumerating. Markdown is lightweight - a heading costs three tokens rather than the twelve to fifteen that its HTML equivalent demands. It is semantic - its structure directly encodes meaning through headings, lists, links, and emphasis rather than through visual presentation. It is diff-friendly - changes can be tracked, versioned, and reviewed through standard source control workflows. It is universal - not proprietary to any platform, vendor, or ecosystem. And it is human-readable - unlike JSON or XML, a Markdown file can be read and authored by a human without any tooling.

This combination of properties - lightweight, semantic, versionable, universal, and human-readable - makes Markdown the natural medium for the agentic web. It is the format in which humans can express intent clearly enough for agents to act upon, and in which agents can consume content efficiently enough to represent it faithfully. Agentic Markdown is not a specification or a standard. It is a design practice - the practice of structuring human knowledge for agent consumption.


III. The Three Layers

Agentic Markdown operates across three distinct layers, each serving a different function in the relationship between humans, content, and agents. Understanding these layers is essential for any organisation designing for the agentic web.

The Discovery Layer governs how agents find and understand a site. This is the domain of llms.txt and llms-full.txt - Markdown files placed at a website's root that provide AI models with a structured map of the site's content, purpose, and authority. The Discovery Layer answers the agent's first question: What is this place, and what can I learn here?

The Content Layer governs how agents consume the actual substance of a site. This is the domain of Markdown-for-Agents - the practice of serving content in structured Markdown format when an agent requests it, stripping away the HTML, CSS, and JavaScript that serve human presentation but add no semantic value for agents. The Content Layer answers the agent's second question: What does this content actually say?

The Instruction Layer governs how agents receive behavioural guidance from humans. This is the domain of AGENTS.md, SKILL.md, prompt files, and instruction files - Markdown documents that encode rules, workflows, and constraints for agent behaviour. The Instruction Layer answers the agent's third question: How should I behave here, and what are the rules? Together, these three layers form the complete architecture of Agentic Markdown - the communication infrastructure of the human-agent relationship.


IV. The Discovery Layer

The llms.txt standard, proposed by Jeremy Howard in late 2024 and adopted with extraordinary speed, represents the first formal acknowledgment that websites need a machine-readable identity layer. Just as robots.txt told search engine crawlers what they may access, llms.txt tells AI agents what they should understand. The shift from permission to comprehension is the shift from the search age to the agentic age.

The growth has been remarkable. Adoption of llms.txt has increased by 1,835 per cent since June 2025, according to Webflow. Over twenty per cent of enterprise brands are now experimenting with the format. This is not incremental adoption. It is a recognition that the agent audience has arrived and demands to be addressed.

From an AXD perspective, the Discovery Layer is an exercise in Signal Clarity. The llms.txt file is a signal - a structured declaration of what an organisation is, what it knows, and what authority it claims. The quality of that signal determines how accurately agents represent the organisation to the humans they serve. A poorly structured llms.txt file produces hallucinations, misrepresentations, and lost authority. A well-structured one produces accurate citations, faithful representations, and earned trust.

The companion file, llms-full.txt, extends the Discovery Layer with comprehensive content - full definitions, key concepts, frequently asked questions, and citation guidance. Where llms.txt is the map, llms-full.txt is the territory. Together, they form a complete discovery architecture that enables agents to understand not just what a site contains, but what it means - its intellectual position, its canonical vocabulary, and its authoritative claims. This is Intent Architecture applied to the machine-readable web.


V. The Content Layer

In February 2026, Cloudflare unveiled its Markdown for Agents feature - a single toggle that automatically converts any website's HTML into structured Markdown when an agent requests it. The mechanism is elegant: when an agent includes an Accept: text/markdown header in its request, the server responds with clean Markdown rather than the full HTML page. The same URL serves two audiences - humans receive the rich visual experience, agents receive the semantic essence.

The metaphor offered by Cloudflare's Will Allen is instructive: feeding an LLM raw HTML is like giving a chef your entire kitchen - the utensils, the fridge, the sink, and every random ingredient - when all they really need is the recipe. Markdown is the recipe: structured, essential, and easy to follow. The navigation bars, footer links, CSS classes, script tags, and div wrappers that constitute a modern web page have zero semantic value for an agent. They are noise. Markdown strips the noise and delivers the signal.

This is not merely a convenience. It is a design imperative. As Linda Tong, CEO of Webflow, observes: "If the way that you structure those paragraphs, or you structure content on your site, isn't easily understood by an LLM, it starts to hallucinate and it misrepresents you." The Content Layer is therefore not just about efficiency - it is about representational fidelity. How you structure your Markdown determines how accurately agents represent your content, your brand, and your authority to the humans they serve.

From an AXD perspective, the Content Layer connects directly to Agent Observability. Cloudflare's implementation includes an x-markdown-tokens response header that tells the requesting agent exactly how many tokens the content will consume. This is observability at the protocol level - the agent can monitor its own consumption, budget its context window, and make informed decisions about what to read and what to skip. The Content Layer does not merely serve content. It enables agents to be self-aware consumers of content.


VI. The Instruction Layer

The most profound development in Agentic Markdown is not about content consumption. It is about behavioural governance. Markdown has evolved from a documentation format into an instruction layer - a medium through which humans encode rules, workflows, constraints, and expectations that govern how agents behave.

The evidence is everywhere. GitHub's .github/copilot-instructions.md provides always-on behavioural guidance for Copilot across an entire repository. Prompt files (*.prompt.md) encode reusable task-scoped instructions that can be invoked like commands. SKILL.md files define agent capabilities as version-controlled Markdown documents. AGENTS.md - adopted by over sixty thousand open-source projects - provides a standardised format for guiding coding agents that interact with a repository.

What unites these formats is a single architectural principle: instructions should live alongside the artefacts they govern. A SKILL.md file sits in the same repository as the code it instructs agents about. A prompt file sits in the same workspace as the project it addresses. Instructions are not external configurations or cloud-hosted rules. They are co-located, version-controlled, reviewable artefacts - subject to the same pull request, code review, and change management processes as the code itself.

From an AXD perspective, the Instruction Layer is the most direct implementation of Delegation Design that currently exists. When a developer writes a SKILL.md file, they are performing an act of delegation - specifying what the agent should do, how it should do it, and what constraints it must observe. The Markdown file is the delegation contract. It is human-readable, machine-interpretable, and auditable. It is, in the language of AXD, a trust architecture expressed in plain text.



VIII. Why HTML Fails the Agentic Test

HTML was designed to describe the presentation of content, not its meaning. A heading in HTML - <h2 class="section-title" id="about">About Us</h2> - carries twelve to fifteen tokens of presentational metadata for three tokens of semantic content. Multiply this across an entire web page - with its div wrappers, navigation bars, script tags, footer links, and CSS class names - and the signal-to-noise ratio becomes catastrophic. A typical web page delivers perhaps ten per cent semantic content and ninety per cent presentational scaffolding.

For human browsers, this scaffolding is essential. It creates the visual hierarchy, the interactive affordances, and the aesthetic experience that make the web usable and beautiful. For agents, it is pure waste. Every token spent parsing a navigation bar is a token not spent understanding the content. Every token consumed by a CSS class name is a token subtracted from the agent's context window - the finite space within which it must hold, process, and reason about information.

The implications extend beyond efficiency. When agents must extract meaning from HTML, they are performing an interpretation - inferring semantic structure from presentational cues. This interpretation is inherently lossy and error-prone. A beautifully written article laden with metaphors and thematic transitions across multiple paragraphs will be processed by an LLM chunk by chunk, with each paragraph treated as a separate unit. Ideas that span paragraphs get fragmented. Nuance gets lost. The agent's representation of the content diverges from the author's intent.

Markdown eliminates this interpretation step. The structure is the meaning. A heading is a heading. A list is a list. A link is a link. There is no presentational layer to strip away, no CSS classes to ignore, no script tags to skip. The agent receives exactly what it needs - semantic content in a format that maps directly to its processing architecture. This is why Agentic Markdown is not merely a preference or a convenience. It is a design requirement for any organisation that wants agents to represent its content faithfully.


IX. The Economics of Token Efficiency

The shift to Agentic Markdown is not driven by aesthetics or ideology. It is driven by economics. Tokens are the fundamental unit of cost in the AI economy. Every input token consumed by an LLM costs computation, energy, and money. Every unnecessary token - every div wrapper, every CSS class, every script tag - is waste that degrades both performance and economics.

Cloudflare's data quantifies the scale of this waste. A simple "About Us" heading in Markdown costs approximately three tokens. Its HTML equivalent - with class attributes, id attributes, and surrounding div wrappers - costs twelve to fifteen tokens. That is a four-to-five-fold increase in cost for zero increase in semantic value. Extrapolated across the billions of pages that agents crawl daily, the aggregate waste is staggering. The AI industry is spending billions of dollars processing tokens that carry no meaning.

The economic argument creates a powerful alignment of incentives. Content owners benefit because their content is consumed more accurately and represented more faithfully. AI companies benefit because their processing costs decrease and their output quality improves. Users benefit because the agents serving them work faster, cheaper, and with fewer hallucinations. This three-way alignment is rare in technology - and it explains the extraordinary speed of Agentic Markdown adoption.

From an AXD perspective, token efficiency connects to the broader principle of resource-aware design. Every agentic system operates within resource constraints - context windows, API rate limits, computational budgets. Designing for these constraints is not optimisation for its own sake. It is a form of trust architecture - ensuring that the agent can fulfil its delegated responsibilities within the resources available. An agent that exhausts its context window parsing HTML noise before reaching the substantive content has failed not because of a capability limitation, but because of a design failure on the part of the content provider.


X. Agentic Markdown: Implications for Practitioners

Design for two audiences from the start. Every digital presence must now be designed with both human and agent audiences in mind. This does not mean compromising the human experience. It means creating a parallel agent experience - through llms.txt, through Markdown-for-Agents content negotiation, through structured data - that serves agents with the same intentionality that the visual layer serves humans. The dual audience problem is not a problem to be solved once. It is a permanent condition of the agentic web.

Treat your Markdown layer as a design surface. The llms.txt file is not a technical configuration. It is a design artefact - a carefully crafted declaration of organisational identity, intellectual authority, and canonical vocabulary. It deserves the same design attention as a homepage hero section. The structure, language, and completeness of your Markdown layer directly determines how agents represent you. Design it with the same rigour you apply to your visual brand.

Implement content negotiation. Serving Markdown to agents who request it - via the Accept: text/markdown header - is the Content Layer equivalent of responsive design. Just as responsive design serves different layouts to different screen sizes, content negotiation serves different formats to different audiences. Cloudflare's single-toggle implementation makes this trivially easy for its customers. For others, the principle remains: the same URL should serve the right format to the right audience.

Version-control your agent instructions. If your organisation uses AI agents - for development, for customer service, for commerce - the instructions governing those agents should be Markdown files in source control. Not prompts in a dashboard. Not configurations in a cloud console. Markdown files, reviewed in pull requests, subject to change management, auditable by anyone on the team. The Instruction Layer is a governance layer. Treat it with the rigour that governance demands.

Design your consent architecture explicitly. Decide - deliberately, strategically, with full awareness of the implications - what agents may do with your content. Use Content Signals, robots.txt extensions, and selective access controls to encode that decision. Do not leave consent to default settings or platform assumptions. The Consent Horizon applies to content as much as it applies to personal data. Your Markdown layer is where consent is expressed.

Recognise that Markdown is the new design material. For thirty years, the design materials of the web were HTML, CSS, and JavaScript. For the next thirty, the design materials will include Markdown - not as a replacement for visual design, but as a parallel medium for agent communication. The organisations that master Agentic Markdown will not merely be found by agents. They will be understood by agents - accurately, faithfully, and in their own voice. In the age of agentic commerce and agentic shopping, being understood by agents is not a technical nicety. It is a competitive necessity. The Markdown layer is where that understanding begins.


Frequently Asked Questions