Guide

llms.txt: The Complete Guide to the New Standard for AI Crawlers

llms.txt is a proposed web standard that lets you publish a curated map of your site for large language models. 10.13% of domains have already adopted it -- but does it actually move AI citations? This guide covers the spec, the data, the major adopters, and an honest answer on whether to implement.

Nisha Kumari|April 25, 202610 min read

llms.txt is a proposed web standard that lets website owners publish a curated, markdown-formatted map of their site for large language models. Like robots.txt is for search engine crawlers and sitemap.xml is for indexers, llms.txt is for the AI systems that increasingly read the web on behalf of users -- ChatGPT, Claude, Perplexity, and the wave of developer tools built on top of them.

10.13%

of 300,000 analyzed domains have an llms.txt file -- adoption is real but not universal (SE Ranking, 2026)

The standard was proposed in September 2024 by Australian technologist Jeremy Howard (founder of Answer.AI and fast.ai). Nineteen months later, the format has been adopted by names like Anthropic, Stripe, Zapier, Cloudflare, Hugging Face, and Perplexity itself -- yet remains unofficial, with no AI platform formally requiring it.

This guide gives you the complete picture: what llms.txt is, how it works, who's adopted it, what the data says about its impact, how to write your own, and -- critically -- whether you should bother. The honest answer requires looking at both sides.

The Origin Story

The case for llms.txt is grounded in a problem most developers recognize: LLMs have limited context windows, and websites are messy. When a model crawls a typical site, it has to wade through navigation, ads, footers, comment threads, and JavaScript-rendered shells before getting to the substance.

llms.txt Adoption Timeline

From proposal to standard -- key milestones in 19 months

Date	Milestone
Sept 2024	Jeremy Howard (Answer.AI) proposes llms.txt standard
Q4 2024	First major adopters (Anthropic, Stripe, Hugging Face)
Q1 2025	Developer tools (Cursor, Continue, Aider) begin reading llms.txt
Q3 2025	Adoption rate reaches ~7% across analyzed domains
Q1 2026	SE Ranking publishes 300K domain study: 10.13% adoption, no measurable AI citation effect
Q2 2026	Standard remains unofficial; no AI platform formally requires it

Howard's proposal was simple: let site owners publish a single markdown file at /llms.txt that lists their most important pages with brief summaries. AI systems can request this file first, get a clean map of the site's authoritative content, and then fetch only the specific pages they need. Faster, cheaper, more accurate.

The format spec is intentionally minimal: an H1 title, an optional blockquote summary, and curated lists of links grouped under H2 sections. Each link includes a short description. That's it. The full spec lives at llmstxt.org.

How llms.txt Works (The Spec)

The format is a single plain-text file at the root of your domain (e.g., acme.com/llms.txt). The structure is markdown with a specific convention.

Example: A Well-Structured llms.txt

The standard format with H1 title, blockquote summary, and curated section links

# Acme SaaS

> Acme is a project management platform for distributed teams. We help 50,000+ companies organize work, track progress, and ship faster.

## Core Documentation
- [Getting Started Guide](/docs/getting-started.md): How to set up your first project in 5 minutes
- [API Reference](/docs/api.md): Complete REST and GraphQL API documentation
- [Integrations](/docs/integrations.md): 200+ pre-built integrations with major tools

## Product Details
- [Pricing](/pricing.md): Transparent pricing with calculator
- [Features](/features.md): Core feature breakdown by use case
- [Security](/security.md): SOC2 Type II, ISO 27001, GDPR compliance

## Customer Resources
- [Case Studies](/customers.md): Detailed customer success stories
- [Comparison](/compare.md): How we compare to alternatives

## Optional
- [Blog](/blog.md): Latest product updates and industry insights
- [Changelog](/changelog.md): Recent product releases

H1 title: the company or site name. Required.

Blockquote summary: one paragraph describing what the site offers. This is the highest-value real estate -- AI systems often quote it verbatim when introducing your brand.

H2 sections with link lists: grouped by purpose (Documentation, Product, Customer Resources, etc.). Each link is formatted as [Title](URL): Short description.

Optional H2 "Optional" section: content that is helpful but not core. AI systems may deprioritize this section when context is constrained.

A companion convention -- /llms-full.txt -- includes the actual page content concatenated, not just links. This is heavier but allows AI systems to load your most important content without making additional requests.

Current Adoption: The Honest Numbers

The most rigorous public data comes from SE Ranking's 300,000-domain study: 10.13% of domains have an llms.txt file. That's meaningful adoption for a 15-month-old standard, but well below universal -- 9 out of 10 sites still don't have one.

llms.txt Adoption by Traffic Tier

Adoption is remarkably consistent across all traffic levels (SE Ranking, 300,000-domain study)

What's striking is how flat the curve is. Adoption is essentially identical across traffic tiers: low-traffic sites are at 9.88%, mid-traffic at 10.54%, and high-traffic at 8.27%. There is no "enterprise advantage" here -- if anything, developer-heavy small sites are slightly ahead of large corporate ones.

The picture among known major adopters tells a different story. The list reads like a Who's Who of developer-facing companies:

Notable Companies with Public llms.txt Files

Major adopters (verified live as of April 2026)

Company	Category	URL	Why It's Notable
Anthropic	AI Lab	docs.anthropic.com/llms.txt	Maker of Claude -- ironic given Claude doesn't yet officially support llms.txt
GitHub	Dev Platform	docs.github.com/llms.txt	Most-cited developer documentation source on the web
Perplexity	AI Search	docs.perplexity.ai/llms.txt	Among first AI platforms to publish their own
Stripe	Fintech	docs.stripe.com/llms.txt	Comprehensive API documentation guide
Zapier	Automation	zapier.com/llms.txt	Developer integration directory
Cloudflare	Infrastructure	www.cloudflare.com/llms.txt	Developer documentation
Vercel	Dev Platform	vercel.com/llms.txt	Frontend deployment platform powering Next.js sites
Cursor	Dev Tool	cursor.com/llms.txt	AI-powered code editor

The pattern is clear: AI labs, developer tools, and documentation-heavy companies have moved early. Marketing-led B2B SaaS, e-commerce, and content sites are mostly waiting.

Does It Actually Work?

This is where honesty matters. The most rigorous data on llms.txt impact suggests it has no measurable effect on AI citations -- yet.

SE Ranking's analysis of 300,000 domains found no correlation between llms.txt presence and AI citation frequency. Both statistical analysis and machine learning models showed zero effect.

Search Engine Journal's coverage of the same study put it bluntly: 8 out of 9 sites saw no measurable change in traffic after implementing llms.txt. Google's John Mueller has publicly noted that none of the major AI crawlers have officially confirmed they extract information via llms.txt.

That said, there is a parallel reality. The developer-tool ecosystem -- Cursor, Continue, Aider, and various RAG (retrieval-augmented generation) frameworks -- actively reads llms.txt when it exists. For a B2B SaaS targeting developer audiences, having an llms.txt that surfaces your API docs and integration guides is genuinely useful, even if mainstream AI platforms don't formally rely on it.

The honest summary: llms.txt is not a citation hack. If you're hoping it boosts your ChatGPT mention rate tomorrow, the data says it won't. But it's a low-effort future-proofing move, and for some specific use cases it already pays off.

llms.txt vs robots.txt vs sitemap.xml

These three files address related but distinct problems. They are not substitutes for each other -- they are complements.

llms.txt vs robots.txt vs sitemap.xml

How llms.txt fits alongside existing web standards

Dimension	robots.txt	sitemap.xml	llms.txt
Purpose	Tell crawlers what NOT to crawl	Tell crawlers what to discover	Tell LLMs which content to extract
Format	Plain text directives	XML structured	Markdown with structured links
Audience	Search bots (Googlebot, Bingbot)	Search bots + AI crawlers	LLMs and dev tools
Adoption	Universal (~95%+)	Widespread (~70%+)	Low (~10%)
Required by	All major search engines	All major search engines	No platform officially requires it
Content type	Allow / disallow rules	URL list with metadata	Curated content map with summaries

robots.txt tells crawlers what they can and cannot access. It's the access-control layer. sitemap.xml tells crawlers what URLs exist and when they were last updated. It's the discovery layer. llms.txt goes one step further: it doesn't just list URLs, it curates them and tells the LLM which content matters most. It's the prioritization layer.

For a thorough technical foundation, see Ranqo's AI readiness audit guide -- which covers all three layers along with schema markup, Core Web Vitals, and the rest of the technical surface AI crawlers evaluate.

How to Write Your Own llms.txt

The implementation is the easy part. The hard part is editorial judgment -- which pages deserve to be listed, and how to summarize them for an AI audience.

Step 1: Audit your most important pages

What are the 10-30 URLs you would most want an LLM to read first when answering a question about your business? Typically: pricing, core product features, comparison pages, top documentation, key case studies, and company background.

Step 2: Write a one-paragraph summary

Your blockquote is your elevator pitch in 2-3 sentences. AI systems often quote it verbatim. Lead with what you do, who you do it for, and any quantitative differentiator ("50,000 customers," "Fortune 500 trusted," etc.).

Step 3: Group links into H2 sections

Common groupings: Core Documentation, Product, Pricing, Customer Resources, Developer Tools, Company. Use 3-7 links per section -- AI systems prefer clearly delineated short lists over long catch-all sections.

Step 4: Write descriptions that aid extraction

A description like "API Reference" is useless. A description like "Complete REST and GraphQL API documentation with code samples in Python, Node.js, and Go" is extractable. Front-load specifics.

Step 5: Consider llms-full.txt for documentation

If you're a developer-tools company, also publish /llms-full.txt with the actual concatenated content of your docs. Cursor, Continue, and Aider users will thank you.

Step 6: Validate the URL

Make sure /llms.txt returns HTTP 200 with content-type text/markdown or text/plain. Don't require login, don't use JavaScript rendering, don't block AI user agents in your robots.txt.

Common mistakes to avoid

Listing every page on your site -- llms.txt is curation, not a sitemap
Writing marketing copy -- the audience is an LLM, not a human visitor
Linking to gated or login-required pages
Using fake H2 sections with no links
Forgetting to update it when key URLs change
Treating it as a one-time setup -- treat it like a sitemap, refresh quarterly

Should You Implement llms.txt?

The decision depends on your site type and what audience you're serving. Here's a clear matrix.

Should You Implement llms.txt?

Decision matrix based on site type and use case

Site Type	Recommendation	Why
Developer tool / API platform	Yes -- implement	Cursor, Continue, Aider, RAG frameworks actively read llms.txt for context
B2B SaaS with technical buyers	Yes -- implement	Low effort, signals technical maturity, future-proofs for adoption
Documentation-heavy site	Yes -- implement	Strong fit for the format; helps LLMs navigate complex doc structures
Content / blog / publication	Optional	Modest benefit; standard SEO + GEO tactics matter more
E-commerce site	Optional	Product schema and review platforms matter more than llms.txt
Small business / local	Skip for now	Google Business Profile and local schema are higher priority

Yes, implement now if

You're a developer tool, an API platform, a B2B SaaS with technical buyers, or a documentation-heavy site. The developer ecosystem already reads llms.txt. The cost is minimal. The signal value is high (it tells technical buyers you take AI integration seriously).

Optional if

You're a content site, an e-commerce store, or a generalist B2B SaaS. llms.txt won't hurt, but it also won't move citations meaningfully today. Your time is better spent on schema markup, comparison content, and answer-first formatting -- as detailed in Ranqo's 7-step optimization playbook.

Skip for now if

You're a small business, local service provider, or individual creator. Your AI visibility wins are elsewhere -- Google Business Profile, schema markup, third-party reviews. Revisit llms.txt in 6-12 months.

The Future of llms.txt

llms.txt sits at an interesting inflection point. The format is technically sound, the early adopter list is high-quality, and the developer-tool ecosystem actively reads it. But the major AI platforms haven't formally committed to it, and the empirical citation impact is currently zero.

Three things would change the trajectory dramatically:

An official commitment from OpenAI, Anthropic, Google, or Perplexity to read llms.txt as a first-class input
An adoption inflection above 25-30% (which would create network effects similar to robots.txt's rise in the early 2000s)
A documented citation lift from a credible benchmark study (rather than the current null-effect findings)

None of these are guaranteed. The format could plateau at 10% adoption indefinitely, or it could be supplanted by a more opinionated standard. But for technical audiences, the asymmetric bet is clear: implementation cost is low, future-proofing value is real, and the worst case is that you spent two hours writing a markdown file that aged as a historical artifact.

For the broader strategic context on how AI platforms select sources and what optimization actually matters today, see Ranqo's complete GEO guide. llms.txt is one tool in the toolkit -- not a replacement for the foundational signals (content depth, schema, third-party mentions, author credibility) that actually drive AI citations today.

llms.txt is a low-cost, future-facing bet for technical sites. It's not a magic citation lever. Treat it as infrastructure: useful, sometimes underappreciated, occasionally essential.

Audit your site for AI readiness (with or without llms.txt)

Get a 6-dimension audit of your site's AI readiness: crawlability, content quality, page speed, AI extractability, citation potential, and authority. See where your highest-impact optimization opportunities are -- llms.txt is one signal among many. For the deeper foundation, see our audit guide and 5 factors that drive AI citations.

Start your audit

Written by

Nisha Kumari

Co-Founder at Ranqo

Nisha Kumari is Co-Founder at Ranqo, where she leads growth strategy and client acquisition. With a background in digital marketing and financial management, she specializes in SEO, Generative Engine Optimization, and helping brands build visibility across AI platforms.

Share this article

Guide

AI.txt vs Llms.txt vs Robots.txt: The Complete AI Crawler Control Guide for 2026

Most articles about AI crawlers ask the wrong question -- whether to block them. The strategic question is which crawlers should train your model, which should retrieve from you for citation, and where licensing replaces both. This guide covers the three control files (robots.txt, llms.txt, AI.txt), the AI crawler taxonomy, the crawl-to-referral economics that should drive your decisions (ClaudeBot 20,583:1 vs PerplexityBot 194.8:1), the Perplexity stealth-crawling case study, and industry-specific decision frameworks. Every claim is verified against published sources.

May 4, 202621 min read

Guide

What AI Actually Sees When It Crawls Your Site: A Live Walkthrough

Your site looks great in a browser. But AI crawlers see only raw HTML -- no JavaScript, no rendered components, no dynamic content. This is a live walkthrough of exactly what GPTBot, ClaudeBot, and PerplexityBot fetch when they visit, with verified data on every claim and a 6-method test you can run today.

Apr 26, 202612 min read

Guide

How to Earn a Spot on the 'Best X' Lists AI Cites

Ranked listicles are the most-cited content format in AI search — 35.7% of content citations across 102 brands. But your own site is only 2.9% of citations, so you can't write your way on. Earning a spot on the 'best X' lists AI cites is a repeatable play, not luck. Here's the six-step playbook to find the right lists and become the obvious include.

Jun 30, 20268 min read

Guide

llms.txt: The Complete Guide to the New Standard for AI Crawlers

Nisha Kumari|April 25, 202610 min read

10.13%

of 300,000 analyzed domains have an llms.txt file -- adoption is real but not universal (SE Ranking, 2026)

The Origin Story

llms.txt Adoption Timeline

From proposal to standard -- key milestones in 19 months

Date	Milestone
Sept 2024	Jeremy Howard (Answer.AI) proposes llms.txt standard
Q4 2024	First major adopters (Anthropic, Stripe, Hugging Face)
Q1 2025	Developer tools (Cursor, Continue, Aider) begin reading llms.txt
Q3 2025	Adoption rate reaches ~7% across analyzed domains
Q1 2026	SE Ranking publishes 300K domain study: 10.13% adoption, no measurable AI citation effect
Q2 2026	Standard remains unofficial; no AI platform formally requires it

How llms.txt Works (The Spec)

The format is a single plain-text file at the root of your domain (e.g., acme.com/llms.txt). The structure is markdown with a specific convention.

Example: A Well-Structured llms.txt

The standard format with H1 title, blockquote summary, and curated section links

# Acme SaaS

> Acme is a project management platform for distributed teams. We help 50,000+ companies organize work, track progress, and ship faster.

## Core Documentation
- [Getting Started Guide](/docs/getting-started.md): How to set up your first project in 5 minutes
- [API Reference](/docs/api.md): Complete REST and GraphQL API documentation
- [Integrations](/docs/integrations.md): 200+ pre-built integrations with major tools

## Product Details
- [Pricing](/pricing.md): Transparent pricing with calculator
- [Features](/features.md): Core feature breakdown by use case
- [Security](/security.md): SOC2 Type II, ISO 27001, GDPR compliance

## Customer Resources
- [Case Studies](/customers.md): Detailed customer success stories
- [Comparison](/compare.md): How we compare to alternatives

## Optional
- [Blog](/blog.md): Latest product updates and industry insights
- [Changelog](/changelog.md): Recent product releases

H1 title: the company or site name. Required.

Blockquote summary: one paragraph describing what the site offers. This is the highest-value real estate -- AI systems often quote it verbatim when introducing your brand.

H2 sections with link lists: grouped by purpose (Documentation, Product, Customer Resources, etc.). Each link is formatted as [Title](URL): Short description.

Optional H2 "Optional" section: content that is helpful but not core. AI systems may deprioritize this section when context is constrained.

Current Adoption: The Honest Numbers

llms.txt Adoption by Traffic Tier

Adoption is remarkably consistent across all traffic levels (SE Ranking, 300,000-domain study)

The picture among known major adopters tells a different story. The list reads like a Who's Who of developer-facing companies:

Notable Companies with Public llms.txt Files

Major adopters (verified live as of April 2026)

Company	Category	URL	Why It's Notable
Anthropic	AI Lab	docs.anthropic.com/llms.txt	Maker of Claude -- ironic given Claude doesn't yet officially support llms.txt
GitHub	Dev Platform	docs.github.com/llms.txt	Most-cited developer documentation source on the web
Perplexity	AI Search	docs.perplexity.ai/llms.txt	Among first AI platforms to publish their own
Stripe	Fintech	docs.stripe.com/llms.txt	Comprehensive API documentation guide
Zapier	Automation	zapier.com/llms.txt	Developer integration directory
Cloudflare	Infrastructure	www.cloudflare.com/llms.txt	Developer documentation
Vercel	Dev Platform	vercel.com/llms.txt	Frontend deployment platform powering Next.js sites
Cursor	Dev Tool	cursor.com/llms.txt	AI-powered code editor

The pattern is clear: AI labs, developer tools, and documentation-heavy companies have moved early. Marketing-led B2B SaaS, e-commerce, and content sites are mostly waiting.

Does It Actually Work?

This is where honesty matters. The most rigorous data on llms.txt impact suggests it has no measurable effect on AI citations -- yet.

SE Ranking's analysis of 300,000 domains found no correlation between llms.txt presence and AI citation frequency. Both statistical analysis and machine learning models showed zero effect.

llms.txt vs robots.txt vs sitemap.xml

These three files address related but distinct problems. They are not substitutes for each other -- they are complements.

llms.txt vs robots.txt vs sitemap.xml

How llms.txt fits alongside existing web standards

Dimension	robots.txt	sitemap.xml	llms.txt
Purpose	Tell crawlers what NOT to crawl	Tell crawlers what to discover	Tell LLMs which content to extract
Format	Plain text directives	XML structured	Markdown with structured links
Audience	Search bots (Googlebot, Bingbot)	Search bots + AI crawlers	LLMs and dev tools
Adoption	Universal (~95%+)	Widespread (~70%+)	Low (~10%)
Required by	All major search engines	All major search engines	No platform officially requires it
Content type	Allow / disallow rules	URL list with metadata	Curated content map with summaries

How to Write Your Own llms.txt

The implementation is the easy part. The hard part is editorial judgment -- which pages deserve to be listed, and how to summarize them for an AI audience.

Step 1: Audit your most important pages

Step 2: Write a one-paragraph summary

Step 3: Group links into H2 sections

Step 4: Write descriptions that aid extraction

A description like "API Reference" is useless. A description like "Complete REST and GraphQL API documentation with code samples in Python, Node.js, and Go" is extractable. Front-load specifics.

Step 5: Consider llms-full.txt for documentation

If you're a developer-tools company, also publish /llms-full.txt with the actual concatenated content of your docs. Cursor, Continue, and Aider users will thank you.

Step 6: Validate the URL

Make sure /llms.txt returns HTTP 200 with content-type text/markdown or text/plain. Don't require login, don't use JavaScript rendering, don't block AI user agents in your robots.txt.

Common mistakes to avoid

Listing every page on your site -- llms.txt is curation, not a sitemap
Writing marketing copy -- the audience is an LLM, not a human visitor
Linking to gated or login-required pages
Using fake H2 sections with no links
Forgetting to update it when key URLs change
Treating it as a one-time setup -- treat it like a sitemap, refresh quarterly

Should You Implement llms.txt?

The decision depends on your site type and what audience you're serving. Here's a clear matrix.

Should You Implement llms.txt?

Decision matrix based on site type and use case

Site Type	Recommendation	Why
Developer tool / API platform	Yes -- implement	Cursor, Continue, Aider, RAG frameworks actively read llms.txt for context
B2B SaaS with technical buyers	Yes -- implement	Low effort, signals technical maturity, future-proofs for adoption
Documentation-heavy site	Yes -- implement	Strong fit for the format; helps LLMs navigate complex doc structures
Content / blog / publication	Optional	Modest benefit; standard SEO + GEO tactics matter more
E-commerce site	Optional	Product schema and review platforms matter more than llms.txt
Small business / local	Skip for now	Google Business Profile and local schema are higher priority

Yes, implement now if

Optional if

Skip for now if

The Future of llms.txt

Three things would change the trajectory dramatically:

An official commitment from OpenAI, Anthropic, Google, or Perplexity to read llms.txt as a first-class input
An adoption inflection above 25-30% (which would create network effects similar to robots.txt's rise in the early 2000s)
A documented citation lift from a credible benchmark study (rather than the current null-effect findings)

llms.txt is a low-cost, future-facing bet for technical sites. It's not a magic citation lever. Treat it as infrastructure: useful, sometimes underappreciated, occasionally essential.

Audit your site for AI readiness (with or without llms.txt)

Start your audit

Written by

Nisha Kumari

Co-Founder at Ranqo

Share this article

Guide

AI.txt vs Llms.txt vs Robots.txt: The Complete AI Crawler Control Guide for 2026

May 4, 202621 min read

Guide

What AI Actually Sees When It Crawls Your Site: A Live Walkthrough

Apr 26, 202612 min read

Guide

How to Earn a Spot on the 'Best X' Lists AI Cites

Jun 30, 20268 min read

The Origin Story

llms.txt Adoption Timeline

How llms.txt Works (The Spec)

Example: A Well-Structured llms.txt

Current Adoption: The Honest Numbers

llms.txt Adoption by Traffic Tier

Notable Companies with Public llms.txt Files

Does It Actually Work?

llms.txt vs robots.txt vs sitemap.xml

llms.txt vs robots.txt vs sitemap.xml

How to Write Your Own llms.txt

Step 1: Audit your most important pages

Step 2: Write a one-paragraph summary

Step 3: Group links into H2 sections

Step 4: Write descriptions that aid extraction

Step 5: Consider llms-full.txt for documentation

Step 6: Validate the URL

Common mistakes to avoid

Should You Implement llms.txt?

Should You Implement llms.txt?

Yes, implement now if

Optional if

Skip for now if

The Future of llms.txt

Audit your site for AI readiness (with or without llms.txt)

Nisha Kumari

Related articles

AI.txt vs Llms.txt vs Robots.txt: The Complete AI Crawler Control Guide for 2026

What AI Actually Sees When It Crawls Your Site: A Live Walkthrough

How to Earn a Spot on the 'Best X' Lists AI Cites

The Origin Story

llms.txt Adoption Timeline

How llms.txt Works (The Spec)

Example: A Well-Structured llms.txt

Current Adoption: The Honest Numbers

llms.txt Adoption by Traffic Tier

Notable Companies with Public llms.txt Files

Does It Actually Work?

llms.txt vs robots.txt vs sitemap.xml

llms.txt vs robots.txt vs sitemap.xml

How to Write Your Own llms.txt

Step 1: Audit your most important pages

Step 2: Write a one-paragraph summary

Step 3: Group links into H2 sections

Step 4: Write descriptions that aid extraction

Step 5: Consider llms-full.txt for documentation

Step 6: Validate the URL

Common mistakes to avoid

Should You Implement llms.txt?

Should You Implement llms.txt?

Yes, implement now if

Optional if

Skip for now if

The Future of llms.txt

Audit your site for AI readiness (with or without llms.txt)

Nisha Kumari

Related articles

AI.txt vs Llms.txt vs Robots.txt: The Complete AI Crawler Control Guide for 2026

What AI Actually Sees When It Crawls Your Site: A Live Walkthrough

How to Earn a Spot on the 'Best X' Lists AI Cites