[]
Ranqo
PricingBlog
Guide

llms.txt: The Complete Guide to the New Standard for AI Crawlers

llms.txt is a proposed web standard that lets you publish a curated map of your site for large language models. 10.13% of domains have already adopted it -- but does it actually move AI citations? This guide covers the spec, the data, the major adopters, and an honest answer on whether to implement.

Nisha Kumari|April 25, 202615 min read

On this page

llms.txt is a proposed web standard that lets website owners publish a curated, markdown-formatted map of their site for large language models. Like robots.txt is for search engine crawlers and sitemap.xml is for indexers, llms.txt is for the AI systems that increasingly read the web on behalf of users -- ChatGPT, Claude, Perplexity, and the wave of developer tools built on top of them.

10.13%

of 300,000 analyzed domains have an llms.txt file -- adoption is real but not universal (SE Ranking, 2026)

The standard was proposed in September 2024 by Australian technologist Jeremy Howard (founder of Answer.AI and fast.ai). Nineteen months later, the format has been adopted by names like Anthropic, Stripe, Zapier, Cloudflare, Hugging Face, and Perplexity itself -- yet remains unofficial, with no AI platform formally requiring it.

This guide gives you the complete picture: what llms.txt is, how it works, who's adopted it, what the data says about its impact, how to write your own, and -- critically -- whether you should bother. The honest answer requires looking at both sides.

The Origin Story

The case for llms.txt is grounded in a problem most developers recognize: LLMs have limited context windows, and websites are messy. When a model crawls a typical site, it has to wade through navigation, ads, footers, comment threads, and JavaScript-rendered shells before getting to the substance.

llms.txt Adoption Timeline

From proposal to standard -- key milestones in 19 months

DateMilestone
Sept 2024Jeremy Howard (Answer.AI) proposes llms.txt standard
Q4 2024First major adopters (Anthropic, Stripe, Hugging Face)
Q1 2025Developer tools (Cursor, Continue, Aider) begin reading llms.txt
Q3 2025Adoption rate reaches ~7% across analyzed domains
Q1 2026SE Ranking publishes 300K domain study: 10.13% adoption, no measurable AI citation effect
Q2 2026Standard remains unofficial; no AI platform formally requires it

Howard's proposal was simple: let site owners publish a single markdown file at /llms.txt that lists their most important pages with brief summaries. AI systems can request this file first, get a clean map of the site's authoritative content, and then fetch only the specific pages they need. Faster, cheaper, more accurate.

The format spec is intentionally minimal: an H1 title, an optional blockquote summary, and curated lists of links grouped under H2 sections. Each link includes a short description. That's it. The full spec lives at llmstxt.org.

How llms.txt Works (The Spec)

The format is a single plain-text file at the root of your domain (e.g., acme.com/llms.txt). The structure is markdown with a specific convention.

Example: A Well-Structured llms.txt

The standard format with H1 title, blockquote summary, and curated section links

# Acme SaaS

> Acme is a project management platform for distributed teams. We help 50,000+ companies organize work, track progress, and ship faster.

## Core Documentation
- [Getting Started Guide](/docs/getting-started.md): How to set up your first project in 5 minutes
- [API Reference](/docs/api.md): Complete REST and GraphQL API documentation
- [Integrations](/docs/integrations.md): 200+ pre-built integrations with major tools

## Product Details
- [Pricing](/pricing.md): Transparent pricing with calculator
- [Features](/features.md): Core feature breakdown by use case
- [Security](/security.md): SOC2 Type II, ISO 27001, GDPR compliance

## Customer Resources
- [Case Studies](/customers.md): Detailed customer success stories
- [Comparison](/compare.md): How we compare to alternatives

## Optional
- [Blog](/blog.md): Latest product updates and industry insights
- [Changelog](/changelog.md): Recent product releases

H1 title: the company or site name. Required.

Blockquote summary: one paragraph describing what the site offers. This is the highest-value real estate -- AI systems often quote it verbatim when introducing your brand.

H2 sections with link lists: grouped by purpose (Documentation, Product, Customer Resources, etc.). Each link is formatted as [Title](URL): Short description.

Optional H2 "Optional" section: content that is helpful but not core. AI systems may deprioritize this section when context is constrained.

A companion convention -- /llms-full.txt -- includes the actual page content concatenated, not just links. This is heavier but allows AI systems to load your most important content without making additional requests.

Current Adoption: The Honest Numbers

The most rigorous public data comes from SE Ranking's 300,000-domain study: 10.13% of domains have an llms.txt file. That's meaningful adoption for a 15-month-old standard, but well below universal -- 9 out of 10 sites still don't have one.

llms.txt Adoption by Traffic Tier

Adoption is remarkably consistent across all traffic levels (SE Ranking, 300,000-domain study)

What's striking is how flat the curve is. Adoption is essentially identical across traffic tiers: low-traffic sites are at 9.88%, mid-traffic at 10.54%, and high-traffic at 8.27%. There is no "enterprise advantage" here -- if anything, developer-heavy small sites are slightly ahead of large corporate ones.

The picture among known major adopters tells a different story. The list reads like a Who's Who of developer-facing companies:

Notable Companies with Public llms.txt Files

Major adopters (verified live as of April 2026)

CompanyCategoryURLWhy It's Notable
AnthropicAI Labanthropic.com/llms.txtMaker of Claude -- ironic given Claude doesn't yet officially support llms.txt
Hugging FaceAI Platformhuggingface.co/llms.txtDeveloper-focused community for ML models
PerplexityAI Searchperplexity.ai/llms.txtAmong first AI platforms to publish their own
StripeFintechstripe.com/llms.txtComprehensive API documentation guide
ZapierAutomationzapier.com/llms.txtDeveloper integration directory
CloudflareInfrastructurecloudflare.com/llms.txtDeveloper documentation
CursorDev Toolcursor.com/llms.txtAI-powered code editor

The pattern is clear: AI labs, developer tools, and documentation-heavy companies have moved early. Marketing-led B2B SaaS, e-commerce, and content sites are mostly waiting.

Does It Actually Work?

This is where honesty matters. The most rigorous data on llms.txt impact suggests it has no measurable effect on AI citations -- yet.

SE Ranking's analysis of 300,000 domains found no correlation between llms.txt presence and AI citation frequency. Both statistical analysis and machine learning models showed zero effect.

Search Engine Journal's coverage of the same study put it bluntly: 8 out of 9 sites saw no measurable change in traffic after implementing llms.txt. Google's John Mueller has publicly noted that none of the major AI crawlers have officially confirmed they extract information via llms.txt.

That said, there is a parallel reality. The developer-tool ecosystem -- Cursor, Continue, Aider, and various RAG (retrieval-augmented generation) frameworks -- actively reads llms.txt when it exists. For a B2B SaaS targeting developer audiences, having an llms.txt that surfaces your API docs and integration guides is genuinely useful, even if mainstream AI platforms don't formally rely on it.

The honest summary: llms.txt is not a citation hack. If you're hoping it boosts your ChatGPT mention rate tomorrow, the data says it won't. But it's a low-effort future-proofing move, and for some specific use cases it already pays off.

llms.txt vs robots.txt vs sitemap.xml

These three files address related but distinct problems. They are not substitutes for each other -- they are complements.

llms.txt vs robots.txt vs sitemap.xml

How llms.txt fits alongside existing web standards

Dimensionrobots.txtsitemap.xmlllms.txt
PurposeTell crawlers what NOT to crawlTell crawlers what to discoverTell LLMs which content to extract
FormatPlain text directivesXML structuredMarkdown with structured links
AudienceSearch bots (Googlebot, Bingbot)Search bots + AI crawlersLLMs and dev tools
AdoptionUniversal (~95%+)Widespread (~70%+)Low (~10%)
Required byAll major search enginesAll major search enginesNo platform officially requires it
Content typeAllow / disallow rulesURL list with metadataCurated content map with summaries

robots.txt tells crawlers what they can and cannot access. It's the access-control layer. sitemap.xml tells crawlers what URLs exist and when they were last updated. It's the discovery layer. llms.txt goes one step further: it doesn't just list URLs, it curates them and tells the LLM which content matters most. It's the prioritization layer.

For a thorough technical foundation, see Ranqo's AI readiness audit guide -- which covers all three layers along with schema markup, Core Web Vitals, and the rest of the technical surface AI crawlers evaluate.

How to Write Your Own llms.txt

The implementation is the easy part. The hard part is editorial judgment -- which pages deserve to be listed, and how to summarize them for an AI audience.

Step 1: Audit your most important pages. What are the 10-30 URLs you would most want an LLM to read first when answering a question about your business? Typically: pricing, core product features, comparison pages, top documentation, key case studies, and company background.

Step 2: Write a one-paragraph summary. Your blockquote is your elevator pitch in 2-3 sentences. AI systems often quote it verbatim. Lead with what you do, who you do it for, and any quantitative differentiator ("50,000 customers," "Fortune 500 trusted," etc.).

Step 3: Group links into H2 sections. Common groupings: Core Documentation, Product, Pricing, Customer Resources, Developer Tools, Company. Use 3-7 links per section -- AI systems prefer clearly delineated short lists over long catch-all sections.

Step 4: Write descriptions that aid extraction. A description like "API Reference" is useless. A description like "Complete REST and GraphQL API documentation with code samples in Python, Node.js, and Go" is extractable. Front-load specifics.

Step 5: Consider llms-full.txt for documentation. If you're a developer-tools company, also publish /llms-full.txt with the actual concatenated content of your docs. Cursor, Continue, and Aider users will thank you.

Step 6: Validate the URL. Make sure /llms.txt returns HTTP 200 with content-type text/markdown or text/plain. Don't require login, don't use JavaScript rendering, don't block AI user agents in your robots.txt.

Common mistakes to avoid

  • Listing every page on your site -- llms.txt is curation, not a sitemap
  • Writing marketing copy -- the audience is an LLM, not a human visitor
  • Linking to gated or login-required pages
  • Using fake H2 sections with no links
  • Forgetting to update it when key URLs change
  • Treating it as a one-time setup -- treat it like a sitemap, refresh quarterly

Should You Implement llms.txt?

The decision depends on your site type and what audience you're serving. Here's a clear matrix.

Should You Implement llms.txt?

Decision matrix based on site type and use case

Site TypeRecommendationWhy
Developer tool / API platformYes -- implementCursor, Continue, Aider, RAG frameworks actively read llms.txt for context
B2B SaaS with technical buyersYes -- implementLow effort, signals technical maturity, future-proofs for adoption
Documentation-heavy siteYes -- implementStrong fit for the format; helps LLMs navigate complex doc structures
Content / blog / publicationOptionalModest benefit; standard SEO + GEO tactics matter more
E-commerce siteOptionalProduct schema and review platforms matter more than llms.txt
Small business / localSkip for nowGoogle Business Profile and local schema are higher priority

Yes, implement now if: you're a developer tool, an API platform, a B2B SaaS with technical buyers, or a documentation-heavy site. The developer ecosystem already reads llms.txt. The cost is minimal. The signal value is high (it tells technical buyers you take AI integration seriously).

Optional if: you're a content site, an e-commerce store, or a generalist B2B SaaS. llms.txt won't hurt, but it also won't move citations meaningfully today. Your time is better spent on schema markup, comparison content, and answer-first formatting -- as detailed in Ranqo's 7-step optimization playbook.

Skip for now if: you're a small business, local service provider, or individual creator. Your AI visibility wins are elsewhere -- Google Business Profile, schema markup, third-party reviews. Revisit llms.txt in 6-12 months.

The Future of llms.txt

llms.txt sits at an interesting inflection point. The format is technically sound, the early adopter list is high-quality, and the developer-tool ecosystem actively reads it. But the major AI platforms haven't formally committed to it, and the empirical citation impact is currently zero.

Three things would change the trajectory dramatically:

  1. An official commitment from OpenAI, Anthropic, Google, or Perplexity to read llms.txt as a first-class input
  2. An adoption inflection above 25-30% (which would create network effects similar to robots.txt's rise in the early 2000s)
  3. A documented citation lift from a credible benchmark study (rather than the current null-effect findings)

None of these are guaranteed. The format could plateau at 10% adoption indefinitely, or it could be supplanted by a more opinionated standard. But for technical audiences, the asymmetric bet is clear: implementation cost is low, future-proofing value is real, and the worst case is that you spent two hours writing a markdown file that aged as a historical artifact.

For the broader strategic context on how AI platforms select sources and what optimization actually matters today, see Ranqo's complete GEO guide. llms.txt is one tool in the toolkit -- not a replacement for the foundational signals (content depth, schema, third-party mentions, author credibility) that actually drive AI citations today.

llms.txt is a low-cost, future-facing bet for technical sites. It's not a magic citation lever. Treat it as infrastructure: useful, sometimes underappreciated, occasionally essential.

Audit your site for AI readiness (with or without llms.txt)

Get a 6-dimension audit of your site's AI readiness: crawlability, content quality, page speed, AI extractability, citation potential, and authority. See where your highest-impact optimization opportunities are -- llms.txt is one signal among many. For the deeper foundation, see our audit guide and 5 factors that drive AI citations.

Start your audit

Written by

Nisha Kumari

Co-Founder at Ranqo

Nisha Kumari is Co-Founder at Ranqo, where she leads growth strategy and client acquisition. With a background in digital marketing and financial management, she specializes in SEO, Generative Engine Optimization, and helping brands build visibility across AI platforms.

On this page

Share this article

[]
Ranqo

Monitor and improve your brand's visibility across ChatGPT, Claude, Perplexity, Gemini, and Grok.

Product

  • Search Visibility
  • Prompt Intelligence
  • Competitor Benchmarking
  • Source Analytics
  • Page Optimization
  • Content Lab
  • Action Center

Company

  • Pricing
  • Contact

Legal

  • Privacy
  • Terms
  • Cookies

Resources

  • Blog
  • AI Visibility Checker
  • AI Readiness Score

© 2026 Ranqo. All rights reserved.