Guide

How to Audit Your Website for AI Readiness

Most websites are optimized for Google but invisible to AI. A 6-dimension audit -- crawlability, content quality, page speed, AI readiness, citation potential, and authority -- reveals exactly where the gaps are.

Ranqo Research|April 9, 202618 min read

Most websites are optimized for Google. Title tags are tuned, backlinks are built, page speed is monitored. But when ChatGPT, Perplexity, or Gemini generate a recommendation in your category, does your brand appear? For the majority of businesses, the answer is no -- and the reason is not a lack of content but a lack of AI readiness.

78%

of businesses report zero visibility in AI-generated answers, despite AI search traffic growing 527% year-over-year

AI readiness is not a single checkbox. It is a composite of six dimensions, each measuring a different aspect of how AI platforms evaluate your content. The weighting model below is calibrated from cross-platform analysis of AI citation patterns, aligned with industry benchmarks from sources like First Page Sage, Princeton's GEO research, and Search Engine Land's citation studies. This guide walks through each dimension with specific signals, scoring criteria, and actionable checklists -- so you can audit any page on your site and know exactly where the gaps are.

Why AI Readiness is Different from SEO

Google crawls your site to rank it. AI crawls your site to extract from it. That distinction changes everything about how you should think about optimization. Search Engine Journal found that ChatGPT-User makes 3.6x more requests than Googlebot -- but with a critical limitation: AI crawlers cannot execute JavaScript.

How AI Crawlers Differ from Googlebot

Crawl behavior comparison across major bots (Search Engine Journal)

Crawler	Requests	JS Execution	What It Wants
Googlebot	37,426	Yes	Crawlable, fast, structured content
ChatGPT-User	133,361	No	Answer-dense, quotable content
GPTBot	~50,000	No	Comprehensive, authoritative content
ClaudeBot	~24,000	No	Nuanced, well-cited content

3.6x

more requests from ChatGPT-User than Googlebot -- but AI crawlers cannot execute JavaScript (Search Engine Journal)

If your site relies on client-side rendering (React, Vue, Angular without SSR), AI crawlers see an empty page. They cannot index what they cannot read. This is the most common -- and most invisible -- reason brands fail the AI readiness audit. Googlebot renders JavaScript and sees your content. GPTBot and ClaudeBot see raw HTML and move on.

Google asks: "Should this page rank?" AI asks: "Can I extract an answer from this page?" The second question is harder to pass.

Dimension 1: Crawlability (11% of Score)

The foundation. If AI bots cannot discover and access your content, nothing else matters. Crawlability measures whether the technical infrastructure of your site allows AI systems to find, fetch, and process your pages.

The most critical signal is server-side rendering. An analysis of 500 million+ GPTBot fetches found zero evidence of JavaScript execution. AI crawlers see only raw HTML -- any content generated client-side is invisible to them. If you use a JavaScript framework, ensure SSR or static generation is enabled for all content pages.

Accessibility matters too. According to Hashmeta's study of 100,000 ChatGPT responses, paywalled content receives 97% fewer citations than open-access content. Beyond paywalls, AI bots check for basic signals: HTTPS encryption, a discoverable XML sitemap, a robots.txt that does not block AI user agents, and clean canonical tags. These are binary checks -- either pass or fail.

Crawlability checklist

HTTPS enabled across entire site (no mixed content)
XML sitemap exists and is linked from robots.txt
Robots.txt does not block GPTBot, ClaudeBot, or PerplexityBot
Canonical tags present and pointing to correct URLs
Server-side rendering or static generation for all content pages
No noindex directives on pages you want AI to cite
Clean URL structure (no excessive query parameters)
Mobile-friendly rendering (responsive or adaptive)
Schema.org structured data present

Dimension 2: Content Quality (24% of Score)

The highest-weighted dimension -- and for good reason. First Page Sage ranks content quality as the #1 Google algorithm factor at 23%, and AI platforms follow the same pattern. Content quality measures whether your content is substantive, well-organized, and deep enough to be useful as an AI source.

Content Quality: What Gets Scored

Signal weight distribution within the Content Quality dimension (points out of 70)

Content depth carries the most weight at 26 points. Hashmeta's analysis of 100,000 ChatGPT responses found that articles over 1,500 words are cited 4.7x more often than content under 500 words. This dimension measures word count (minimum 300, optimal 1,000+), readability score, paragraph structure (2-3 sentence chunks), and topic coverage breadth.

Heading structure at 12 points checks for a single H1 tag, proper H2/H3 hierarchy without skipping levels, and keywords naturally distributed across headings. AI uses headings to understand content structure and extract relevant sections.

Format matters too. Listicles account for 21.9% of all AI citations -- the most of any content format. If your page targets a "best X" or comparison query, structure it as a list with clear numbered items.

Content quality checklist

Minimum 1,000 words for informational content
Single H1 tag matching the page topic
Proper H2/H3 hierarchy (never skip levels)
Title length 50-60 characters with primary keyword near beginning
Meta description 150-160 characters summarizing the page
3+ internal links with descriptive anchor text
Alt text on every image, describing content not just decoration
Readability at Flesch 60-70 (plain English)
Short paragraphs (2-3 sentences) for scanability

Dimension 3: Page Speed (5% of Score)

The lowest-weighted dimension, but with a twist: speed functions as a gate, not a signal. An analysis of 107,352 AI-visible pages found that good performance does not create an advantage -- but severe failure creates a disadvantage. Core Web Vitals are a pass/fail check.

more AI citations for pages with First Contentful Paint under 0.4 seconds (ZipTie.dev)

Core Web Vitals: AI Citation Thresholds

Performance targets and their impact on AI visibility (Search Engine Land, 107K pages)

Metric	Target	AI Impact
Largest Contentful Paint (LCP)	2.5s or less	Gate: slow LCP = deprioritized by AI crawlers
First Contentful Paint (FCP)	0.4s or less	High: 3x more AI citations under 0.4s
Cumulative Layout Shift (CLS)	0.1 or less	Low: weak correlation with AI citations
First Input Delay (FID)	100ms or less	Low: minimal AI impact
Total Blocking Time (TBT)	200ms or less	Medium: affects crawl completion
Speed Index	3.4s or less	Medium: affects content availability to crawlers

ZipTie.dev found that pages with FCP under 0.4 seconds receive 3x more AI citations. But the correlation between CWV and AI visibility is weak overall (LCP correlation just -0.12 to -0.18). The takeaway: fix severe speed issues, but don't over-invest in going from "good" to "great."

Slow pages affect AI in a specific way: if the page takes too long to respond, the AI crawler may time out and move on, never indexing the content at all. This is different from Google, which is more patient with slow renders.

Dimension 4: AI Readiness (21% of Score)

The dimension that separates traditional SEO from AI optimization. AI readiness measures whether your content can be extracted -- not just read, but pulled apart into answer blocks, definitions, comparisons, and facts that AI platforms can serve directly to users.

AI Readiness: The Extraction Test

Can AI extract direct answers from your content? Test each signal.

Signal	What It Means	Quick Test
Answer Density	Direct answers in first 50-100 words	Does the first paragraph answer the page's core question?
Fact Density	Statistics and data points per 200 words	Count specific numbers vs. vague claims ("42%" vs. "many")
Entity Clarity	Named entities without vague pronouns	Can you understand each paragraph without reading the one before it?
Paragraph Ready	Self-contained 40-60 word answer blocks	Can any paragraph be copy-pasted as a standalone answer?
List Ready	Well-formatted numbered or bulleted lists	Are key points in scannable list format?
Table Ready	Comparison or data tables present	Is there at least one HTML table comparing options?
Definition Ready	Clear "[X] is [definition]" statements	Does the page define its key terms explicitly?

Answer density is the most important signal. Does the first paragraph directly answer the page's core question? AI platforms scan for answers they can extract verbatim. Content that builds to a conclusion through five paragraphs of context before delivering the answer will lose to content that leads with the answer.

Fact density measures how many specific, quantified data points appear per 200 words. The difference between "Many companies report improvement" and "42% of companies reported a 3x improvement" is the difference between being ignored and being cited.

Structured data amplifies every signal. Pages with FAQ schema are 3.2x more likely to appear in AI Overviews. Hashmeta found that 72.3% of frequently-cited pages have schema markup, compared to just 18.6% of rarely-cited pages -- a nearly 4x difference in adoption.

The question is not whether your content is good. The question is whether AI can extract a specific, quotable answer from your content in under two seconds. Structure is the multiplier.

Dimension 5: Citation Potential (21% of Score)

AI readiness asks "can AI extract from this?" Citation potential asks "would AI want to cite this?" The difference is the quality and uniqueness of what you offer. Three pillars drive citation potential.

Citation Potential: Typical vs AI-Optimized Page

Editorial comparison across 5 citation dimensions (score 0-100)

40%

citation increase from content that includes statistics, expert quotes, and source citations (Princeton, KDD 2024)

Quotability measures whether your content contains specific, memorable statements that AI would want to attribute. Generic advice like "focus on quality content" has zero quotability. A statement like "Pages with FAQ schema are 3.2x more likely to appear in AI Overviews" is highly quotable.

Unique information is the strongest citation driver. Princeton researchers found that content with original statistics and data increases AI citations by up to 40%. Original research, proprietary data, case studies, and first-hand experience all qualify. If your content can be found on ten other sites, AI has no reason to cite yours specifically.

Expert attribution completes the picture. Named experts with verifiable credentials get cited significantly more than anonymous content. Analysis of 8,000 AI citations shows brands are 6.5x more likely to be cited through third-party sources than through their own domain.

Citation potential checklist

Include at least 3 original statistics or data points per page
Add expert quotes with credentials (name, title, organization)
Cite external authoritative sources with inline links
Create comparison tables for product or service pages
Include a TL;DR or key takeaways section AI can extract
Add FAQ sections with clear question-answer pairs
Write definitions for key terms ("[X] is [definition]")

Dimension 6: Authority & Trust (18% of Score)

The E-E-A-T dimension. AI platforms have adopted Google's Experience, Expertise, Authority, and Trustworthiness framework as a citation filter. Content that fails the trust threshold is excluded from AI recommendations regardless of how relevant or well-structured it is.

89%

of frequently-cited pages have clear author bylines, vs just 31% of rarely-cited pages (Hashmeta, 100K responses)

Authority & Trust: The Binary Checks

Each signal is either present or absent -- there is no partial credit

Signal	Impact	Why It Matters
HTTPS / SSL	High	Binary trust gate -- AI deprioritizes insecure sites
Author Bio with Credentials	High	Named author with verifiable expertise (PhD, certifications, title)
External Source Citations	High	Links to authoritative third-party sources within content
Privacy Policy	Medium	Signals organizational legitimacy and compliance
Contact Information	Medium	Physical address, email, or phone visible on site
Last Updated Date	Medium	Visible date showing content is actively maintained

The data is stark. Hashmeta's analysis found that 89.2% of frequently-cited pages have clear author bylines, compared to just 31.4% of rarely-cited pages. Anonymous or corporate-authored content receives 64% fewer citations. Named authorship alone increases citations by 1.9x.

ZipTie.dev found that pages ranking #6-10 on Google with strong E-E-A-T signals get cited 2.3x more by AI than pages ranking #1 with weak E-E-A-T. Trust outweighs rank.

The most impactful signal beyond authorship is external source citations. Pages that cite authoritative third-party sources -- with real inline links, not just a bibliography -- signal to AI that the content is well-researched and trustworthy. This is precisely what makes AI platforms confident enough to cite you in turn.

Content freshness also plays a role here. According to rank.bot, content updated within 30 days receives 3.2x more AI citations, and more than 70% of all AI citations go to pages updated within 12 months. A visible "last updated" date signals active maintenance.

Trust is not a differentiator. It is a prerequisite. Pass every binary check, and you qualify for citation. Miss even one, and AI may skip you entirely.

Putting It All Together: The Weighted Score

The six dimensions combine into a single weighted score from 0 to 100. Each dimension contributes proportionally based on its observed impact on AI citation likelihood -- calibrated from cross-platform citation analysis and industry benchmarks including content format research, CWV impact studies, and Princeton's GEO findings.

The 6 Dimensions: Weighted Scoring Model

Weights calibrated from cross-platform AI citation analysis and industry benchmarks

A+: 90-100A: 80-89B+: 70-79B: 60-69C+: 50-59C: 40-49D: 30-39F: Below 30

The weighting reveals a clear prioritization, and each weight is backed by published research. Content Quality at 24% aligns with First Page Sage's finding that content quality is the #1 ranking factor at 23%. AI Readiness at 21% reflects the Princeton GEO study's finding that structured optimization boosts visibility by up to 40% -- the largest controlled experimental effect in the field. Together, these two dimensions account for 45% of the total score.

Citation Potential at 21% rewards original, quotable content -- backed by Airops' finding that brands are 6.5x more likely cited through third-party sources. Authority & Trust at 18% acts as the gatekeeping filter -- anonymous content gets 64% fewer citations. Crawlability at 11% and Page Speed at 5% (matching First Page Sage's 3%) are foundational but lower-impact once you pass the baseline.

The grade scale translates the score into actionable territory: an A+ (90-100) means the page is strongly AI-ready, while anything below C (40-49) indicates significant gaps that are likely costing you AI citations.

Your AI Readiness Checklist

Prioritize your audit by working through the dimensions in order of impact. Start with the quick wins, then build toward the long-term investments.

Quick wins (this week)

Enable HTTPS if not already active
Add or fix XML sitemap, link from robots.txt
Ensure robots.txt does not block AI crawlers (GPTBot, ClaudeBot, PerplexityBot)
Add FAQ schema to top 10 informational pages
Add "last updated" dates to all content pages
Add author bios with credentials to every article

Medium-term (this month)

Front-load answers in the first 100 words of every page
Restructure top pages as listicles or comparison guides
Add comparison tables to product and category pages
Include 3+ original statistics or data points per page
Cite external authoritative sources with inline links
Fix heading hierarchy (single H1, proper H2/H3 nesting)

Long-term (this quarter)

Ensure server-side rendering for all content pages
Build a monthly content update calendar for high-value pages
Invest in original research and proprietary data
Earn third-party mentions on review sites and industry publications
Create comprehensive, authoritative content exceeding 1,000 words
Monitor AI visibility per platform and track improvements

An AI readiness audit is not a one-time exercise. It is a recurring process. The sites that monitor and improve continuously are the ones that AI platforms learn to trust and cite.

Run your AI readiness audit

Audit any page across all six dimensions -- crawlability, content quality, page speed, AI readiness, citation potential, and authority. Get a weighted score, per-dimension breakdown, and prioritized recommendations.

Start your audit free

Share this article