We Measured AI Visibility Across 102 Brands and 5 AI Engines. Here's What the Data Shows.
Across 102 brands and 102,025 AI answers, only 2.9% of citations pointed to a brand's own domain. Yet most published 'AI visibility' studies still tell teams to optimize their own pages first. The real structure is a 73 / 44 / 11 stature ladder where third-party pages do the work. Here's everything our arXiv study found, and what to do about it.
We measured how 102 brands show up across the five major AI search engines — ChatGPT, Gemini, Perplexity, Claude, and Grok — over 102,025 prompt responses and 149,912 source citations, then published the whole thing on arXiv. The short version: AI visibility is not random, and it is not evenly distributed. It forms a three-tier brand-stature ladder (global brands appear in 73% of unbranded answers, mid-market in 44%, niche in 11%), only 2.9% of citations point to a brand's own domain, the ranked listicle is the single highest-leverage page, and how a brand is framed is far noisier than whether it gets mentioned at all.
2.9%
of 149,912 AI citations point to a brand's own domain. The other 97% is third-party territory.
That single number reframes the whole problem. People increasingly get answers straight from AI assistants instead of scrolling ten blue links — Pew Research found 49% of US adults now use AI chatbots, up from a third in 2024, and most search users read the AI summary at the top of the page. So the question that matters for a brand has changed: not whether you rank for a keyword, but whether a model names you when someone asks about your category. That is the work of Generative Engine Optimization (GEO). Here is what the data says about how it actually works — and where most of the advice gets it wrong.
What we measured
Most "AI visibility" studies are small: a few dozen prompts, one or two engines, a single run. Ours is the largest openly published measurement we know of, drawn from Ranqo's production tracking between March and May 2026. Every prompt was unbranded — a real category question with no brand name in it — because a branded prompt makes "does AI mention you" tautological.
| Parameter | Value |
|---|---|
| AI engines | ChatGPT, Gemini, Perplexity, Claude, Grok |
| Brands tracked | 102 |
| Prompt responses | 102,025 |
| Source citations analyzed | 149,912 |
| Prompt style | Unbranded category questions |
| Data window | March–May 2026 |
| Paper | arXiv:2606.20065 (CC BY 4.0) |
A disclosure, up front: this is our own data, gathered on the platform we sell. That is a real limitation — we are a vendor measuring the market we serve. We have tried to counter it two ways. First, the full methodology, statistics, and confidence intervals are published openly on arXiv so anyone can check the work. Second, we are reporting the findings that are inconvenient for us too: that your own website barely moves the needle, and that the metric most tools love to sell — sentiment — is mostly noise.
One more honest note. Two of the five engines (Claude and Grok) were measured on a smaller, higher-stature slice of brands, so we do not over-read their individual numbers. Every cross-engine finding below pools all five engines.
Finding 1: AI visibility is a 73 / 44 / 11 ladder
Sort the 102 brands by how established they are — global household names, mid-market challengers, genuinely niche players — and visibility falls in clean steps.
The brand-stature visibility ladder
Share of unbranded AI answers a brand appears in, by stature tier (102 brands, 5 engines)
Each step down the ladder costs roughly 30 points of unbranded visibility. A Tier 1 brand shows up in about 73% of category answers without anyone naming it; a niche brand, 11%. The gap is not subtle, and it is not mostly about your website — it is about how much the rest of the web already says about you. The paper reports this with bootstrap confidence intervals and a Kruskal–Wallis test; the short version is that the tiers are real, not sampling noise.
~30 points
of unbranded visibility lost at each step down the stature ladder
The practical read: stop benchmarking yourself against the giant in your category. Benchmark against your tier, and treat the tier above as a multi-quarter climb that runs through third-party coverage, not a homepage rewrite. For how to set that benchmark without fooling yourself, see our guide to measuring AI share of voice.
Finding 2: your own domain is 2.9% of citations
When these engines cite a source, where does the citation point? Almost never to you.
Where AI citations actually point
Share of 149,912 source citations by destination — your own domain highlighted
Across 149,912 citations, a brand's own domain accounted for 2.9%. Corporate and competitor pages — third-party sites describing the category, including your rivals' pages — took 75.2%. The rest is spread across video, media, community forums, reference sites, and review platforms. If your AI strategy is "optimize our own pages and wait," you are optimizing the 3% slice.
In AI search, your own website is the smallest lever you have. The brands that win are the ones other pages talk about.
This is the hardest pill for SEO-trained teams. The lever that moves AI visibility is earned presence on the pages models actually read: the category listicles, the comparison articles, the forum threads, the reference entries. Source Analytics exists to show you exactly which of those domains the engines cite about you, so you can earn your way in rather than guess. It also reframes a familiar SEO idea: the goal is not to rank your page, it is to become the brand those ranking pages name.
Finding 3: the listicle is the highest-leverage page
If third-party pages do the work, which kind of page does the most? We classified citations by content format. One format leads all the others.
The listicle is the highest-leverage page
Share of content-format citations (the ranked listicle leads every format)
Among citations we could classify by format, the ranked listicle — the "best X tools" article — took 35.7%, well ahead of the generic article in second place. In absolute terms that is about 21% of all 149,912 citations from a single format. One good list that ranks you surfaces you across a long tail of AI answers, because the models reach for the same well-structured comparison pages again and again.
You will see higher listicle figures quoted elsewhere; they usually measure a different denominator. Ours is a clean decomposition of content-type citations across 102 brands, so the formats are comparable to each other. The takeaway is the same either way: getting onto, and up, the credible ranked lists in your category is the highest-leverage content play in GEO. What is GEO covers the broader playbook, and Content Lab is how we help build the assets that earn those placements.
Finding 4: sentiment is mostly noise
Plenty of tools sell sentiment dashboards. Our data says be careful with them.
Sentiment is 6.7x noisier than mention
How often each signal flips between consecutive measurements (% of brand–engine pairs)
Between consecutive measurements, whether a brand was mentioned at all flipped about 6.8% of the time. How the brand was framed — positive, neutral, negative — flipped 45.5% of the time. Sentiment is roughly 6.7x noisier than mention. A single reading of "AI sentiment" is close to a coin flip; the signal only emerges as a trend across many runs.
Measure mention and position first, because they are stable enough to act on. Treat sentiment as a slow-moving trend, not a number to react to week over week. If a tool shows your sentiment swinging wildly, that is usually the measurement, not your brand.
Finding 5: visibility is 77.5% deterministic, so measure repeatedly
Given all that volatility, you might conclude AI visibility is chaos. It is mostly not.
Across brand, prompt, and engine combinations, 77.5% were deterministic: the brand was either always cited or never cited, run after run. Only the remaining ~22.5% sat in the stochastic middle. That is good news — most of your visibility is a stable, improvable state, not a dice roll — with one catch: you cannot tell which bucket a given prompt is in from a single run. Independent work makes the same point. The recent "Don't Measure Once" paper argues AI visibility should be treated as a distribution, not a one-off observation.
77.5%
of brand × prompt × engine cells are deterministic — always cited or never cited
So the discipline is simple: measure the same prompts repeatedly, act on the deterministic majority you can actually move, and watch the volatile tail for genuine shifts. Earlier GEO research found targeted tactics can lift visibility by up to 40% on the right pages — but you only know they worked if you were measuring before and after.
What this means for your brand
Strip away the dashboards and the study comes down to three moves that most teams have backwards:
1. Win the third-party web, not just your homepage
Your own domain is 2.9% of citations. Spend the effort where the other 97% lives: earning mentions on the category pages, comparisons, and communities the models read. On-site fixes matter for crawlability, but they are the floor, not the lever.
2. Get onto the ranked lists in your category
The listicle is the single highest-leverage format. Identify the "best X" lists the engines already cite in your space, and earn an honest, well-placed spot on them. One good list compounds across hundreds of answers.
3. Measure mention repeatedly; treat sentiment as a trend
Mention and position are stable enough to act on; sentiment flips 6.7x more often, so read it as a slow trend. And because most visibility is deterministic, the same prompts measured on a steady cadence tell you what is real and what is noise.
The brands that win in AI search are not the ones with the most optimized homepage. They are the ones the rest of the web already treats as an answer.
See what AI says about your brand
Ranqo tracks your visibility, position, sentiment, and citations across ChatGPT, Claude, Perplexity, Gemini, and Grok — the same pipeline behind this study. Read the full research paper, or check your AI visibility free in a couple of minutes.
Start monitoring freeWritten by
Nisha Kumari
Nisha Kumari is Co-Founder at Ranqo, where she leads growth strategy and client acquisition. With a background in digital marketing and financial management, she specializes in SEO, Generative Engine Optimization, and helping brands build visibility across AI platforms.
Share this article
Related articles
What is Generative Engine Optimization (GEO)? The Complete 2026 Guide
Generative Engine Optimization (GEO) is the practice of optimizing content to be cited by AI platforms like ChatGPT, Perplexity, Gemini, Claude, and Grok. This complete guide covers what GEO is, how it works, why it matters in 2026, and how to implement it -- with data from every major study published in the last 12 months.
What AI Platforms Really Recommend When You Ask About CRM Software
We tested 50 prompts across ChatGPT, Claude, Perplexity, Gemini, and Grok to map the CRM recommendation landscape. The data reveals a concentrated market where 3 brands capture over half of all AI-generated mentions.
GEO vs AEO vs SEO: Three Measurement Views of the Same Work
Every other 'GEO vs AEO vs SEO' article gives you a comparison table or a layered metaphor. Both are wrong. These three terms aren't competing strategies -- they're three measurement views of the same underlying work, and treating them as separate disciplines is the mistake that costs most marketing teams real budget. Here's the honest version, with verified data and a business-model allocation framework.