LLMRanks · Research Report
Video 002 · What Is AEO? · Jun 2026
Full research report · all sources cited

What is AEO, and what do
AI engines actually cite?

The complete research behind the video and article: our original 270-answer citation study, plus an adversarially-verified review of the external literature — every claim tied to its source.

270
AI answers measured
28
External sources read
22
Claims verified (of 124)
3
Claims refuted
00 · Executive summary

What we did, and what we found.

Two independent bodies of research sit behind this project. Part A is an original measurement study: we asked 3 AI engines the same 30 buyer questions, 3 times each, and recorded every source cited — 270 answers in total. Part B is an adversarially-verified review of the public literature: 28 sources, 124 candidate claims, narrowed to 22 that survived independent 3-vote fact-checking.

Part A

The original citation study.

A first-party measurement of what three AI answer engines cite when asked real buying questions — designed to be reproducible and sliceable.

A.1 · Method

How the study was run

Two outcomes we measure — "citation" vs. "mention"

A citation (linked source) — the engine attaches a clickable link to a webpage it drew from. The publisher earns the attribution and the referral click. We report this as the link rate; the per-domain counts in this report are citations in this sense.

A mention — the engine names a brand or product in the answer, recommending it to the buyer, whether or not a source is linked. The brand earns the visibility. We report this as the mention rate.

They come apart: an engine can say "use HubSpot" with no link at all. A publisher wants to be a cited source (the click); a brand wants to be mentioned (the recommendation). Because bare "cited" is ambiguous, we keep the two separate throughout.

Reproducibility

Run via the same engine endpoints the LLMRanks platform uses in production; 270/270 cells returned cleanly (0 errors). Raw per-answer data retained.

A.2 · Instrument

The 30 prompts

6 verticals × 5 query types · US / EN
Verticalbest [X][A] vs [B]how tobest [X] for…worth it / alt
B2B softwarebest CRM for small businessHubSpot vs Salesforcechoose project-mgmt softwareemail platform for ecommerceMailchimp alternatives
Consumer technoise-cancelling headphonesiPhone 16 vs Galaxy S25pick a robot vacuumlaptop for video editing <$1500is the Dyson V15 worth it
Health/wellnessbest protein powdercreatine vs pre-workoutstart strength training at homefitness tracker for runnersis AG1 worth the money
Personal financebest budgeting appRoth vs traditional IRAstart investing with $1000travel credit card for cashbackis YNAB worth it
Home/lifestylemattress for back painPurple vs Casperchoose a standing deskrobot mower for a large yardPeloton alternatives
Travel/DTCbest carry-on luggageAway vs Monosfind cheap flightstravel insurance for a familyis Allbirds worth it

Full citation matrix — every prompt × engine

Each cell = the number of distinct domains that engine cited for that prompt (3 runs combined), with the single most-frequent one named. Gemini's blank cells and AI Overviews' spikes are both visible at a glance.

All 30 prompts · distinct domains cited per engine
ChatGPT
Gemini
AI Overviews
B2B Software
bestWhat's the best CRM for a small business?
10techradar.com
0no cite
10reddit.com
vsHubSpot vs Salesforce — which should I choose?
6sasanova.com
5resonatehq.com
6reddit.com
howtoHow do I choose project management software for a remote team?
8softabase.com
9slack.com
3asana.com
usecaseWhat's the best email marketing platform for ecommerce?
5ecomstacksolutions.com
8emailtooltester.com
6moosend.com
altWhat are the best alternatives to Mailchimp?
13activecampaign.com
8emailtooltester.com
5zapier.com
Consumer Tech
bestWhat are the best noise-cancelling headphones?
3rtings.com
8techradar.com
5youtube.com
vsiPhone 16 vs Samsung Galaxy S25 — which is better?
3techradar.com
9phonebot.com.au
4youtube.com
howtoHow do I pick a robot vacuum?
7bestrobovacuums.com
0no cite
5youtube.com
usecaseWhat's the best laptop for video editing under $1500?
7techradar.com
3gagadget.com
1google.com
altIs the Dyson V15 worth it?
7rtings.com
7purewow.com
4reddit.com
Health & Wellness
bestWhat's the best protein powder?
5verywellhealth.com
3forbes.com
13reddit.com
vsCreatine vs pre-workout for beginners?
2preworkoutsups.com
0no cite
7reddit.com
howtoHow do I start strength training at home?
2healthline.com
0no cite
5health.ucdavis.edu
usecaseWhat's the best fitness tracker for runners?
4techradar.com
4runnersworld.com
2google.com
altIs AG1 worth the money?
8healthline.com
0no cite
14reddit.com
Personal Finance
bestWhat's the best budgeting app?
5appstested.com
8kiplinger.com
7reddit.com
vsRoth IRA vs traditional IRA?
2fidelity.com
4farther.com
26startengine.com
howtoHow do I start investing with $1000?
8richmoneyflow.com
7friendsthatinvest.com
3youtube.com
usecaseWhat's the best travel credit card for cashback?
3financepedia.us
4forbes.com
5reddit.com
altIs YNAB worth it?
8senticmoney.com
7ynab.com
5reddit.com
Home & Lifestyle
bestWhat's the best mattress for back pain?
3sleepfoundation.org
5theguardian.com
8reddit.com
vsPurple vs Casper mattress?
3casper.com
5sleepopolis.com
6reddit.com
howtoHow do I choose a standing desk?
9maplin.co.uk
0no cite
6youtube.com
usecaseWhat's the best robot lawn mower for a large yard?
6therobowire.com
14navimow.segway.com
1google.com
altWhat are the best alternatives to a Peloton bike?
1homefitnesslab.com
6cnet.com
2reddit.com
Travel & DTC
bestWhat's the best carry-on luggage?
6forbes.com
5forbes.com
25nbcnews.com
vsAway vs Monos luggage?
3goodhousekeeping.com
6rd.com
4google.com
howtoHow do I find cheap flights?
9forbes.com
4moneysavingexpert.com
11reddit.com
usecaseWhat's the best travel insurance for a family trip?
6haznos.org
9explorewitherin.com
5usnews.com
altIs Allbirds worth it?
9trustpilot.com
7neverendingvoyage.com
3google.com
Cited domains (3 runs combined):01–23–45–67+
A.3 · Results

Per-engine profile

270 answers · 90 per engine · cited domains
EngineCite rateDistinct domainsTop-10 shareEditorial (review+news)UGC+video
ChatGPT81%13430%20%7%
Gemini58%12327%15%9%
AI Overviews100%14435%10%19%

Cite rate — % of answers citing ≥1 source

AI Overviews
100%
ChatGPT
81%
Gemini · consumer app
58%
Gemini · direct API
73%
Cross-validation — a second, independent method

Because "Gemini cites least" is a headline finding, we re-ran it a different way: querying Gemini's developer API with forced google_search grounding (n=30, 0 errors). It cited on 73% of answers — higher than the consumer app (gemini.google.com surfaces fewer sources to users than the model actually grounds on), but still the lowest of the three engines, and even with a live search tool Gemini cited nothing on ~27% of buyer questions. The finding holds under both methods.

Mentions vs links — being named ≠ being linked

A linked citation and a brand mention are different outcomes — an engine can recommend a brand by name without linking any source. A companion run (270 cells) captured the full answer text and extracted the brands each engine names, alongside the links:

Companion run · 270 answers · 90 per engine
EngineLinks a sourceNames a brandRecommends, no linkAvg brands / answer
AI Overviews99%97%0%7.1
ChatGPT83%96%14%5.2
Gemini53%82%29%5.7

Every engine names brands more than it links them, and Gemini's gap is the widest — it recommends a brand by name on 82% of answers but links a source on only ~53%, so ~1 in 3 Gemini answers name a brand with zero links. (Link rates here corroborate the headline study — AIO 100 / ChatGPT 81 / Gemini 58 — within run-to-run variance.) This refines "Gemini cites least": it gives the fewest links, but still recommends — visibility without attribution. The implication for AEO: on Gemini, being named (in-model knowledge / grounding) is a different and arguably more valuable target than being linked.

Cross-engine overlap — shared cited domains (Jaccard)

ChatGPT ∩ Gemini
0.09
ChatGPT ∩ AI Overviews
0.09
Gemini ∩ AI Overviews
0.14

Top cited domains

Cells citing each domain · 90 cells per engine
RankChatGPTGeminiAI Overviews
1reddit.com · 14reddit.com · 17reddit.com · 61
2techradar.com · 12forbes.com · 9youtube.com · 41
3forbes.com · 11cnet.com · 6google.com · 28
4healthline.com · 7goodhousekeeping · 5forbes.com · 12
5goodhousekeeping · 5youtube.com · 4nerdwallet.com · 12

Reddit ranks #1 on all three engines. Cross-engine totals: reddit.com (92) · youtube.com (46) · forbes.com (32) · google.com (29) · techradar.com (18).

Source type by query type

% of citations, all engines · "brand/other" = vendor & long-tail pages
Query typebrand/otherforum/UGCvideoreviewnews
best64851011
vs787464
how-to64111145
use-case809174
alternatives66122612

How-to questions spike on video (people want to be shown); best/alternatives pull the most news/media (editorial listicles); vs/use-case lean hardest on specific brand pages.

Source type by vertical

% of citations, all engines
Verticalbrand/otherreviewnewsforumvideo
consumer-tech5425489
health-wellness6411596
travel-dtc6911771
b2b-software7262146
personal-finance767493
home-lifestyle796195

Concentration & the long tail

Citations are not concentrated in a few big aggregators. The top-10 domains account for only 27–35% of each engine's citations; the remaining two-thirds spread across 123–144 distinct domains per engine, with 64–80% going to specific vendor / niche "brand/other" pages.

A.4 · Limitations

What this study does and doesn't claim

A.5 · The full ranking

Every domain and brand, ranked

The two tables aggregate the entire dataset — every linked source (citation) and every named brand (mention) across all 270 answers, three runs combined. The head is short and the tail is very long. The complete, unabridged lists ship as raw JSON alongside this report — citation-study-data.json and mention-study-data.json — so you can run your own analysis.

Most-cited domains — who AI engines link

Citations = cells linking the domain · 90 cells per engine · top 20 of 339
#DomainChatGPTGeminiAI OvwTotal
1reddit.com14176192
2youtube.com144146
3forbes.com1191232
4google.com012829
5techradar.com123318
6nerdwallet.com201214
7healthline.com70613
8goodhousekeeping.com55010
9mattressnerd.com13610
10zapier.com0369
11rtings.com3429
12cnet.com0639
13instagram.com0099
14pcmag.com0437
15businessinsider.com0437
16amazon.com0066
17cleanmyspace.com0336
18sleepfoundation.org3036
19slack.com2305
20popularmechanics.com3205

339 distinct domains, 987 total citations — but long-tailed: 44% were cited only once, and the top-10 draw barely 28% of all citations. Reddit (92) is cited about twice as often as #2 (youtube.com, 46); AI Overviews drives the overwhelming share of the Reddit, YouTube and Google volume.

⚠ Google's own surfaces: google.com (29 — all but one from AI Overviews) and youtube.com (46 — 41 from AI Overviews) are Google-owned. The google.com hits are almost all product-shopping queries — Google's AI linking its own Shopping / Flights surfaces, i.e. citing itself. Counted as-is, but these aren't third-party AEO targets: you reach them via a Shopping/Merchant feed or a YouTube video, not by publishing citable content.

Most-named brands — who AI engines recommend

Mentions = cells naming the brand · 90 cells per engine · top 20 of 606
#BrandChatGPTGeminiAI OvwTotal
1Apple1217837
2HubSpot76922
3YNAB66618
4Away66618
5Mailchimp65617
6Klaviyo66517
7Shopify67316
8Salesforce56314
9Brevo44614
10ActiveCampaign44614
11Fidelity53513
12Zoho43512
13WooCommerce55212
14Omnisend43512
15MailerLite34411
16Slack35210
17Monarch Money34310
18Monos34310
19Pipedrive3339
20Trello3339

606 distinct brands, 1,613 total mentions, with 54% named only once — recommendations spread even wider than source citations. Strikingly, the most-named brands split evenly across all three engines (YNAB and Away are 6/6/6; Mailchimp 6/5/6): so while the engines cite wildly different sources (9–14% overlap, see A.3), they broadly agree on which brands to recommend.

Reading the two tables together

They are different units: a citation's value accrues to a publisher (reddit.com, forbes.com); a mention's accrues to a product (HubSpot, YNAB). A brand can be widely named without its own site being cited — the mention-vs-link gap quantified in A.3.

Part B

The background literature research.

An adversarially-verified review of what's publicly known about AEO — so every external claim in the video and article is independently checked, not repeated on faith.

B.1 · Method

Multi-source sweep → 3-vote verification

The question was decomposed into 6 angles (definition/origin · the academic GEO paper · empirical citation data · citability levers · market growth · skeptical/contrarian). Sources were gathered per angle, key falsifiable claims extracted, then each ranked claim was put through independent 3-vote adversarial verification — voters instructed to refute; a claim needed a majority to survive.

Sources fetched28
Falsifiable claims extracted124
Claims sent to verification25
Confirmed22
Refuted / killed3
Merged into findings9
B.2 · Verified findings

What survived fact-checking

High confidencevote 3–0
No one definitively coined the term "AEO." The most-repeated origin story credits Jason Barnard / Kalicube with pioneering it — but that's a self-reported claim, and even his own write-ups disagree on the year. The practice is real; the "who invented it" story isn't settled.
Why it matters: don't state "X coined AEO" as fact. And note that in practice people use "AEO" and "GEO" interchangeably — only GEO has a rigorous academic definition (next finding).
Sources: [1] [17]
High confidencevote 3–0 · 6 sources
GEO has a clean, peer-reviewed academic origin: "GEO: Generative Engine Optimization" (Aggarwal, Murahari, Rajpurohit, Kalyan, Narasimhan, Deshpande — Princeton + IIT Delhi). arXiv Nov 2023; published at KDD 2024 (ACM SIGKDD), DOI 10.1145/3637528.3671900.
Why it matters: this is the credibility backbone — a top-tier data-mining venue, not a vendor blog. Attribute to Princeton + IIT Delhi.
Sources: [3] [4] [5] [6] [7] [8]
High confidencevote 3–0
The GEO paper's headline result: the right content changes can boost how prominently a source appears inside an AI answer by up to ~40% in the best case. The three strongest levers were citing your sources (≈+28%), adding direct quotations (≈+41%), and adding original statistics (≈+34%).
Hedge (mandatory): "up to 40%" is best-case, relative, on a citation-prominence proxy metric, measured on a 2023–24 benchmark; the one live-engine validation (Perplexity) was ~22%. Present as direction-of-effect, not a guarantee.
Sources: [4] [5]
High confidencevote 3–0
Keyword stuffing — the classic SEO tactic — does not transfer to AI answer engines. The GEO paper finds it ~10% worse than baseline (often negative).
Why it matters: the strongest single evidence that AEO levers diverge from old SEO. Independently reaffirmed in 2025–26.
Sources: [7] [26]
High confidencevote 3–0
The largest disclosed external dataset: Semrush analyzed ~230,000 prompts and 100M+ AI citations across ChatGPT Search, Google AI Mode, and Perplexity over 13 weeks (Jul–Oct 2025). (Vendor-published; Gemini not in scope.)
Why it matters: credible scale, but a dated snapshot — and the gap it leaves on Gemini is exactly what our Part A study fills.
Sources: [9]
Medium confidencevote 2–1
Reddit and Wikipedia dominate ChatGPT citations — but their share is highly volatile (Reddit ~60%→~10%, Wikipedia ~55%→<20% over Aug–Sep 2025).
Why it matters: the durable takeaway is the volatility, not the specific shares. Our own data corroborates Reddit's dominance on commercial queries.
Sources: [9]
High confidencevote 3–0
How much an engine's citations overlap with Google's own top-10 organic results varies enormously. Perplexity tracks Google closely — >91% of the domains it cites also rank in the top 10 (82% at the exact-URL level); Google's AI Overviews ~86% domain / ~67% URL; its newer AI Mode ~54% / ~35%. ChatGPT was the clear outlier: Semrush measured it with the weakest overlap of any engine — lowest on both domain and URL, correlating more with Bing than Google — and an independent Ahrefs study found just ~10% of the exact pages ChatGPT cites also rank in Google's top 10 (≈32% at the domain level).
Why it matters: "rank #1 on Google = cited by ChatGPT" is simply false — and two independent datasets agree. Our Part A data pushes it further: the engines also diverge sharply from each other. (Semrush reports domain/URL overlap across 5,000 keywords but gives no ChatGPT figure — only that it ranked weakest; Ahrefs separately measured ChatGPT on 3,311 head terms — ~10% page-level / ~32% domain-level overlap, Sept 2025. ChatGPT cites the right site far more often than the exact ranking page.)
Sources: [10] [13]
Also verified

The GEO paper formally coins "generative engines" and frames them as "rapidly replacing" traditional search (3–0); Barnard frames AEO as a discipline distinct from SEO — optimizing for machine understanding and credibility (2–1, attributed viewpoint).

B.3 · Refuted — shown for transparency

Claims that did not survive

Refutedvote 0–3
GEO's "40% lift" was achieved on deployed commercial engines.
Reality: it was a benchmark / proxy result, not a live-engine measurement. Do not state it as a real-world commercial figure.
Refutedvote 0–3
The GEO paper is a Princeton-only project.
Reality: omits IIT Delhi and independent co-authors. Attribute to Princeton + IIT Delhi.
Refutedvote 1–2
Google AI Mode cites LinkedIn ~15% (top) and Wikipedia only ~2%.
Reality: failed verification — excluded from all deliverables.
B.4 · Market context

The "why now" numbers

Verified — gap-fill pass, 2026-06-16

Each figure was confirmed in a targeted verification pass (primary source + a corroborating second source). They are industry-reported by the named orgs and date-stamped — directional "why now" context, not peer-reviewed measurement.

MetricReported valueSource
ChatGPT scale800M+ weekly active users (Oct 2025); 900M cited by early 2026[23]
Google AI Overviews2.5B+ monthly users (Pichai, Google I/O, May 2026); 2B in Jul 2025[22]
Search-volume forecast−25% by 2026 (Gartner)[15]
Click behaviorClicked a result on 8% of visits with an AI summary vs 15% without (Pew, Jul 2025; Google disputes the method)[14]
B.5 · Caveats

Reading the evidence honestly

Bibliography

Sources (28)

Tiered by quality. primary peer-reviewed / institutional · secondary reputable reporting · first-party self-published (use as attributed claim) · weak excluded from claims.

  1. Jason Barnard — "The Trustpilot white-paper that started AEO" first-party
    jasonbarnard.com
  2. Profound — "AEO vs GEO" blog
    tryprofound.com/blog/aeo-vs-geo
  3. arXiv — "GEO: Generative Engine Optimization" (abstract) primary
    arxiv.org/abs/2311.09735
  4. ACM Digital Library — KDD 2024, DOI 10.1145/3637528.3671900 primary
    dl.acm.org/doi/10.1145/3637528.3671900
  5. arXiv — GEO paper PDF (v2) primary
    arxiv.org/pdf/2311.09735v2
  6. Princeton institutional repository — GEO publication record primary
    collaborate.princeton.edu
  7. arXiv — GEO paper HTML (v2) primary
    arxiv.org/html/2311.09735v2
  8. Generative-Engines.com — GEO project page primary
    generative-engines.com/GEO
  9. Semrush — "Most-cited domains in AI" (230K prompts) primary
    semrush.com/blog/most-cited-domains-ai
  10. Semrush — "AI Mode comparison study" primary
    semrush.com/blog/ai-mode-comparison-study
  11. Profound — "AI platform citation patterns" primary
    tryprofound.com/blog/ai-platform-citation-patterns
  12. Peec.ai — "Top domains cited by AI search (30M sources)" primary
    peec.ai/blog/top-domains-cited-by-ai-search…
  13. Ahrefs — "ChatGPT vs Google citations" (referenced in overlap finding) primary
    ahrefs.com/blog/chatgpt-google-citations
  14. Pew Research — AI summaries reduce link clicks (Jul 2025) primary
    pewresearch.org
  15. Gartner — search volume to drop 25% by 2026 primary
    gartner.com
  16. Ahrefs — schema & AI citations primary
    ahrefs.com/blog/schema-ai-citations
  17. Wikipedia — "Generative engine optimization" secondary
    en.wikipedia.org/wiki/Generative_engine_optimization
  18. Search Engine Land — AI engines cite Reddit, YouTube, LinkedIn most secondary
    searchengineland.com
  19. Search Engine Journal — AI Overview citations from top-ranking pages drop secondary
    searchenginejournal.com
  20. Search Engine Land — "No, llms.txt is not the new meta keywords" secondary
    searchengineland.com
  21. The Digital Bloom — 2025 AI-citation / LLM-visibility report secondary
    thedigitalbloom.com
  22. Google AI Overviews reach — 2B (Pichai, Q2-2025 earnings) → 2.5B (Pichai, Google I/O, May 2026) secondary / primary
    pymnts.com (2B) · blog.google (2.5B) · cnbc.com (2.5B video)
  23. Tech.eu — ChatGPT 800M+ weekly active users (Altman) secondary
    tech.eu
  24. Stan Ventures — schema markup has no meaningful AI-citation impact secondary
    stanventures.com
  25. arXiv 2406.18382 — follow-up generative-engine work primary
    arxiv.org/html/2406.18382v1
  26. Contentful — llms.txt & search visibility secondary
    contentful.com/blog/llms-txt-search-visibility
  27. SEOPress / DataVessel — keyword-stuffing has negligible/negative AI effect secondary
    (corroborating sources for the keyword-stuffing finding)
  28. Adobe — AI-driven traffic surges weak — excluded
    business.adobe.com
Provenance & cost

How much work this was.

Most "AI SEO" content is a hot take. This project is two measured studies. Here's the full accounting.

Resources behind video/article 002
LayerScaleCost (API-equivalent)
Part A — citation study270 answers · 3 engines · 30 prompts × 3 runs · 0 errors~$1
Part B — literature research111 AI agents · 28 sources · 124 claims · 22 verified · ~47M tokens~$114

Every external claim was cross-examined by independent verifiers; only the 22 that survived appear in the deliverables, and the 3 that failed are listed above in full. That's the standard for credibility — and it's why the article that comes out of this is itself built to be cited.