BetterAISearch
Strategy|8 min read

ChatGPT SEO: how to get cited in ChatGPT answers — what the research actually shows

BE
BetterAISearch Editorial Team
BetterAISearch

ChatGPT does not use PageRank. It does not crawl anchor text. It does not reward domain authority in the way Google does. If your ChatGPT SEO strategy is a subset of your Google strategy, you are optimising for a different system than the one you think you are. The research has now accumulated enough to say what actually works.

4%
overlap between GPT-4o's source pool and Google's indexed results
arXiv peer-reviewed study. Perplexity overlaps by 15.2%, Claude by 12.6%, Gemini by 11.1%.

Why traditional SEO fails as a ChatGPT strategy

A peer-reviewed arXiv study on AI source selection found that GPT-4o draws from a source pool that overlaps with Google's indexed results by just 4%. That means 96% of what ChatGPT cites is outside the territory traditional SEO is fighting for.

This is not incidental. ChatGPT is trained on a massive corpus of web text and selects sources through retrieval mechanisms that weight different signals than PageRank. Backlinks — the core authority signal in Google's algorithm — are a structural relationship between web pages that LLMs do not directly process from content. A brand consistently mentioned across dozens of publications in plain text is more visible to a language model than a brand with many incoming links but sparse textual mentions.

Ahrefs confirmed this with data at scale: an analysis of 75,000 brands found branded web mentions have a Spearman correlation of 0.664 with AI Overview citation rates. Backlinks correlated at 0.218. The gap is 3x. This is the core divergence between Google SEO and ChatGPT SEO.

What actually predicts ChatGPT citation

The research converges on five variables with consistent positive correlation to ChatGPT citation rates. Each is independently confirmed by multiple studies.

1. Author credentials and attribution

Presence AI tracked 1,200 pages and 3,600 queries across ChatGPT, Perplexity, Google AI Overviews, and Gemini over 90 days. Pages with expert authors and documented credentials achieved a 72% AI citation rate. Pages with no author attribution achieved 25%. Pages with partial attribution (name only, no credentials) achieved 41%.

The practical implementation: every published page needs a visible author byline, a credentials statement ("Senior researcher at X", "10 years in Y"), and a link to an author page. AccuraCast's analysis of 9,000 citation sources found Person schema appeared in 58.9% of cited pages and 70.4% of ChatGPT citations specifically.

2. Content structure: headings and extractability

AirOps analysis of 815,484 retrieved pages found that 7 to 20 subheadings was the optimal range for per-query citation rate in ChatGPT. Pages with fewer than 7 headings (insufficient structure) and pages with more than 20 (over-fragmented) both underperformed.

The mechanism: ChatGPT retrieves approximately 6 to 7 pages for every one it cites. 85% of retrieved content is discarded. Clear heading structure creates extraction points — the model can locate the exact section that answers the query and extract it cleanly. Pages structured as walls of prose with no internal navigation are harder to mine for specific answers.

3. Branded web mentions

Branded web mentions — references to your brand or domain by name across third-party publications, with or without a hyperlink — are the strongest single predictor of AI citation rate across platforms. The Ahrefs 0.664 correlation exceeds backlinks (0.218), domain rating, and organic traffic as predictors.

Building branded mentions requires digital PR: being quoted as an expert source in industry publications, having your research cited by other writers, securing brand mentions in round-up articles. The goal is to appear by name in text that AI systems train on and retrieve from — not just to acquire hyperlinks.

4. Content freshness

ChatGPT with browsing enabled weights recent content more heavily. Amsive analysis found 50% of AI-cited content is under 13 weeks old. BetterAISearch's own freshness decay calibration, which is based on this and corroborating data, gives full weight to content under 3 months old and discounts content over 18 months by 60%.

Practically: every key page on your site should have a visible, accurate last-updated date. Updating with new data or revised findings — even minor additions — resets the freshness signal. Stale content on active topics is a direct citation liability.

5. Specific, sourced claims

ChatGPT is optimised to produce accurate responses. It prefers sources whose claims can be verified against its training corpus. Content with specific numbers, named studies, documented methodology, and properly attributed quotes gives ChatGPT more confidence that it can cite you accurately without hallucinating details.

Generic advice ("content should be high quality") creates no extractable evidence for ChatGPT to work with. Specific claims ("AirOps found 815,484 pages; 500 to 2,000 words outperformed") give the model something concrete to cite and verify.

ChatGPT technical requirements: what crawlers can and cannot read

GPTBot, OpenAI's web crawler, does not execute JavaScript. A Writesonic study of 62 webpage elements across six AI crawlers found that JSON-LD structured data scored zero out of six for readability — none of the crawlers reliably extracted it. JavaScript-rendered content scored 1 out of 6.

The content GPTBot can read: visible body text (6/6), semantic headings H1–H3 (6/6), title tags (5/6), and image alt text (4/6). Everything else is either not reliably read or not read at all.

The checklist for ChatGPT technical eligibility:

  • robots.txt must allow GPTBot: User-agent: GPTBot, Allow: /
  • Primary content must be in server-rendered HTML, not JavaScript-populated
  • Title tag must accurately describe the page content
  • H1–H3 heading structure must reflect the page's informational architecture
  • No login walls or cookie gates blocking content access

The ChatGPT SEO vs Google SEO action table

TacticGoogle SEO impactChatGPT citation impactPriority shift
Backlink buildingVery highLow (0.218 correlation)Downgrade
Branded web mentionsMediumVery high (0.664 correlation)Upgrade significantly
Author credentialsIndirectVery high (2.4x citation rate)Upgrade significantly
Person schema markupLowHigh (58.9% cited page rate)Upgrade
Content freshnessModerateHigh (50% of cited content <13wk)Upgrade
Keyword densityModerateNot a direct signalDowngrade
H1–H3 heading structureModerateHigh (6/6 crawler readability)Maintain
JSON-LD schema markupHighZero (0/6 crawler readability)Redirect to Google only
Word count (long-form)ModerateVolume, not rate (use strategically)Nuance required

Source: Ahrefs (75k brands), Presence AI (n=1,200 pages), Writesonic (62 elements, 6 crawlers), AirOps (n=815,484 pages)

The bottom line

ChatGPT SEO is not Google SEO with a different name. The source pool barely overlaps. The signals that drive ranking in Google — backlinks, domain rating, keyword density — have weak or zero correlation with ChatGPT citation rates. The signals that drive ChatGPT citation — branded web mentions, author credentials, content freshness, heading structure — are either secondary or absent from traditional SEO practice.

The research is now consistent enough to act on. Start with author attribution (highest measured impact per unit effort), branded web mentions (highest correlation overall), and technical accessibility (binary — either GPTBot can read your pages or it cannot).

Frequently asked questions

What is ChatGPT SEO?

ChatGPT SEO — more accurately called AEO (Answer Engine Optimisation) or GEO (Generative Engine Optimisation) — is the practice of optimising content to be selected as a cited source in ChatGPT's responses. Unlike traditional SEO, which targets Google's ranking algorithm, ChatGPT SEO targets the retrieval system OpenAI uses to select sources for its browsing-enabled and GPT-4o answers. The two disciplines overlap but require different tactics.

Do backlinks help with ChatGPT citations?

Backlinks are a significantly weaker predictor of ChatGPT citation than they are for Google ranking. Ahrefs analysis of 75,000 brands found branded web mentions have a Spearman correlation of 0.664 with AI citation rates, versus 0.218 for backlinks — a 3x difference. The mechanism: LLMs are trained on raw web text, where a brand mentioned across publications signals authority without requiring hyperlinks.

What content length performs best for ChatGPT citations?

AirOps analysis of 815,484 pages found 500 to 2,000 words with 7 to 20 subheadings produces the highest per-query citation rate in ChatGPT. Longer content (10,000+ words) accumulates more total citations over time by covering more sub-queries, but underperforms for any specific question. Both lengths have a role depending on whether you are targeting one query or building topical coverage.

Does ChatGPT index new content in real time?

ChatGPT with browsing enabled (GPT-4o with web search) can access recent content. ChatGPT's base model is limited to its training cutoff. For browsing-mode citations, research from Amsive found 50% of AI-cited content is under 13 weeks old, suggesting a strong recency bias. Fresh content with a visible publication date outperforms older content on equivalent topics.

How is ChatGPT SEO different from Google SEO?

Three core differences: (1) ChatGPT's source pool overlaps with Google by only 4% (arXiv peer-reviewed study), meaning Google ranking does not reliably predict ChatGPT citation. (2) Backlinks are far weaker signals for ChatGPT than branded web mentions. (3) Author credentials matter directly for ChatGPT citation — pages with expert, attributed authors achieve 2.4x higher citation rates than anonymous pages (Presence AI, n=1,200 pages).

Related tactics in the database

About the author

BE
BetterAISearch Editorial Team
BetterAISearch

The BetterAISearch team synthesises peer-reviewed studies, platform documentation, and independent research into actionable, scored tactics.