Strategy|May 2, 20259 min read

Generative engine optimisation (GEO): what it is, what the research shows, and how to do it

BetterAISearch Editorial Team

BetterAISearch

The Princeton KDD Lab published the first peer-reviewed study on generative engine optimisation in 2024. Since then, a body of controlled research has accumulated across nine independent studies. Some tactics are confirmed. Many are not. This is the evidence base for what actually works, scored by source quality.

2.4×

citation rate improvement from expert author attribution across all four AI platforms

Presence AI, 1,200 pages, 3,600 queries, ChatGPT + Perplexity + Google AI Overviews + Gemini, 90-day tracking.

What generative engine optimisation means

Generative engine optimisation (GEO) is the practice of improving content so that AI systems (ChatGPT, Perplexity, Google AI Overviews, and Gemini) select it as a cited source in their generated responses. The term was coined and defined in a peer-reviewed study from Princeton's Knowledge Discovery and Data Mining lab, published in 2024, which tested nine optimisation strategies across 9,679 queries from a 10,000-query dataset.

GEO is related to AEO (Answer Engine Optimisation), which predates the generative AI wave and covers optimising for featured snippets, voice search, and structured answer formats. GEO is the AI-specific evolution: optimising for retrieval-augmented generation (RAG) systems that synthesise answers from multiple sources and attribute citations.

What the Princeton GEO study confirmed

The Princeton study tested nine strategies: adding statistics, adding citations, adding quotations, adding fluency improvements, adding authoritative tone, adding keyword stuffing, adding easy-to-understand language, adding technical terms, and a combined approach.

The result: citing relevant statistics increased visibility by 40% on average across three AI systems. Adding quotations from experts increased visibility by 20%. All strategies improved citation rates except keyword stuffing, which showed neutral to slightly negative outcomes.

The study confirmed GEO as a valid optimisation practice and established that AI systems respond to content quality signals in measurable ways. It also confirmed that keyword-first thinking (the foundation of traditional SEO) does not transfer to generative AI.

The five most evidence-backed GEO tactics

1. Author credentials (Tier 1 by measured impact)

Presence AI ran a 90-day controlled study tracking 1,200 pages and 3,600 queries across ChatGPT, Perplexity, Google AI Overviews, and Gemini. The finding: pages with expert authors and documented credentials achieved a 72% AI citation rate. Pages with no author attribution achieved 25%. The 2.4x difference makes E-E-A-T author attribution the single highest-impact actionable GEO tactic in the database.

Implementation: every published page needs a visible author byline, a credentials statement (specific expertise, not vague), a link to an author page with documented background, and, for Google AI Overviews specifically, Person schema in the article markup.

2. Branded web mentions (Tier 1 by predictive correlation)

Ahrefs analysed 75,000 brands and found that branded web mentions (references to a brand by name across third-party publications, with or without a hyperlink) have a Spearman correlation of 0.664 with AI Overview citation rates. Backlinks correlated at 0.218. That is a 3x difference in predictive strength between the dominant Google signal and the dominant GEO signal.

Implementation: digital PR, expert interviews, being cited as a source in industry publications, brand mentions in aggregator and round-up articles. The goal is text mentions across diverse sources, not link acquisition.

3. Content structure: heading structure and extractability

AirOps analysis of 815,484 AI-retrieved pages found that 7 to 20 subheadings with 500 to 2,000 words produced the highest consistent per-query citation rate in ChatGPT. Over-structured content (too many headings) and under-structured content (prose without internal navigation) both underperformed.

Microsoft's Bing documentation corroborates this: "strong descriptive headings are signals that help AI know where a complete idea starts and ends." Each H2 section should be independently extractable: it should make sense as a standalone answer to the sub-question it addresses.

4. Specific, sourced claims with statistics

The Princeton study found that adding statistics to content increased AI visibility by 40%. Growth Memo analysis of 21,482 ChatGPT citations found DATE and NUMBER are the two strongest positive entity signals in a page's first 1,000 characters. Specific, verifiable claims create extraction points that AI systems can cite confidently.

The implementation is explicit: cite the source of every statistic, include the study methodology and sample size where known, name the specific percentage or count rather than approximating. Content that reads like it could be cited in a research paper (specific, attributed, falsifiable) performs better than content that reads like marketing copy.

5. Content freshness

Perplexity weights content from the last 30 days most heavily. ChatGPT with browsing enabled shows recency bias: Amsive analysis found 50% of AI-cited content is under 13 weeks old. Content updated with new data or revised findings outperforms equivalent content on the same topic with an older date.

GEO tactics with weak evidence

Three tactics are widely recommended for GEO but are poorly supported by the research:

Tactic	Recommendation prevalence	Evidence quality	Verdict
FAQ schema markup	Very high	AccuraCast: 1.8% of cited pages	Not a meaningful AI citation driver
Meta description	High	Writesonic: 0/6 crawler readability	Not read by AI crawlers
Open Graph tags	High	Writesonic: 0/6 crawler readability	Not read by AI crawlers
JSON-LD structured data	Very high	Writesonic: 0/6 crawler readability	Indirect benefit via Google only
Word count maximisation	Moderate	Mixed (rate vs volume trade-off)	Context-dependent

Source: AccuraCast (9,000 citation sources), Writesonic (62 elements, 6 crawlers), AirOps (n=815,484)

The AirOps dataset (815,484 pages) and a Writesonic crawl testing 62 webpage elements across six AI crawlers both confirm that meta descriptions, Open Graph tags, and JSON-LD schema markup are not read by most AI crawlers directly. An AccuraCast study of 9,000 AI-cited pages found FAQ schema present on just 1.8% of them, despite being the most commonly recommended structured data type for AEO.

How GEO differs by AI platform

GEO is not uniform across AI systems. Platform architecture determines which signals reach the model.

Google AI Overviews builds on Google Search infrastructure. Traditional SEO signals (backlinks, domain authority, structured data) transfer more strongly here than to any other AI platform. Person schema and Article schema are relevant for Google AI Overviews in a way they are not for ChatGPT or Perplexity.

ChatGPT (GPT-4o with browsing) shows the weakest correlation with backlinks and the strongest preference for branded web mentions and author credentials. It also requires explicit GPTBot permission in robots.txt, making AI crawlability a prerequisite for citation.

Perplexity weights content freshness most aggressively of the four major platforms. It also produces the highest source overlap with Google (15.2%) of the ChatGPT/Claude/Gemini group, suggesting Google ranking transfers to Perplexity better than to other platforms.

Gemini uses Google infrastructure and shows citation patterns closer to Google AI Overviews than to ChatGPT or Perplexity. Standard SEO and GEO signals both apply.

How to measure GEO performance

Google Search Console does not capture AI citation data from ChatGPT, Perplexity, or Gemini standalone. Measuring GEO performance requires platform-specific monitoring.

The available approaches: manual query testing (ask target questions directly in each AI platform and record citation sources), AI answer monitoring platforms (track which sources are cited in AI responses to a monitored keyword set), and share-of-voice tracking adapted for AI search. As of mid-2026, no single tool covers all four major platforms with equivalent depth.

The minimum viable GEO measurement: monthly manual audits of your top 20 target queries across ChatGPT, Perplexity, and Google AI Overviews, recording which pages are cited and which competitors appear. This gives a trend line without requiring specialised tooling.

The bottom line

Generative engine optimisation is a distinct discipline from SEO. The source pools barely overlap. The signals differ. The measurement approach is different. The tactics that drive Google ranking (backlinks, keyword density, FAQ schema) are weak or irrelevant for AI citation rates.

The tactics that drive GEO performance (author attribution, branded web mentions, heading structure, specific sourced claims) are well-evidenced across multiple independent studies and should be treated as the foundational layer of any content strategy aimed at AI search visibility.

→ChatGPT SEO: what the controlled research actually shows →E-E-A-T for AI search: where it diverges from traditional Google SEO →GPT-4o and Google share only 4% of sources — what this means for your strategy →AI crawlability audit: 62 webpage elements tested across 6 crawlers

Frequently asked questions

What is generative engine optimisation (GEO)?

Generative engine optimisation (GEO) is the practice of optimising content to be selected as a citation source in AI-generated answers from systems like ChatGPT, Perplexity, Google AI Overviews, and Gemini. It was formally defined in a peer-reviewed Princeton study (2024) that tested nine optimisation strategies across 9,679 queries. GEO differs from SEO in that AI systems select sources through retrieval mechanisms that weight author authority, brand recognition, content structure, and factual specificity rather than backlinks and keyword match.

How is GEO different from SEO?

SEO targets Google's ranking algorithm (PageRank, domain authority, backlinks, keyword density). GEO targets AI retrieval mechanisms that select sources for synthesised answers. The core difference: a peer-reviewed arXiv study found GPT-4o's source pool overlaps with Google by only 4%. What predicts Google ranking (backlinks) barely predicts AI citation (0.218 Spearman correlation), while branded web mentions, weaker for Google, have 3x stronger correlation with AI citation rates (0.664). The tactics and the target systems are different.

What is the best-evidenced GEO tactic?

Author credentials and attribution produce the largest single measured impact on AI citation rates. Presence AI tracked 1,200 pages over 90 days and found expert-attributed pages achieved a 72% AI citation rate versus 25% for unattributed pages, a 2.4x difference. This is confirmed across multiple platforms (ChatGPT, Perplexity, Google AI Overviews, Gemini) and multiple independent studies.

Does GEO work for all AI search platforms?

GEO tactics show different performance by platform. Google AI Overviews benefits most from traditional SEO signals because it builds on Google Search infrastructure. Perplexity weights content freshness most aggressively (content updated in the last 30 days). ChatGPT citation is most strongly predicted by branded web mentions and author credentials. Gemini overlaps more with Google ranking than other platforms. Most content-level and authority-level tactics transfer across all platforms; technical tactics may be platform-specific.

What are the most common GEO mistakes?

Three common GEO mistakes: (1) Relying on FAQ schema: it appears in only 1.8% of AI-cited pages despite being the most widely recommended structured data type for AEO. (2) Publishing anonymous content: unattributed pages achieve 2.4x fewer citations than pages with expert author attribution. (3) Applying Google SEO metrics to measure GEO performance: Google Search Console data does not capture ChatGPT or Perplexity citation rates. These require separate monitoring with AI answer tracking tools.

Related tactics in the database

Topical content clusters→E-E-A-T for AI search→Heading structure→Brand entity & mentions→

About the author

BetterAISearch Editorial Team

BetterAISearch

The BetterAISearch team synthesises peer-reviewed studies, platform documentation, and independent research into actionable, scored tactics.

←All research posts