The Complete GEO Optimization Checklist
This checklist covers every major optimisation that improves your website's visibility in AI-generated answers. Each step is backed by research from Princeton University, Georgia Institute of Technology, Gartner, and real-world AI citation analysis.
Use this as a systematic guide to audit and improve your site. Most UK businesses score between 30 and 55 on their first GEO assessment — completing even half of these steps can push you above 70.
Technical Accessibility (Steps 1-8)
These steps ensure AI crawlers can physically access and process your content. Research shows fast, crawlable pages receive 3x more AI citations than slow or blocked pages.
1. Create and deploy a robots.txt file
Your robots.txt must exist at yourdomain.com/robots.txt. Without it, AI crawlers have no explicit permission signals. Include specific Allow directives for AI user agents.
2. Allow GPTBot in robots.txt
Add User-agent: GPTBot / Allow: / to your robots.txt. OpenAI's GPTBot is the primary crawler for ChatGPT, which commands 17% of all web queries globally.
3. Allow ClaudeBot in robots.txt
Add User-agent: ClaudeBot / Allow: / — Anthropic Claude is the second most popular AI assistant and growing rapidly in the UK professional market.
4. Allow PerplexityBot in robots.txt
Perplexity AI has become the go-to research tool for millions. Blocking PerplexityBot means your site cannot appear in its cited sources.
5. Allow Google-Extended in robots.txt
Google-Extended controls whether your content is used for Gemini and Google AI Overviews. With Gemini capturing 18.2% AI chatbot market share, this is critical.
6. Create and deploy llms.txt
Create an llms.txt file at your domain root following the specification at llmstxt.org. This is the AI equivalent of robots.txt but for content discovery. Include your company description, services, key pages, and common questions you answer.
7. Create and submit an XML sitemap
Your sitemap should include every public page with accurate lastmod dates. AI crawlers use sitemaps to discover and prioritise content for re-crawling.
8. Ensure server-side rendering
If your site uses React, Vue, or Angular, ensure server-side rendering is configured. AI crawlers cannot execute JavaScript — content that requires client-side rendering is invisible to them. Verify by viewing your page source (not the rendered DOM).
Schema & Structured Data (Steps 9-17)
JSON-LD schema markup helps AI systems understand your content structure. HubSpot research shows properly marked-up pages receive up to 30% more AI citations.
9. Add Organization schema
Every site needs an Organization schema with name, url, logo, description, sameAs links (social profiles), and contactPoint. This establishes your entity in knowledge graphs.
10. Add WebSite schema with SearchAction
The WebSite schema tells AI systems that your site is a searchable resource. Include a SearchAction with your URL template.
11. Add FAQPage schema to FAQ content
FAQPage schema makes your Q&A content trivially extractable by AI systems. Princeton University research shows this produces a +40% citation improvement for FAQ content.
12. Add Article schema to blog posts
Every blog post should have Article schema with headline, author (as Person, not Organization), datePublished, dateModified, and publisher fields.
13. Add BreadcrumbList schema
BreadcrumbList schema helps AI systems understand your site hierarchy. Add it to all interior pages with proper nesting.
14. Add Product schema (if applicable)
If you sell products, Product schema with name, price, currency, and availability helps AI assistants include your products in recommendation responses.
15. Add HowTo schema (if applicable)
If your pages contain step-by-step instructions, HowTo schema makes each step individually extractable by AI systems.
16. Include 3+ sameAs links in Organization schema
Link to your social profiles (LinkedIn, Twitter/X, GitHub, etc.) via sameAs. This helps AI systems verify your entity across platforms. Wikipedia and Wikidata links are especially valuable — Wikipedia accounts for 47.9% of ChatGPT citations.
17. Validate all schema with Google Rich Results Test
Test every schema implementation. Incomplete or invalid schema can be worse than no schema at all — it signals carelessness to AI systems evaluating your authority.
Content Quality & E-E-A-T (Steps 18-28)
Content quality carries the highest weight in GEO scoring. Moz research confirms that E-E-A-T signals directly influence AI citation decisions.
18. Add statistics to your content (5+ per 1000 words)
Princeton University research shows that adding statistics and data points to content improves AI visibility by +22%. Aim for at least 5 specific data points per 1000 words.
19. Include expert quotations with attribution
Direct quotes from named experts produce a +37% AI visibility boost according to Princeton research. Use blockquote elements with clear attribution.
20. Add citations and source references
Cite your sources inline. Reference research papers, government data, industry reports, and authoritative organisations. Princeton found this produces a +40% visibility improvement.
21. Build named entity density
Mention specific organisations, people, places, and products by their full names. AI systems rely on entity recognition to connect your content to knowledge graphs. Aim for 5+ unique named entities per 1000 words.
22. Add author bios with credentials
Assign named authors with professional credentials to your content. Backlinko analysis shows that 96% of AI Overview citations come from sources with strong E-E-A-T signals.
23. Create a comprehensive FAQ section (5+ questions)
McKinsey reports that AI assistants most frequently access homepage and FAQ pages when evaluating a business. Include at least 5 well-answered questions.
24. Maintain a blog or resources section
Regular, substantive content publishing builds topical authority. AI systems favour domains that demonstrate ongoing expertise.
25. Create a detailed About page
Your About page should include team credentials, company history, mission statement, and contact information. This is a core E-E-A-T signal.
26. Link to authoritative external sources
Link to .gov, .edu, .org, Wikipedia, and reputable news sources. This signals confidence in your own content and positions you within credible networks.
27. Create deep content (1000+ words) on key topics
Comprehensive coverage signals expertise. At least your primary topic pages should exceed 1000 words with structured, scannable formatting.
28. Maintain accessible readability (grade 8-10)
Target a Flesch-Kincaid reading level of grade 8-10. Content that is too complex (grade 13+) is harder for AI systems to extract clean answers from.
Content Structure (Steps 29-37)
How you format content determines whether AI systems can extract and cite it. Search Engine Journal data shows that 44.2% of AI citations come from the first 30% of a page.
29. Use exactly one H1 per page
Your H1 is the primary topic signal. Multiple H1s confuse AI systems about what the page is actually about.
30. Maintain proper heading hierarchy
Follow H1 → H2 → H3 order without skipping levels. This creates a machine-readable content outline that AI systems use for extraction.
31. Keep paragraphs at 50-150 words
Princeton research identifies 50-150 words as the ideal paragraph length for AI extraction. Longer paragraphs are harder to cite verbatim.
32. Create answer capsules (40-100 word blocks)
An answer capsule is a self-contained paragraph that can be cited without any surrounding context. Place these after question-format headings for maximum extractability.
33. Use question-based headings
Headings like "What is GEO?" or "How do AI crawlers work?" directly match how users query AI assistants. Use at least 3 question-format headings per content page.
34. Put key answers in the first 30% of content
Since 44.2% of AI citations come from the first third of a page, lead with your most important and citable information.
35. Use numbered and bulleted lists
Listicle-format content receives 43.8% of all ChatGPT citations — the highest of any content format. Use numbered lists for processes and bulleted lists for features.
36. Add descriptive alt text to images
AI systems process image context through alt text. Use descriptive, keyword-rich alt attributes on all images.
37. Write meta descriptions for every page
Every page needs a unique 50-160 character meta description. AI systems use these as content summaries when deciding relevance.
Brand & Authority (Steps 38-43)
Brand signals determine whether AI systems recognise your business as a citable entity. Forrester research indicates that entity recognition is the single strongest predictor of AI citation.
38. Pursue Wikipedia and Wikidata presence
Wikipedia accounts for 47.9% of ChatGPT citations. If your business qualifies for a Wikipedia article, this is the highest-impact GEO action available.
39. Add 3+ social media profiles
Link to at least 3 social platforms (LinkedIn, Twitter/X, GitHub, etc.) from your website and reference them in your Organization schema sameAs property.
40. Display contact information prominently
Show your email, phone number, and physical address on your website. These are fundamental trust signals that AI systems use for entity verification.
41. Create a privacy policy and terms of service
Legal pages signal a legitimate, professional business. Both should be linked from your footer and included in your sitemap.
42. Ensure HTTPS is enabled
HTTPS is a baseline trust signal. AI systems may deprioritise content served over insecure HTTP connections.
43. Maintain brand name consistency across pages
Use your exact brand name consistently in page titles, headings, and schema. Inconsistency confuses entity resolution in AI knowledge graphs.
Content Freshness (Steps 44-47)
AI systems have a strong recency bias. HubSpot data shows that 65% of AI crawler hits target content published within the last year.
44. Add dateModified to Article schema
Include a dateModified field in every Article schema and update it whenever you make substantive content changes. AI crawlers use this to prioritise fresh content.
45. Display visible "Last updated" dates
Show "Last updated: [date]" or "Updated [Month Year]" prominently in your content body — not just in schema. AI systems look for visible freshness signals in page text.
46. Update sitemap lastmod dates
When content changes, update the corresponding lastmod value in your sitemap. Stale lastmod dates signal an unmaintained site.
47. Publish blog content regularly
Aim for at least one substantive blog post per month. Gaps longer than 3-6 months cause sharp drops in AI citation frequency.
Conversational Readiness (Steps 48-50)
These final steps ensure your content matches how people query AI assistants.
48. Create comparison content
Users frequently ask AI assistants comparison questions ("X vs Y", "best tools for..."). Create dedicated comparison pages for your competitive landscape.
49. Use natural language throughout
Avoid keyword stuffing. Princeton research shows that keyword-stuffed content receives a -10% visibility penalty in AI-generated responses. Write naturally.
50. Anticipate follow-up questions
Include "Related questions", "See also", or "Next steps" sections that anticipate what a user might ask after reading your content. This keeps AI systems within your content ecosystem.
Tracking Your Progress
After implementing these steps, measure your progress by:
- Running a free GEO analysis to establish your baseline score
- Implementing changes in priority order (highest-weight categories first)
- Re-scanning after 2-4 weeks to measure improvement
- Testing your target queries against ChatGPT, Perplexity, and Gemini directly
Most UK businesses see measurable improvement in AI citations within 4-6 weeks of implementing the technical and structural changes. Content and entity authority changes take longer — 1-3 months is realistic for meaningful citation improvements.
Check your GEO score for free
See how your website scores across all 8 GEO categories. Takes 60 seconds.
Get your free GEO score →